JPH04218887A

JPH04218887A - Character data concatenator

Info

Publication number: JPH04218887A
Application number: JP3069941A
Authority: JP
Inventors: Kaoru Nakabayashi; 薫中林
Original assignee: Seiko Epson Corp
Current assignee: Seiko Epson Corp
Priority date: 1990-06-26
Filing date: 1991-04-02
Publication date: 1992-08-10
Anticipated expiration: 2015-07-10
Also published as: JP3061066B2

Abstract

PURPOSE: When parts of multiple pieces of character data overlap, to find the overlapping portion automatically, remove one copy of it, and concatenate the character data, thereby sparing the user the manual work of deleting the overlap, preventing correction errors, and saving time and labor. CONSTITUTION: The upper half of printed data 80, such as a letter, is read with a handy scanner to obtain image data 81. The image data 81 is analyzed by a character recognition section 90 to generate a first character data group 120. Likewise, image data 82 is obtained from the lower half of the printed data 80 and analyzed by the character recognition section 90 to generate a second character data group 121. The first character data group 120 is stored in a first character data storage section 10, and the second character data group 121 in a second character data storage section 20. Because the third and fourth rows of the stored first character data group 120 overlap the first and second rows of the second character data group 121, one copy of the overlap is removed in subsequent processing.

Description

[Detailed description of the invention]

[0001]

[Field of Industrial Application] The present invention relates to a character data concatenation device.

[0002]

[Prior Art] Conventionally, character data has been concatenated simply by joining the end of a first character data group to the beginning of a second character data group.

[0003]

[Problems to Be Solved by the Invention] Simple concatenation, however, can be inconvenient. For example, when a long text such as a magazine article is read in several passes with a handy scanner and the resulting image data is converted to character data by character recognition means, part of the character data obtained from one pass may duplicate part of the character data obtained from the next pass. If the character data is simply concatenated in such a case, the duplicated portion has to be found and deleted by hand afterwards.

[0004] An object of the present invention is to provide a character data concatenation device that, even when part of a first character data group overlaps part of a second character data group, automatically searches for the overlapping portion, removes one copy of it, and concatenates the two character data groups.

[0005]

[Means for Solving the Problems] The character data concatenation device of the first invention comprises: first and second character data storage sections whose data can be accessed in arbitrary partial data units; detection means that compares a first character data group stored in the first character data storage section with a second character data group stored in the second character data storage section and detects overlapping partial data; and concatenation means that removes one copy of the overlapping partial data detected by the detection means and concatenates the first character data group and the second character data group.

[0006] The character data concatenation device of the second invention comprises: first and second character data storage sections whose data can be accessed in arbitrary partial data units; reading means that reads the last partial data, viewed in the row or column direction, of the first character data group stored in the first character data storage section and an arbitrary partial data of the second character data group stored in the second character data storage section, and that issues an end signal after reading that arbitrary partial data when it is the last partial data of the second character data group; collation means that collates the contents of the last partial data of the first character data group and the arbitrary partial data of the second character data group read by the reading means; control means that, when the collation result is a match, issues a signal indicating the partial data following the matched partial data of the second character data group, that, when the collation result is a mismatch and no end signal has been received from the reading means, instructs the reading means to read the next partial data of the second character data group as the arbitrary partial data, and that, when the collation result is a mismatch and the end signal has been received from the reading means, issues a signal indicating the first partial data of the second character data group; and concatenation means that reads the entire first character data group from the first character data storage device, reads from the second character data storage device the range from the partial data indicated by the signal from the control means through the last partial data, and concatenates the second character data group after the first character data group.

[0007] Furthermore, the character data concatenation device of the third invention comprises: first and second character data storage sections whose data can be accessed in arbitrary partial data units; reading means that reads the last one or more partial data, viewed in the row or column direction, of the first character data group stored in the first character data storage section, reads the same number of partial data from the beginning of the second character data group stored in the second character data storage section, and issues an end signal after the read when it has read through the last partial data of the second character data group; collation means that collates the contents of the one or more partial data of the first character data group and of the second character data group read by the reading means; control means that, when the collation result is a match, issues a signal indicating the partial data following the last partial data read from the second character data group, that, when the collation result is a mismatch and no end signal has been received from the reading means, instructs the reading means to increase the number of partial data to be read by one, and that, when the collation result is a mismatch and the end signal has been received from the reading means, issues a signal indicating the first partial data of the second character data group; and concatenation means that reads the entire first character data group from the first character data storage device, reads from the second character data storage device the range from the partial data indicated by the signal from the control means through the last partial data, and concatenates the second character data group after the first character data group.

[0008]

[Operation] In the character data concatenation device of the first invention, the detection means detects the portion in which the first character data group stored in the first character data storage section overlaps the second character data group stored in the second character data storage section. The concatenation means then removes one copy of this overlapping portion and concatenates the first character data group and the second character data group into a single piece of character data.

[0009] In the character data concatenation device of the second invention, the reading means first reads the last partial data, viewed in the row or column direction, of the first character data group stored in the first character data storage section. It also reads an arbitrary partial data of the second character data group stored in the second character data storage section. Next, the collation means collates the contents of the last partial data of the first character data group with the contents of the arbitrary partial data of the second character data group. If the collation result is a match, a signal indicating the partial data following the matched partial data of the second character data group is issued. If the collation result is a mismatch and no end signal has been received from the reading means, the reading means is instructed to read the next partial data of the second character data group as the arbitrary partial data. If the collation result is a mismatch and the end signal has been received from the reading means, a signal indicating the first partial data of the second character data group is issued. The concatenation means then reads the entire first character data group from the first character data storage device, reads from the second character data storage device the range from the partial data indicated by the signal from the control means through the last partial data, and concatenates the second character data group after the first character data group.

[0010] The character data concatenation device of the third invention differs from that of the second invention in that it detects the overlapping portion by collating one or more partial data at a time. That is, the reading means of the third invention reads the last one or more partial data, viewed in the row or column direction, of the first character data group, and reads the same number of partial data from the beginning of the second character data group. These partial data are then collated by the collation means to detect the overlapping portion.

[0011]

[Embodiment] An embodiment covering the first through third inventions is described below with reference to FIGS. 1 to 6. FIG. 1 is a configuration block diagram of this embodiment. The character data concatenation device 1 of this embodiment comprises a first character data storage section 10 in which a first character data group is stored, a second character data storage section 20 in which a second character data group is stored, reading means 30 that reads partial data in units of rows, columns, or character strings from the first character data storage section 10 and the second character data storage section 20, collation means 40 that collates the partial data read by the reading means 30, control means 50 that controls the reading means 30, and concatenation means 60 that concatenates the first character data group and the second character data group. The first character data storage section 10 and the second character data storage section 20 have a data structure that allows data to be accessed in arbitrary partial data units. Examples of such partial data are row-unit partial data, column-unit partial data, phrase-unit (bunsetsu) partial data, and single-character partial data. The character data joined by the concatenation means is stored in a third character data storage section 70.
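As an illustration only, the requirement that a storage section be accessible in arbitrary partial data units could be sketched as follows. The class name `CharacterDataStore` and its methods are hypothetical and do not appear in the specification; whitespace-separated words are used as a stand-in for phrase units, since true phrase segmentation of Japanese text would require linguistic analysis that is outside this sketch.

```python
from itertools import zip_longest


class CharacterDataStore:
    """Hypothetical stand-in for character data storage sections 10 and 20."""

    def __init__(self, text: str):
        self.text = text

    def rows(self) -> list[str]:
        # Row-unit partial data: one entry per line of text.
        return self.text.splitlines()

    def columns(self) -> list[str]:
        # Column-unit partial data: the j-th character of every row.
        # Short rows are padded with spaces (an assumption of this sketch).
        return ["".join(col) for col in zip_longest(*self.rows(), fillvalue=" ")]

    def words(self) -> list[str]:
        # Whitespace-separated words stand in for phrase-unit partial data.
        return self.text.split()

    def characters(self) -> list[str]:
        # Single-character partial data, ignoring line breaks.
        return [ch for ch in self.text if ch not in "\r\n"]
```

With a view of this kind, the reading means can request the same stored text as rows, columns, words, or characters without the storage section having to keep separate copies.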

[0012] Next, the processing of this embodiment is described with reference to the flowchart of FIG. 2. First, the upper half of printed data 80, such as a letter, is read with a handy scanner to obtain image data 81 (step 100). The image data 81 is analyzed by the character recognition section 90 to generate a first character data group 120 (step 101). The lower half of the printed data 80 is then read with the handy scanner to obtain image data 82 (step 102). The image data 82 is analyzed by the character recognition section 90 to generate a second character data group 121 (step 103). The first character data group 120 is stored in the first character data storage section 10, and the second character data group 121 in the second character data storage section 20. Because the third and fourth rows of the stored first character data group 120 overlap the first and second rows of the second character data group 121, one copy of the overlap is removed in the subsequent processing.

[0013] To remove the overlapping rows, the reading means 30 first reads character data one row at a time, a row being the partial data, from the first character data storage section 10 and the second character data storage section 20. Specifically, a search counter indicating a row number in the second character data group 121 is first set to "1" (indicating the first row) (step 104). Next, the last row of the first character data group 120 is read (step 105). Then, using the search counter set in step 104, one row of the second character data group 121 is read (step 106), and after the read the search counter is incremented by 1 (step 107). Steps 106 through 111 form a loop in which the search counter is used to read character data from the second character data group 121 in order, starting from its first row. If this processing has already read through the last row of the second character data group 121, so that character data can no longer be read correctly in step 106 (step 108), the search counter is reset to "1" (step 109) and processing passes to the concatenation means. If the last row has not yet been passed, processing moves to the collation means 40, which collates the character data read from the first character data group 120 with the character data read from the second character data group 121 (step 110). If the character data match, processing moves to the concatenation means 60 (step 111). If they do not match, processing returns to step 106 and the next row of the second character data group 121 is read. The concatenation means 60 concatenates all rows of the first character data group with the rows of the second character data group from the row indicated by the search counter onward (step 112).
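The loop of steps 104 to 112 can be sketched roughly as follows. This is an illustrative reading of the flowchart, not the implementation of the embodiment: the function name `concatenate_rows` is hypothetical, the character data groups are modeled as Python lists of row strings, and the failed read of step 108 is modeled as a bounds check made before the read rather than after it. The guard for an empty first group is an addition of this sketch.

```python
def concatenate_rows(first_rows: list[str], second_rows: list[str]) -> list[str]:
    """Join two row lists, dropping rows of the second list up to and
    including the row that equals the last row of the first list."""
    if not first_rows:
        # Not covered by the flowchart: nothing to search for.
        return list(second_rows)

    counter = 1                      # step 104: 1-based search counter
    last_row = first_rows[-1]        # step 105: last row of the first group

    while True:
        if counter > len(second_rows):
            # steps 108-109: the read would fail, so no overlap was found;
            # reset the counter so that the whole second group is appended.
            counter = 1
            break
        candidate = second_rows[counter - 1]   # step 106: read one row
        counter += 1                           # step 107: advance the counter
        if candidate == last_row:              # steps 110-111: collate
            break                              # counter now points past the match

    # step 112: all of the first group, then the second group from the
    # row indicated by the counter onward.
    return first_rows + second_rows[counter - 1:]
```

For a first group `["A", "B", "C", "D"]` and a second group `["C", "D", "E", "F"]`, the match is found on the second row of the second group, the counter ends at 3, and rows "E" and "F" are appended.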

[0014] To make the collation more accurate, the comparison may be performed on multiple rows. In this case, step 105 reads as many rows as the search counter indicates, working back from the last row of the first character data group. Likewise, step 106 reads as many rows as the search counter indicates, starting from the first row of the second character data group. Step 110 then collates these multiple rows of the first character data group against those of the second character data group.
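The following sketch illustrates, under stated assumptions, why a multi-row comparison can be more reliable than a single-row one. Both function names are hypothetical, rows are modeled as list elements, and the upper bound on the number of rows compared (the shorter of the two groups) is an assumption of the sketch rather than something the specification states.

```python
def join_by_last_row(first: list[str], second: list[str]) -> list[str]:
    """Single-row check: search the second group for the last row of the first."""
    if not first:
        return list(second)
    for i, row in enumerate(second):
        if row == first[-1]:
            return first + second[i + 1:]
    return first + second


def join_by_row_block(first: list[str], second: list[str]) -> list[str]:
    """Multi-row check: compare the last k rows of the first group with the
    first k rows of the second group, growing k until they match."""
    for k in range(1, min(len(first), len(second)) + 1):
        if first[-k:] == second[:k]:
            return first + second[k:]
    return first + second


# A separator row "-" occurs in both groups although they do not overlap.
first = ["A", "B", "-"]
second = ["C", "-", "D"]
print(join_by_last_row(first, second))   # row "C" is lost
print(join_by_row_block(first, second))  # all rows are kept
```

The single-row check treats the coincidental separator as the overlap and discards row "C", whereas the block comparison finds no block of rows common to the end of one group and the start of the other and falls back to plain concatenation. The block comparison reduces such false matches but does not eliminate them, since the smallest matching block is accepted.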

[0015] Next, examples of character data concatenated by this embodiment are described with reference to the character data concatenation diagrams of FIGS. 3 to 5.

[0016] FIG. 3(a) shows an example of concatenation in which the third and fourth rows of the first character data group overlap the first and second rows of the second character data group. In this example, the third row of the second character data group is joined to the end of the first character data group to create the third character data. The same applies to FIG. 3(b).

[0017] FIG. 4(a) shows an example of concatenation in which the third to fifth columns of the first character data group overlap the first to third columns of the second character data group. In this example, the fourth column of the second character data group is joined to the end of the first character data group to create the third character data. FIG. 4(b) shows an example of concatenation in which the 17th to 19th columns of the first character data group overlap the first to third columns of the second character data group. In this example as well, the fourth column of the second character data group is joined to the end of the first character data group to create the third character data.
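A column-direction concatenation of this kind could be sketched as below. The function name `concatenate_columns` is hypothetical, and the sketch assumes that both groups have the same number of rows and that every row within a group has the same length; the specification does not state how ragged data would be handled. Overlap is detected by comparing blocks of columns rather than a single column.

```python
def concatenate_columns(first_rows: list[str], second_rows: list[str]) -> list[str]:
    """Join two equally tall blocks of text side by side, removing the
    columns at the start of the second block that repeat the end of the first."""
    # Transpose each group so that one list element corresponds to one column.
    first_cols = list(zip(*first_rows))
    second_cols = list(zip(*second_rows))

    # Smallest k such that the last k columns of the first group equal the
    # first k columns of the second group; 0 if there is no such k.
    limit = min(len(first_cols), len(second_cols))
    k = next(
        (n for n in range(1, limit + 1) if first_cols[-n:] == second_cols[:n]),
        0,
    )

    # Keep every column of the first group, then the non-overlapping columns
    # of the second group, and transpose back into rows.
    merged = first_cols + second_cols[k:]
    return ["".join(row) for row in zip(*merged)]


# Columns 3-5 of the first group repeat columns 1-3 of the second group.
print(concatenate_columns(["abcde", "fghij"], ["cdeXY", "hijZW"]))
```

In the usage line the three shared columns are detected, so only the fourth and fifth columns of the second group are appended to each row of the first.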

[0018] FIG. 5(a) shows an example of concatenation in which the character string "初夏の候" ("the early summer season") of the first character data group does not exist in the second character data group. In this example, the beginning of the second character data group is joined to the end of the first character data group to create the third character data. FIG. 5(b) shows an example of concatenation in which the character string "Epson", the sixth word of the first character data group, duplicates the fourth word of the second character data group. In this example, the fourth word of the second character data group is joined to the end of the first character data group to create the third character data.

[0019] Next, the hardware configuration used when this embodiment is implemented on a personal computer is described with reference to FIG. 6. The personal computer 210 comprises a CPU 211 that operates as the control means 50, a RAM 212 that stores programs such as the reading means 30 and the collation means 40, and a RAM 213 that stores system programs such as the BIOS. The personal computer 210 is also provided with a CRT 220 that displays the first character data group, the second character data group, and the like, a keyboard 221 through which the user enters commands, a handy scanner 222 for inputting magazine articles and the like, and a magnetic disk 223 that stores character data input through the handy scanner 222 or other means.

[0020] This embodiment is particularly effective when concatenating multiple pieces of character data input with the handy scanner 222. Because the input section of the handy scanner 222 is narrow, the scanner usually has to be passed over the text several times. In doing so, it is common practice to overlap the edges of successive passes so that nothing is missed. With this embodiment, the overlapping portions that arise in this way can be deleted automatically.
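The embodiment describes concatenating two character data groups. When a text has been read in more than two passes, one possible extension, offered here only as a sketch, is to fold the pairwise concatenation over the groups in scan order. The names `join_pair` and `join_all` are hypothetical, and the pairwise step uses the block comparison of trailing and leading units.

```python
from functools import reduce


def join_pair(first: list[str], second: list[str]) -> list[str]:
    """Concatenate two groups, dropping the leading units of the second
    that repeat the trailing units of the first."""
    limit = min(len(first), len(second))
    k = next((n for n in range(1, limit + 1) if first[-n:] == second[:n]), 0)
    return first + second[k:]


def join_all(groups: list[list[str]]) -> list[str]:
    """Concatenate any number of groups, in order, by repeated pairwise joins."""
    return reduce(join_pair, groups, [])


# Three passes whose edges overlap by one row each.
passes = [["a", "b", "c"], ["c", "d"], ["d", "e", "f"]]
print(join_all(passes))
```

Each intermediate result plays the role of the first character data group for the next pass, so the result of one join becomes the input of the following one.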

[0021] In this embodiment, whether rows, columns, or character strings are used as the partial data may be decided by a command from the user, or two or more kinds of partial data may be used in combination. One example of combined use is to collate overlapping portions first in row units and then in column units before concatenating.
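One possible reading of this combined use is sketched below: the units to try are given in order (for instance by a user command), and the first unit for which an overlap is found is used for the concatenation. This interpretation, the names `overlap`, `SPLITTERS`, and `concatenate_text`, and the use of row and word units on flat strings instead of row and column units are all assumptions of the sketch.

```python
def overlap(a: list[str], b: list[str]) -> int:
    """Number of leading units of b that repeat the trailing units of a."""
    for k in range(1, min(len(a), len(b)) + 1):
        if a[-k:] == b[:k]:
            return k
    return 0


# For each unit: how to split text into partial data and how to rejoin it.
SPLITTERS = {
    "row": (str.splitlines, "\n"),
    "word": (str.split, " "),
}


def concatenate_text(first: str, second: str, units=("row", "word")) -> str:
    """Try each unit in turn and concatenate using the first one that
    reveals an overlap; otherwise fall back to simple concatenation."""
    for unit in units:
        split, separator = SPLITTERS[unit]
        a, b = split(first), split(second)
        k = overlap(a, b)
        if k:
            return separator.join(a + b[k:])
    return first + second


print(concatenate_text("l1\nl2\nl3", "l2\nl3\nl4"))
print(concatenate_text("we make Epson", "Epson printers\nwork"))
```

In the second usage line no whole row is shared, so the word-unit comparison is what detects the repeated "Epson". Rejoining words with single spaces discards the original line breaks, which a fuller implementation would need to preserve.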

[0022]

[Effects of the Invention] With the character data concatenation device of the present invention, when parts of multiple pieces of character data overlap, the overlapping portion can be found automatically, one copy of it removed, and the character data concatenated.

[0023] Consequently, the user no longer needs to delete overlapping portions by hand when concatenating multiple pieces of character data. This prevents correction errors and also saves time and labor.

[0024] In particular, the character data concatenation device of the present invention is effective in an application in which a long text such as a magazine article is read in several passes with a handy scanner, the resulting image data is converted to character data by character recognition means, and the multiple pieces of character data thus obtained are concatenated to reproduce the original text.

[Brief description of the drawings]

FIG. 1 is a configuration block diagram of this embodiment.

FIG. 2 is a flowchart showing the processing of this embodiment.

FIG. 3 is a conceptual diagram showing character data concatenation processing in this embodiment.

FIG. 4 is a conceptual diagram showing character data concatenation processing in this embodiment.

FIG. 5 is a conceptual diagram showing character data concatenation processing in this embodiment.

FIG. 6 is a hardware configuration diagram of this embodiment.

[Explanation of symbols]

1 ... Character data concatenation device
10 ... First character data storage section
20 ... Second character data storage section
30 ... Reading means
40 ... Collation means
50 ... Control means
60 ... Concatenation means

Claims

[Claims]

1. A character data concatenation device comprising: first and second character data storage sections whose data can be accessed in arbitrary partial data units; detection means for comparing a first character data group stored in the first character data storage section with a second character data group stored in the second character data storage section and detecting overlapping partial data; and concatenation means for removing one copy of the overlapping partial data detected by the detection means and concatenating the first character data group and the second character data group.

2. A character data concatenation device comprising: first and second character data storage sections whose data can be accessed in arbitrary partial data units; reading means for reading the last partial data, viewed in the row or column direction, of a first character data group stored in the first character data storage section and an arbitrary partial data of a second character data group stored in the second character data storage section, and for issuing an end signal after reading that arbitrary partial data when it is the last partial data of the second character data group; collation means for collating the contents of the last partial data of the first character data group and the arbitrary partial data of the second character data group read by the reading means; control means for issuing, when the result of the collation means is a match, a signal indicating the partial data following the matched partial data of the second character data group, for instructing the reading means, when the result of the collation means is a mismatch and no end signal has been received from the reading means, to read the next partial data of the second character data group as the arbitrary partial data, and for issuing, when the result of the collation means is a mismatch and the end signal has been received from the reading means, a signal indicating the first partial data of the second character data group; and concatenation means for reading the entire first character data group from the first character data storage device, reading from the second character data storage device the range from the partial data indicated by the signal from the control means through the last partial data, and concatenating the second character data group after the first character data group.

3. A character data concatenation device comprising: first and second character data storage sections whose data can be accessed in arbitrary partial data units; reading means for reading the last one or more partial data, viewed in the row or column direction, of a first character data group stored in the first character data storage section, reading the same number of partial data from the beginning of a second character data group stored in the second character data storage section, and issuing an end signal after the read when it has read through the last partial data of the second character data group; collation means for collating the contents of the one or more partial data of the first character data group and of the second character data group read by the reading means; control means for issuing, when the result of the collation means is a match, a signal indicating the partial data following the last partial data read from the second character data group, for instructing the reading means, when the result of the collation means is a mismatch and no end signal has been received from the reading means, to increase the number of partial data to be read by one, and for issuing, when the result of the collation means is a mismatch and the end signal has been received from the reading means, a signal indicating the first partial data of the second character data group; and concatenation means for reading the entire first character data group from the first character data storage device, reading from the second character data storage device the range from the partial data indicated by the signal from the control means through the last partial data, and concatenating the second character data group after the first character data group.