JPH09244869A

JPH09244869A - Document reading-aloud system

Info

Publication number: JPH09244869A
Application number: JP8053045A
Authority: JP
Inventors: Mitsumasa Tanaka; 三雅田中
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1996-03-11
Filing date: 1996-03-11
Publication date: 1997-09-19
Anticipated expiration: 2016-03-11
Also published as: JP3094896B2

Abstract

PROBLEM TO BE SOLVED: To read a KANA(Japanese syllabary)-KANJI(Chinese character) mixed document aloud as a document creator intends by analyzing processing operation that tag information detected in the KANAKANJI mixed document indicates and reading the KANA-KANJI mixed document aloud according to the processing operation. SOLUTION: A tag retrieval part 12 of a tag information processing part 28 retrieves tag information in the object KANA-KANJI mixed document 11 to be read aloud and a tag information analysis part 13 decides the kind of the retrieved tag information and performs the processing corresponding to the kind of the tag. The read-aloud buffer 21 of read-aloud software (reading- aloud means) 20 stores document data to be read aloud and a text processing part 22 uses a Japanese dictionary 24 where reading information and accent information on words are recorded according to the processing operation that the tag information analyzed by the tag information analysis part 13 to convert the inputted KANA-KANJI mixed document 11 into a pronunciation symbol sequence; and a speech synthesizing process part 23 converts the pronunciation sequence into a speech waveform, and a speech output part 26 outputs the speech waveform.

Description

Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、漢字仮名混じり文
章に対して言語処理を施し、その結果を音声合成する事
により音声として読み上げる文章読み上げ方式に関す
る。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a text-to-speech system in which a text mixed with kanji and kana is subjected to language processing, and the result is voice-synthesized to read as a voice.

【０００２】[0002]

【従来の技術】近年、日本語の漢字仮名混じり文章に対
して言語処理を施し、その処理結果に対して音声合成処
理を行うことにより、音声として読み上げる文章読み上
げ方式に関する技術が実用化されている。2. Description of the Related Art In recent years, a technology relating to a text-to-speech system which reads aloud as a voice has been put into practical use by performing language processing on a sentence mixed with Japanese kanji and kana and performing voice synthesis processing on the processing result. .

【０００３】従来、この種の読み上げ方式では、漢字仮
名混じり文章内の単語に対する読み情報やアクセント情
報等を登録した単語辞書を参照して言語処理が行われ、
その結果を用いて読み上げが行われていた。Conventionally, in this type of reading system, language processing is carried out by referring to a word dictionary in which reading information, accent information, etc. for words in a sentence containing kanji and kana are registered.
The results were read aloud.

【０００４】図１１は、上述した読み上げ方式の従来例
の構成を示すブロック図である。FIG. 11 is a block diagram showing a configuration of a conventional example of the above-mentioned reading system.

【０００５】図１１に示す従来例においては、読み上げ
の対象となる漢字仮名混じり文章１と、単語の読み情報
やアクセント情報を記録している日本語辞書４と、日本
語辞書４を使用して、漢字仮名混じり文章１を発音記号
列に変換するテキスト処理部２と、発音記号列を音声波
形として記録する音声データ５と、発音記号列を音声波
形に変換する音声合成処理部３と、音声波形を出力する
音声出力部６から構成されている。In the conventional example shown in FIG. 11, a sentence 1 containing kanji and kana to be read, a Japanese dictionary 4 in which reading information and accent information of words are recorded, and a Japanese dictionary 4 are used. , A text processing unit 2 for converting a sentence 1 mixed with Kanji and Kana into a phonetic symbol string, voice data 5 for recording a phonetic symbol string as a voice waveform, a voice synthesis processing unit 3 for converting a phonetic symbol sequence into a voice waveform, and a voice. The audio output unit 6 outputs a waveform.

【０００６】図１１に示す構成の動作について説明する
と、漢字仮名混じり文章１は、テキスト処理部２に入力
される。テキスト処理部２では、単語の読みの確定処理
を行う。この単語の読みの確定処理においては、単語辞
書４を参照することにより単語の読みが確定される。To explain the operation of the configuration shown in FIG. 11, a sentence 1 containing kanji and kana is input to the text processing unit 2. The text processing unit 2 carries out a process of confirming the reading of words. In this word reading determination process, the word reading is determined by referring to the word dictionary 4.

【０００７】また、文章内に埋め込まれている特殊単語
情報を取得し、解析することにより単語の読みが確定さ
れる。Further, the reading of the word is fixed by acquiring and analyzing the special word information embedded in the sentence.

【０００８】また、ルビ（振り仮名）付きの単語を抽出
することにより単語の読みが確定される。Further, the word reading is fixed by extracting the word with ruby (furigana).

【０００９】また、ページ内の読みの区切り文字を抽出
し、現在のページ内で読み上げることにより文章と次の
ページで読み上げる文章の区分けが実行される。Further, the reading delimiter character in the page is extracted and read out in the current page, whereby the sentence and the sentence read out in the next page are classified.

【００１０】テキスト処理部２では、読みの確定を行っ
た後に発音記号列に変換し、音声合成部３へ出力する。
音声合成部３では、音声データ５を用いて、発音記号列
を音声波形に変換し、音声出力部６で音声として出力す
る。In the text processing section 2, after the pronunciation is confirmed, the text processing section 2 converts it into a phonetic symbol string and outputs it to the voice synthesizing section 3.
The voice synthesizing unit 3 converts the phonetic symbol string into a voice waveform using the voice data 5, and outputs the voice waveform at the voice output unit 6.

【００１１】なお、読み上げ方式を用いるものにおいて
は、表示部を備え、読み上げと同時に読み上げ文章を表
示するものがある。この種のものにおいては、漢字仮名
混じり文章１がテキスト処理部２へ出力されるとともに
表示部に出力され、表示部はテキスト処理部２における
処理動作に合わせて漢字仮名混じり文章１を１ページ分
づつ表示するように構成される。Some reading methods use a reading system to display a reading sentence simultaneously with reading. In this type, the sentence 1 mixed with kanji and kana is output to the text processing unit 2 and is also output to the display unit, and the display unit corresponds to the processing operation in the text processing unit 2 and one sentence of the sentence 1 mixed with kanji kana is displayed. It is configured to display one by one.

【００１２】漢字仮名混じり文章中の単語には、固有名
詞や特殊読みを必要とする単語等が含まれている場合が
あり、そのような単語が前記の単語辞書に登録されてい
ないときには文章の作者が意図する読みを行えないこと
があった。Words in a mixed Kanji / Kana sentence may include proper nouns or words that require special reading. If such a word is not registered in the above word dictionary, the sentence Sometimes the author could not read what he intended.

【００１３】上記の問題点を解決するための従来例とし
ては、例えば、特開平４−３３１９９８号公報に開示さ
れるように、単語辞書に登録されていない単語に対し
て、読み方やアクセントなどの情報を定義した特殊単語
情報を文中に埋め込むか、脚注のように文章外に記述
し、それを用いることにより読み上げを行う方式があ
る。As a conventional example for solving the above-mentioned problems, for example, as disclosed in Japanese Patent Laid-Open No. 4-331998, reading, accent, etc., of a word that is not registered in a word dictionary are There is a method in which special word information that defines information is embedded in a sentence or described outside a sentence like a footnote and used to read aloud.

【００１４】また、読みが難解な単語、もしくは特別な
読ませ方をさせる単語等に対してルビを振る場合がある
が、このような場合、単語を読み上げた後にルビ自体も
読み上げるという、不自然な読み上げ方を行うことがあ
った。In addition, ruby may be shaken for a word that is difficult to read or a word that causes a special reading. In such a case, ruby itself is read after reading the word. I sometimes read aloud.

【００１５】上記の問題点を解決するための従来例とし
ては、例えば特開平５−１８９１９４号公報や特開平１
−３００３３４号公報に開示されるように、ルビ付きの
単語については読み上げを行わず、ルビのみを読み上
げ、以降にこの単語が出現する場合には、ルビで指定さ
れた読み方で読み上げを行う方式がある。As a conventional example for solving the above-mentioned problems, for example, JP-A-5-189194 and JP-A-1 are available.
As disclosed in Japanese Unexamined Patent Publication No. 300334/1993, a method of not reading a word with ruby, reading only ruby, and when this word appears thereafter, a method of reading according to the reading specified by ruby is used. is there.

【００１６】さらに、ページ単位で文章の読み上げを行
う文章読み上げ方式において、単語がページ間にまたが
っている場合、その単語が辞書に登録されていたとして
も言語処理を行うときには、意味の異なる単語として処
理されてしまい、不自然な読み上げを行うことがあっ
た。Further, in a sentence reading system for reading a sentence on a page-by-page basis, if a word spans pages, even if the word is registered in a dictionary, it will be treated as a word having a different meaning when performing language processing. It was processed and sometimes read unnaturally.

【００１７】このため、ページ内の最後に出現する読み
の区切り文字（例えば、句読点や括弧等と言った記号）
までの文章を読み上げ、残りの文章は次のページと結合
して、改ページされたときに読み上げる方式が考えられ
る。また、ページ内で漢字と平仮名が最後に切り替わる
箇所を区切りと判定する方式も考えられる。Therefore, the reading delimiter that appears at the end of the page (for example, symbols such as punctuation marks and parentheses)
It is conceivable to read out the sentences up to and combine the rest of the sentences with the next page, and then read them out when the page breaks. In addition, a method may be considered in which the part where the kanji and hiragana are switched last in the page is determined as the break.

【００１８】[0018]

【発明が解決しようとする課題】上述した特開平４−３
３１９９８号公報に開示される方式では、特殊単語の表
記情報と音韻、音律情報をコロン（：）で区分し、特殊
単語情報全体をセミコロン（；）で挟んで、処理対象と
なる漢字仮名混じり文章に予め埋め込むこととされてい
るが、特殊単語情報を示す記号（ここではセミコロン）
と同じ記号が、漢字仮名混じり文章の本文中に記述され
ている場合には、その記号を特殊単語情報と判別して処
理するため、読み間違い等の誤動作を起こすという問題
点があった。DISCLOSURE OF THE INVENTION Problems to be Solved by the Invention
In the method disclosed in Japanese Laid-Open Patent Publication No. 31998, the special word notation information, the phoneme, and the temperament information are separated by a colon (:), and the entire special word information is sandwiched by a semicolon (;), and a sentence containing a kanji kana to be processed It is supposed to be embedded in the above, but a symbol that indicates special word information (here, a semicolon)
If the same symbol is described in the text of a sentence mixed with kanji and kana, the symbol is processed as being judged as special word information, which causes a problem such as a reading error.

【００１９】また、特開平５−１８９１９４号公報や特
開平１−３００３３４号公報に開示される方式では、ル
ビが付加されている単語を繰り返し読み上げることを禁
止する制御しか行っておらず、それ以外の単語について
は読み上げを実行してしまうため、括弧や”○”等とい
った記号文字が単語辞書に登録されている場合には、そ
れについても読み上げてしまうことがあり、必ずしも文
章作成者の意図したものとはならないという問題点があ
った。Further, in the methods disclosed in Japanese Patent Laid-Open Nos. 5-189194 and 1-330034, only the control for prohibiting repeated reading of a word to which ruby is added is performed, but otherwise. Because the word is read aloud, if a symbol character such as parentheses or “○” is registered in the word dictionary, it may be read aloud as well. There was a problem that it was not a thing.

【００２０】さらに、ページ単位で読み上げを実行する
ときの読みの区切りの検出方式では、上記の区切り文字
を読みの区切りと判定する場合、区切り文字がページの
終わり近辺に出現しないときには次のページに持ち越す
文字列が多くなってしまう。この場合、メモリが多く必
要になるという問題点があり、さらに、表示部を備える
ものにおいては、表示されるページが次のページに切り
替わってから読み上げるため、表示されていない文章が
長々と読み上げられることとなり、使用者が違和感を感
じるという問題点があった。Further, in the reading delimiter detection method when reading aloud on a page-by-page basis, when the delimiter character is judged to be a reading delimiter, if the delimiter character does not appear near the end of the page, the next page is read. There are many strings to carry over. In this case, there is a problem that a large amount of memory is required. Furthermore, in the case of the one provided with the display unit, the page to be displayed is switched to the next page and then read out. Therefore, there is a problem that the user feels uncomfortable.

【００２１】また、漢字と平仮名が最後に切り替わる箇
所を読みの区切りと判定する場合には、”深い”等のよ
うに漢字と平仮名から構成される単語のとき、”深”
と”い”に分割され、”しん”、”い”と読み上げられ
てしまうため、単語の持つ読み方が損なわれるという問
題点があった。When it is determined that the place where the kanji and hiragana are switched last is a reading delimiter, when the word is composed of kanji and hiragana such as "deep", "deep"
Since it is divided into "i" and read out as "shin" and "i", there is a problem that the reading of words is impaired.

【００２２】本発明は、上述したような従来の技術が有
する問題点に鑑みてなされたものであって、文章作成者
の意図する読み方で文章を読み上げることができるとと
もにページ単位で文章の読み上げを行うときに違和感の
ない読み上げを行うことができる文章読み上げ方式を実
現することにある。The present invention has been made in view of the problems of the above-described conventional techniques, and it is possible to read a sentence according to the reading intended by the sentence creator and read the sentence in page units. It is to realize a text-to-speech system that allows reading without any discomfort.

【００２３】[0023]

【課題を解決するための手段】本発明の文章読み上げ方
式は、処理対象となる漢字仮名混じり文章として、所定
の処理動作を行う旨を示すタグ情報が挿入されたものが
用いられ、１ページ分毎に入力される前記漢字仮名混じ
り文章を読み上げる文章読み上げ方式であって、前記漢
字仮名混じり文章中のタグ情報を検索するタグ情報検索
手段と、検索の結果検出されたタグ情報が示す処理動作
を解析するタグ情報解析手段と、前記タグ情報解析手段
により解析されたタグ情報が示す処理動作に応じて入力
された前記漢字仮名混じり文章を読み上げる読み上げ実
行手段とを有することを特徴とする。According to the text-to-speech system of the present invention, a text containing kanji and kana with tag information indicating that a predetermined processing operation is to be performed is used as a text to be processed. A sentence reading method for reading a sentence containing the kanji and kana mixed for each input, a tag information search means for searching tag information in the sentence containing the kanji and kana, and a processing operation indicated by tag information detected as a result of the search. It is characterized by comprising tag information analyzing means for analyzing and reading-out executing means for reading out the sentence containing the kanji and kana input according to the processing operation indicated by the tag information analyzed by the tag information analyzing means.

【００２４】この場合、タグ情報が、タグ情報の開始を
示す第１の記号と、タグ情報の終了を示す第２の記号
と、各記号間に挟まれた処理動作の種類を示す識別子か
ら構成され、タグ情報検索手段は前記第１および第２の
記号を検索することによりタグ情報の検索を行い、タグ
情報解析手段は前記識別子によりタグ情報が示す処理動
作を解析することとしてもよい。In this case, the tag information is composed of a first symbol indicating the start of the tag information, a second symbol indicating the end of the tag information, and an identifier indicating the type of processing operation sandwiched between the symbols. The tag information search means may search the tag information by searching the first and second symbols, and the tag information analysis means may analyze the processing operation indicated by the tag information by the identifier.

【００２５】また、タグ情報が示す処理動作の種類とし
て読み方を指定する処理があり、タグ情報として、読み
方を指定する旨の識別子および音韻・音律が付された第
１のタグ情報と、前記第１のタグ情報に付された識別子
に処理範囲の終了を示す第３の記号がさらに付された第
２のタグ情報とがあり、読み上げ実行手段は、第１およ
び第２のタグ情報の間の文字列については第１のタグ情
報内の音韻・音律として読み上げることとしてもよい。Further, there is a process for designating the reading as the type of the processing operation indicated by the tag information, and the tag information includes the first tag information to which the identifier for designating the reading and the phoneme / temperament are added, and the first tag information. And the second tag information in which the third symbol indicating the end of the processing range is further added to the identifier attached to the first tag information. The character string may be read aloud as a phoneme or temperament in the first tag information.

【００２６】また、タグ情報が示す処理動作の種類とし
て読み上げを禁止する処理があり、タグ情報として、読
み上げを禁止する旨の識別子が付された第３のタグ情報
と、前記第３のタグ情報に付された識別子に処理範囲の
終了を示す第３の記号がさらに付された第４のタグ情報
とがあり、読み上げ実行手段は、第３および第４のタグ
情報の間の文字列については読み上げないこととしても
よい。Further, there is a process of prohibiting reading aloud as a type of processing operation indicated by the tag information, and the tag information includes third tag information to which an identifier for prohibiting reading aloud is added, and the third tag information. And the fourth tag information in which the third symbol indicating the end of the processing range is further added to the identifier attached to the reading identifier, and the reading execution means is not limited to the character string between the third and fourth tag information. You may not read it aloud.

【００２７】さらに、タグ情報が示す処理動作の種類と
して読みの区切りを示すタグ情報があり、タグ情報とし
て、読みの区切りを示す旨の識別子が付された第５のタ
グ情報が用いられ、読み上げ実行手段は、第５のタグ情
報以降の文字列については連続的な読み上げを行わず、
次に入力される漢字仮名混じり文章とともに読み上げる
こととしてもよい。Further, there is tag information indicating a reading break as the type of processing operation indicated by the tag information, and the fifth tag information with an identifier indicating the reading break is used as the tag information and read out. The execution means does not continuously read the character strings after the fifth tag information,
It may be read aloud together with a sentence containing a mixture of kanji and kana to be input next.

【００２８】「作用」上記のように構成される本発明に
おいては、タグ情報が示す処理に応じた読み上げが読み
上げ実行手段によって行われるので、様々な読み上げ処
理を行うことが可能となっている。[Operation] In the present invention configured as described above, since the reading execution means performs the reading according to the processing indicated by the tag information, various reading processing can be performed.

【００２９】タグ情報の開始と終了は第１の記号、第２
の記号で示されるが、タグ情報の処理の種類を識別子に
よって行われるので、漢字仮名混じり文章中の第１の記
号、第２の記号については識別子が付与されているか否
かにより確認することができる。The start and end of the tag information are the first symbol and the second symbol.
The type of processing of tag information is performed by an identifier, so it is possible to confirm whether the first symbol and the second symbol in a sentence mixed with Kanji and Kana have an identifier. it can.

【００３０】タグ情報が示す処理としては、読み方指
定、読み上げ禁止または読みの区切り等が挙げられる。
読み方指定、または読み上げ禁止の場合には、第１のタ
グ情報と第２のタグ情報の間の漢字仮名混じり文章、ま
たは、第３のタグ情報と第４のタグ情報の間の漢字仮名
混じり文章についてのみ行われるが、上記タグ情報と同
じ記号が、漢字仮名混じり文章中に記述されている場合
でも、その記号を上記タグ情報と間違うことがないた
め、誤動作を防ぐことができる。Examples of the processing indicated by the tag information include reading specification, reading prohibition, reading breaks, and the like.
When reading is specified or reading is prohibited, a sentence containing kanji kana characters between the first tag information and the second tag information, or a sentence containing kanji kana characters between the third tag information and the fourth tag information However, even if the same symbol as the above tag information is described in a sentence containing kanji and kana, the symbol is not mistaken for the above tag information, so that a malfunction can be prevented.

【００３１】また、読み方が難解な単語や特別な読み方
を行う単語を読み問違えることが無くなるため、文章作
成者の意図した読み方で音声合成がなされる。Further, since it is possible to avoid misinterpretation of a difficult-to-read word or a word to be read specially, the voice synthesis is performed according to the reading intended by the sentence creator.

【００３２】また、ルビが付加された単語を繰り返し読
み上げたり、括弧や”○”等の記号文字の読み上げを禁
止することができるため、聞き取りやすい音声合成がな
される。Further, it is possible to read aloud a word to which ruby is added repeatedly, and to prohibit reading of symbol characters such as parentheses and "○", so that speech synthesis that is easy to hear is performed.

【００３３】また、単語がページにまたがっているとき
でも、最適な位置で読みの区切りを予め指定できるた
め、少ないメモリで制御でき、かつ、読み間違いをせ
ず、聞き取りやすい音声合成がなされる。Further, even when a word spans a page, the reading delimiter can be designated in advance at an optimum position, so that the control can be performed with a small memory, and the reading can be performed easily without being erroneously read.

【００３４】[0034]

【発明の実施の形態】次に、本発明の実施形態について
図面を参照して説明する。Next, embodiments of the present invention will be described with reference to the drawings.

【００３５】まず、本実施形態で用いられるタグ情報の
書式について図５乃至図７を参照して説明する。First, the format of the tag information used in this embodiment will be described with reference to FIGS.

【００３６】本実施形態では、タグ情報の開始を表す第
１の記号として“＜”文字を用いられている。タグ情報
は、“＜”文字の次に、タグ情報の種類を表す文字列が
記述されている。In this embodiment, the "<" character is used as the first symbol indicating the start of tag information. In the tag information, a character string representing the type of tag information is described after the “<” character.

【００３７】タグ情報により示される情報の種類として
は、図５に示される読み方指定の情報（内容識別子：Ｔ
ＣＨＧ）、図６に示される読み上げ禁止の情報（内容識
別子：ＴＮＯＰ）、図７に示される読みの区切りの情報
（内容識別子：ＴＥＯＴ）がある。As the type of information indicated by the tag information, reading designation information (content identifier: T shown in FIG. 5 is used.
CHG), reading prohibited information (content identifier: TNOP) shown in FIG. 6, and reading delimiter information (content identifier: TEOT) shown in FIG.

【００３８】タグ情報が図５に示される単語の読み方を
指定するものであるときには、“＜”文字の次に単語の
読み情報やアクセント情報を記述する。次に、タグ情報
の終わりを表す第２の記号としての“＞”文字を記述す
る。When the tag information specifies how to read the word shown in FIG. 5, the word reading information and accent information are described after the "<" character. Next, the ">" character as the second symbol indicating the end of the tag information is described.

【００３９】また、タグ情報が、図５に示される単語の
読み方を指定するタグ情報と図６に示される単語の読み
上げの禁止を指定するタグ情報のときには、タグ情報の
種類を表す文字列（内容識別子）の前に第３の記号であ
る“／”文字を付加したタグ情報を用い、処理の対象と
なる単語の範囲を特定する。Further, when the tag information is the tag information for designating the reading of the words shown in FIG. 5 and the tag information for designating the prohibition of reading the words as shown in FIG. 6, a character string indicating the kind of the tag information ( The tag range in which the third symbol "/" is added before the content identifier) is used to specify the range of words to be processed.

【００４０】図５に示される単語の読み方を指定するタ
グ情報の場合には、単語の読み方を示す音韻・音律を示
す必要があるが、図６に示される単語の読み上げの禁止
を指定するタグ情報の場合には、単に処理範囲の読み上
げを行わないだけでよい。また、図７に示される読みの
区切りの情報は改頁のように用いられるものであり、処
理範囲がないために“／”文字を付加したタグ情報は必
要とされない。このように処理に応じて必要とされるタ
グ情報の内容も異なるものとなり、本実施形態において
は以下のタグ情報が用いられている。In the case of the tag information for designating the reading of words shown in FIG. 5, it is necessary to indicate the phoneme and temperament indicating the reading of words, but the tag for designating the prohibition of reading of words shown in FIG. In the case of information, it is not necessary to simply read the processing range. Further, the reading delimiter information shown in FIG. 7 is used like a page break, and tag information added with the "/" character is not required because there is no processing range. In this way, the contents of the tag information required depending on the processing also differ, and the following tag information is used in this embodiment.

【００４１】第１のタグ情報：読み方指定の範囲開始を
示すタグ情報第２のタグ情報：読み方指定の範囲終了を示すタグ情報第３のタグ情報：読み上げ禁止の範囲開始のタグ情報第４のタグ情報：読み上げ禁止の範囲終了のタグ情報第５のタグ情報：読みの区切りのタグ情報本実施形態でのタグ情報の検索は、“＜”文字を検索す
ることにより行われる。“＜”文字が検出されると、次
に、続く文字列が上記の各内容識別子であるかが確認さ
れ、異なる場合には漢字仮名混じり文章の本文中の
“＜”文字として認識する。また、直後に現れる第２の
記号である“＞”文字を検索することより、タグ情報の
終了を確認する。First tag information: Tag information indicating the start of the range of reading designation Second tag information: Tag information indicating the end of the range of reading designation Third tag information: Tag information of the beginning of the reading prohibited range Tag information: Tag information at the end of the reading prohibited range Fifth tag information: Tag information for reading delimiter The tag information search in the present embodiment is performed by searching for the "<" character. When the "<" character is detected, it is then checked whether or not the following character string is each of the above content identifiers, and if they are different, it is recognized as the "<" character in the text of the kanji-kana mixed sentence. Also, the end of the tag information is confirmed by searching for the second symbol “>” character that appears immediately after.

【００４２】図１は本実施形態の構成を表すブロック図
であり、読み上げの対象となる漢字仮名混じり文章１１
と、漢字仮名混じり文章１１内のタグ情報の処理を行う
タグ情報処理部２８と、漢字仮名混じり文章１１の中の
文字列の表示位置やサイズを確定する表示ソフト１４
と、処理された文字列を表示する文章表示部１９と、漢
字仮名混じり文章１１を解析して音声波形に変換する読
み上げソフト２０（読み上げ実行手段）と、音声波形を
出力する音声出力部２６と、次のページを読み上げると
きに準備される文字列を格納する持ち越しバッファ２７
から構成される。FIG. 1 is a block diagram showing the configuration of this embodiment, in which a sentence 11 mixed with kanji and kana to be read is read.
And a tag information processing unit 28 that processes tag information in the kanji-kana mixed sentence 11 and display software 14 that determines the display position and size of the character string in the kanji-kana mixed sentence 11.
A sentence display unit 19 for displaying the processed character string, reading software 20 (reading execution means) for analyzing the sentence 11 mixed with Kanji and Kana and converting it into a voice waveform, and a voice output unit 26 for outputting the voice waveform. , Carry-over buffer 27 that stores the character string prepared when reading the next page
Consists of

【００４３】タグ情報処理部２８は、タグ情報を検索す
るタグ検索部１２と、検索したタグ情報の種類を判別し
てタグの種類に応じた処理を行うタグ情報解析部１３か
ら構成されている。The tag information processing unit 28 is composed of a tag searching unit 12 for searching the tag information and a tag information analyzing unit 13 for discriminating the kind of the searched tag information and performing a process according to the kind of the tag. .

【００４４】表示ソフト１４は、表示する文章データを
格納する表示バッファ１５と、文章データ中の文字列の
位置やサイズなどを確定するテキスト整形手段１６と、
文字列の形を記録しているフォントデータ１８と、フォ
ントデータ１８を使用して、表示するときのドットデー
タを作成するドットデータ作成手段１７から構成されて
いる。The display software 14 includes a display buffer 15 for storing the text data to be displayed, a text shaping means 16 for determining the position and size of a character string in the text data,
It is composed of font data 18 for recording the shape of a character string and dot data creation means 17 for creating dot data for display using the font data 18.

【００４５】読み上げソフト２０は、読み上げる文章デ
ータを格納する読み上げバッファ２１と、単語の読み情
報やアクセント情報を記録している日本語辞書２４と、
日本語辞書２４を使用して漢字仮名混じり文章１１を発
音記号列に変換するテキスト処理部２２と、発音記号列
を音声波形として記録している音声データ２５と、発音
記号列を音声波形に変換する音声合成処理部２３から構
成されている。上記の各部における処理は不図示の制御
装置によって行われている。The reading software 20 includes a reading buffer 21 for storing text data to be read, a Japanese dictionary 24 for recording reading information and accent information of words,
A text processing unit 22 for converting a sentence 11 mixed with Kanji and Kana into a phonetic symbol string using a Japanese dictionary 24, voice data 25 in which the phonetic symbol string is recorded as a voice waveform, and a phonetic symbol string for converting into a voice waveform. It is composed of a voice synthesis processing section 23. The processing in each of the above units is performed by a control device (not shown).

【００４６】図２は本実施形態における処理動作をまと
めて示すフローチャート、図３は図２中のタグ情報検索
処理Ａ３における処理手順を示すフローチャート、図４
は図２中のタグ情報解析処理Ａ４における処理手順を示
すフローチャートである。FIG. 2 is a flow chart collectively showing the processing operation in the present embodiment, FIG. 3 is a flow chart showing the processing procedure in the tag information search processing A3 in FIG. 2, and FIG.
6 is a flowchart showing a processing procedure in tag information analysis processing A4 in FIG.

【００４７】まず、本実施形態における全体的な処理手
順について図２を参照して説明する。First, the overall processing procedure in this embodiment will be described with reference to FIG.

【００４８】漢字仮名混じり文章１１は１ページ分毎の
文書データとしてタグ情報検索部１２に入力される（ス
テップＡ１）。次に、入力された１ページ分の文書デー
タと後述する持ち越しバッファ２７内の文書データを結
合する（ステップＡ２）。続いて、タグ情報検索部１２
にて結合した文章データ中にタグ情報が存在するかが検
索され（ステップＡ３）、この後、ステップＡ３にてタ
グ情報が検出されたかを確認する（ステップＡ４）。The text 11 mixed with kanji and kana is input to the tag information retrieval unit 12 as document data for each page (step A1). Next, the input document data for one page and the document data in the carry-over buffer 27 described later are combined (step A2). Then, the tag information search unit 12
It is searched whether or not the tag information is present in the combined text data in (step A3), and then it is confirmed whether or not the tag information is detected in step A3 (step A4).

【００４９】ステップＡ４にてタグ情報が検出されたこ
とが確認された場合には、タグ情報解析部１３にて検出
されたタグ情報の種類を判別し、その種類に応じて文章
データを解析し（ステップＡ５）、解析後、ステップＡ
３に戻り上記動作を繰り返す。When it is confirmed that the tag information is detected in step A4, the type of the tag information detected by the tag information analysis unit 13 is determined, and the sentence data is analyzed according to the type. (Step A5), Step A after analysis
3 and the above operation is repeated.

【００５０】ステップＡ４にてタグ情報が検出されない
ことが確認された場合には、表示ソフト１４のテキスト
整形手段１６において表示バッファ１５内に格納されて
いる文章データの文字列の表示位置や表示サイズなどを
決定する。また、ドットデータ作成手段１７においては
フォントデータ１８を使用して表示時の文字列を作成
し、表示部１９により表示する（ステップＡ６）。When it is confirmed that the tag information is not detected in step A4, the display position and the display size of the character string of the sentence data stored in the display buffer 15 by the text shaping means 16 of the display software 14. And so on. Further, the dot data creating means 17 creates a character string for display using the font data 18 and displays it on the display unit 19 (step A6).

【００５１】次に、読み上げソフト２０のテキスト処理
手段２２において、読み上げバッファ２１内に格納され
ている文章データを日本語辞書２４を使用して発音記号
列に変換し、また、音声合成処理部２３においては、音
声データ２５を使用して音声波形に変換して音声出力部
２６で音声出力する（ステップＡ７）。Next, in the text processing means 22 of the reading software 20, the sentence data stored in the reading buffer 21 is converted into a phonetic symbol string using the Japanese dictionary 24, and the speech synthesis processing section 23 is also used. In step S7, the audio data 25 is used to convert the audio waveform into an audio waveform, and the audio output unit 26 outputs the audio (step A7).

【００５２】取得した１ページ分の文章データの表示と
読み上げが終了すると、ページ番号を更新した（ステッ
プＡ８）後に、更新したページ番号が漢字仮名混じり文
章の最終ページを超えたかを確認する（ステップＡ
９）。ステップＡ９での確認の結果、最終ページを超え
ていないことが確認された場合には、ステップＡ１へ戻
って上記の動作を繰り返し、最終ページを超えたことが
確認された場合には、表示および読み上げ処理を終了す
る。When the display and reading of the acquired sentence data for one page are completed, the page number is updated (step A8), and then it is confirmed whether the updated page number exceeds the final page of the sentence containing the kanji and kana (step A8). A
9). As a result of the confirmation in step A9, if it is confirmed that the last page is not exceeded, the process returns to step A1 and the above operation is repeated, and if it is confirmed that the last page is exceeded, display and The reading process ends.

【００５３】次に、ステップＡ３にて行われるタグ情報
検索処理について、図３を参照して、詳細に説明する。Next, the tag information search processing performed in step A3 will be described in detail with reference to FIG.

【００５４】ステップＡ３におけるタグ情報検索処理で
は、まず、文章データ中のタグ情報の開始を示す記号
“＜”を検索し（ステップＢ１）、検索できたかを確認
する（ステップＢ２）。In the tag information search processing in step A3, first, the symbol "<" indicating the start of tag information in the text data is searched (step B1), and it is confirmed whether the search was successful (step B2).

【００５５】タグ情報の開始を示す記号“＜”が検出さ
れたときには、タグ情報以前の文字列を表示バッファ１
５に格納し（ステップＢ３）、さらに、次の文字が内容
識別子であるかを確認する（ステップＢ４）。続く文字
が識別子でない場合には、検出された記号“＜”はタグ
情報の開始を表す記号ではなく、本文中に記述されてい
る“＜”文字であると判別して“＜”文字を表示バッフ
ァ１５に格納する（ステップＢ７）。When the symbol "<" indicating the start of tag information is detected, the character string before the tag information is displayed in the display buffer 1.
5 (step B3) and further confirms whether the next character is a content identifier (step B4). If the following character is not an identifier, the detected symbol “<” is not a symbol that indicates the start of tag information, but is determined to be the “<” character described in the text and the “<” character is displayed. The data is stored in the buffer 15 (step B7).

【００５６】ステップＢ４での確認の結果、続く文字が
“＜”でないことが確認された場合には、タグ情報以前
の文字列を読み上げバッファ２１に格納し（ステップＢ
５）、返値を“検索できた”に設定して（ステップＢ
６）、タグ情報検索処理を終了する。As a result of the confirmation in step B4, if it is confirmed that the following character is not "<", the character string before the tag information is stored in the reading buffer 21 (step B
5) Set the return value to "I was able to search" (step B
6), the tag information search process ends.

【００５７】ステップＢ２において、“＜”文字を検索
できないときは、文字列を表示バッファ１５および読み
上げバッファ２１にそれぞれ格納し（ステップＢ８，ス
テップＢ９）、返値を“検索できない”に設定して（ス
テップＢ１０）、タグ情報検索処理を終了する。In step B2, when the "<" character cannot be retrieved, the character strings are stored in the display buffer 15 and the reading buffer 21 (step B8, step B9), and the return value is set to "cannot be retrieved". (Step B10), the tag information search process ends.

【００５８】次に、ステップＡ５におけるタグ情報解析
処理について、図４を参照して、詳細に説明する。Next, the tag information analysis processing in step A5 will be described in detail with reference to FIG.

【００５９】ステップＡ５におけるタグ情報解析処理に
おいては、まず、タグ情報の種類を判別することが行わ
れ（ステップＣ１）、タグ情報の種類に応じてそれぞれ
異なる処理が行われる。In the tag information analysis processing in step A5, first, the type of tag information is determined (step C1), and different processing is performed depending on the type of tag information.

【００６０】タグ情報の種類には図５乃至図７に示した
ように、読み方指定、読み上げ禁止、読みの区切りの３
種類がある。As shown in FIGS. 5 to 7, there are three types of tag information: reading designation, reading prohibition, and reading delimiter.
There are types.

【００６１】タグ情報が単語の読み方を指定するタグ情
報である場合には、タグ情報内に記述されている単語の
読み情報やアクセント情報（音韻・音律情報）を取得し
（ステップＣ２）、単語の読み情報やアクセント情報を
読み上げバッファ２１に格納する（ステップＣ３）。次
に、本タグ情報での処理の対象となった単語の範囲の終
了を示すタグ情報を検出し（ステップＣ４）、本タグ情
報と範囲の終了タグ情報に囲まれた文字列を表示・バッ
ファ１５に格納して（ステップＣ５）、タグ情報解析処
理を終了する。When the tag information is the tag information designating the reading of the word, the reading information and accent information (phonological / temperal information) of the word described in the tag information is acquired (step C2), and the word is read. The reading information and the accent information of are stored in the reading buffer 21 (step C3). Next, the tag information indicating the end of the range of the word processed by this tag information is detected (step C4), and the character string surrounded by this tag information and the end tag information of the range is displayed / buffered. It stores in 15 (step C5), and ends the tag information analysis process.

【００６２】ステップＣ１での判別の結果、タグ情報の
種類が読み上げを禁止するタグ情報である場合には、本
タグ情報での処理の対象となった単語の範囲の終了を示
すタグ情報を検出し（ステツプＣ６）、本タグ情報と範
囲の終了タグ情報に囲まれた文字列を表示バッファ１５
に格納して（ステツプＣ７）、タグ情報解析処理を終了
する。As a result of the discrimination in step C1, when the type of the tag information is the tag information for which reading is prohibited, the tag information indicating the end of the range of the word to be processed by this tag information is detected. (Step C6), the character string surrounded by this tag information and the end tag information of the range is displayed in the display buffer 15
(Step C7), and the tag information analysis process ends.

【００６３】タグ情報の種類がページ内の読みの区切り
を示すタグ情報であるときは、本タグ情報以降の文字列
を持ち越しバッファ２７に格納して（ステップＣ８）、
タグ情報解析処理を終了する。When the type of the tag information is the tag information indicating the reading delimiter in the page, the character strings after this tag information are stored in the carry-over buffer 27 (step C8).
The tag information analysis process ends.

【００６４】[0064]

【実施例】次に、図面を参照して、本発明の実施例につ
いて説明する。Next, an embodiment of the present invention will be described with reference to the drawings.

【００６５】図８（ａ）〜（ｃ）のそれぞれは、単語の
読み方を指定するタグ情報“＜ＴＣＨＧ＞”を用いたと
きの文章データ、表示結果および音声出力結果を示す図
である。Each of FIGS. 8A to 8C is a diagram showing the sentence data, the display result, and the voice output result when the tag information "<TCHG>" designating the reading of the word is used.

【００６６】音声出力の対象が専門文章である場合、図
８（ｂ）に示すように、“ＨＤＤ”や“ＧＢ”のよう
に、専門用語を省略して、アルファベットで記述するこ
とが多々ある。しかし、全ての専門用語を日本語辞書２
４（図１参照）に登録することは不可能であるため、音
声出力するときは、“えっちでぃでぃ”や“じーびー”
のようにアルファベットそのものを読み上げてしまって
いた。また、“×”のような記号文字の場合、日本語辞
書２４に“ばつ”としか登録されていないときには、
“ろっぴゃくよんじゅうばつよんひゃくはちじゅ
う”のように意味の通らない読み方をしてしまってい
た。When the target of voice output is a technical sentence, technical terms are often omitted and written in alphabets, such as "HDD" and "GB", as shown in FIG. 8B. . However, all the technical terms in Japanese dictionary 2
Since it is impossible to register in 4 (see Fig. 1), when outputting voice, "ETCHIDEDI" or "JeeBee"
I read the alphabet itself like. Further, in the case of a symbol character such as "x", when only "Batsu" is registered in the Japanese dictionary 24,
I was reading something that didn't make sense, such as "Roppyakujyo batsu yonhyakuhachiju".

【００６７】本発明では上述のような読み方指定のタグ
情報を用いることによ上記のような不都合が発生するこ
とを防止している。In the present invention, the use of the tag information specifying the reading as described above prevents the inconvenience described above from occurring.

【００６８】図８（ａ）に示すように、単語の読み方を
指定するタグ情報を用いて、“システム構成は以下の通
りである。＜ＴＣＨＧ，ハードディスクドライブ＞
ＨＤＤ＜／ＴＣＨＧ＞１＜ＴＣＨＧ，ギガバイト
＞ＧＢ＜／ＴＣＨＧ＞メモリ３２＜ＴＣＨＧ，
メガバイト＞ＭＢ＜／ＴＣＨＧ＞マウスあり
解像度６４０＜ＴＣＨＧ，かける＞ × ＜／ＴＣＨ
Ｇ＞４８０”のように文章データを記述する。このよう
な文章データに対して、図２乃至図５に示した手順によ
る処理を行うことにより、図８（ｃ）に示すように、
“しすてむこうせいはいかのとおりである。はーどでぃ
すくどらいぶいちぎがばいとめもりさんじゅうに
めがばいとまうすありかいぞうどろっぴゃくよ
んじゅうかけるよんひゃくはちじゅう”と音声出力
することができる。As shown in FIG. 8A, "system configuration is as follows using tag information designating how to read a word. <TCHG, hard disk drive>"
HDD </ TCHG> 1 <TCHG, Gigabyte> GB </ TCHG> Memory 32 <TCHG,
Megabyte> MB </ TCHG> Mouse available
Resolution 640 <TCHG, multiply> × </ TCH
The text data is described as G> 480 ". By performing the processing according to the procedure shown in FIGS. 2 to 5 on such text data, as shown in FIG.
“How is it? Can be output as voice.

【００６９】本実施例では、専門用語の読み方を指定す
る例を挙げて説明したが、読み方が難解な単語や人名、
文章作成者が特別な読み方を指定させたい単語の読み方
を指定することも当然可能である。In the present embodiment, an example of specifying how to read technical terms has been described, but words and person names that are difficult to read,
Of course, it is also possible for the sentence creator to specify the reading of a word for which he wants to specify a special reading.

【００７０】図９（ａ）〜（ｃ）のそれぞれは、単語の
読み上げを禁止するタグ情報“＜ＴＮＯＰ＞”を用いた
ときの文章データ、表示結果および音声出力結果を示す
図である。FIGS. 9A to 9C are diagrams showing the sentence data, the display result, and the voice output result when the tag information "<TNOP>" for prohibiting the reading of words is used.

【００７１】漢字仮名混じり文章１１（図１参照）で
は、文章中の項目を強調するために飾り文字を用いる場
合がある。このような飾り文字が日本語辞書２４に登録
されていた場合、図９（ｂ）に示すような“○今日の献
立”という表示結果について、“まるきょうのこんだ
て”と音声出力することがある。また、“胡淑（こしょ
う）”のようにルビが振られている単語については“こ
しょうこしょう”と繰り返して音声出力することがあ
る。In the sentence 11 with kanji and kana mixed (see FIG. 1), decorative characters may be used to emphasize items in the sentence. When such a decorative character is registered in the Japanese dictionary 24, a voice output of "Marukyo no Kodate" is output for the display result of "○ today's menu" as shown in FIG. 9 (b). There is. In addition, for words such as “kosho” with ruby, the word “koshokosho” may be repeatedly output as voice.

【００７２】本発明では、上述のような単語の読み上げ
を禁止するタグ情報を用いることにより、上記のような
不都合が生じることが防止される。In the present invention, the use of the tag information for prohibiting the reading of words as described above prevents the above inconvenience from occurring.

【００７３】図９（ａ）に示すように、単語の読み上げ
を禁止するタグ情報を用いて、“＜ＴＮＯＰ＞○＜／Ｔ
ＮＯＰ＞今日の献立＜＜１２月２５日＞今日は七面
鳥の丸焼きを作ります。＜ＴＮＯＰ＞☆＜／ＴＮＯＰ
＞材料・七面鳥・塩・胡淑＜ＴＮＯＰ＞（こしょ
う）＜／ＴＮＯＰ＞・醤油＜ＴＮＯＰ＞（しょうゆ）
＜／ＴＮＯＰ）＞”のように文章データを記述する。こ
のような文章データに対して、図２乃至図５に示した手
順による処理を行うことにより、図９（ｃ）に示すよう
に、“きょうのこんだてじゆうにがつにじゆうごにち
きようはしちめんちょうのまるやきをつくります。
ざいりょうしちめんちょうしおこしょうしょう
ゆ”と音声出力することができる。また、本実施例で
は、図９（ａ）に示すように“＜”文字を本文中に使用
するときは、“＜＜” と記述して、本文とタグ情報と
を区別している。As shown in FIG. 9A, by using the tag information for prohibiting the reading of words, "<TNOP> ○ <// T
NOP>Today's Menu <<< December 25th> Today, I will make a roasted turkey. <TNOP> ☆ </ TNOP
＞ Ingredients ・ Turkey ・ Salt ・ Pepper <TNOP> (kosho) </ TNOP> ・ Soy sauce <TNOP> (soy sauce)
</ TNOP)> ”, the text data is described. By performing the processing shown in FIGS. 2 to 5 on such text data, as shown in FIG. 9C, "Today, we will make a round sword that will be used for daily life.
In this embodiment, when the “<” character is used in the text as shown in FIG. 9 (a), it is described as “<<<”. Then, the text and the tag information are distinguished.

【００７４】図１０（ａ）〜（ｃ）のそれぞれは、ペー
ジの読みの区切りを指定するタグ情報“＜ＴＥＯＴ＞”
を用いたときの文章データ、表示結果および音声出力結
果を示す図である。In each of FIGS. 10A to 10C, tag information "<TEOT>" for designating a page break is specified.
It is a figure which shows the text data when using, a display result, and a voice output result.

【００７５】漢字仮名混じり文章１が複数ページから構
成されている場合、図１０（ｂ）に示すように単語がペ
ージ間にまたがってしまうことがある。音声出力がペー
ジ単位で制御されている場合、図９（ｂ）に示すように
“越後”という単語が“越”と“後”に分かれるため、
“えちご”という読み方をする単語ではなく、“えつ”
と“のち”という読み方をする単語と解釈し、読み問違
えてしまう。When the sentence 1 containing kanji and kana is composed of a plurality of pages, the words may extend over the pages as shown in FIG. 10 (b). When the voice output is controlled on a page-by-page basis, the word "Echigo" is divided into "Etsu" and "Rear" as shown in FIG.
"Etsu" instead of the word "Echigo"
It is interpreted as a word that is read as "nochi" and misinterpreted.

【００７６】本発明では、上述したようなページの読み
の区切りを指定するタグ情報を用いることにより上記の
ような不都合が生じることを防止している。In the present invention, the use of the tag information for designating the page breaks as described above prevents the above inconvenience from occurring.

【００７７】図１０（ａ）に示すように、ページの読み
の区切りを指定するタグ情報を用いて、次のページに持
ち越す文字列の前に“＜ＴＥＯＴ＞”を記述する。この
文章データに対して、図２乃至図５に示した手順による
処理を行うことにより、図１０（ｃ）に示すように、
“越”は次のページに持ち越され、“えちご”として音
声出力される。As shown in FIG. 10A, "<TEOT>" is described before the character string to be carried over to the next page by using the tag information for designating the page reading delimiter. By performing the processing according to the procedure shown in FIGS. 2 to 5 on this text data, as shown in FIG.
"Etsu" is carried over to the next page and output as "Echigo".

【００７８】[0078]

【発明の効果】以上のように本発明によれば、タグ情報
と本文を区別する手段を具備しているため、本文中にタ
グ情報の開始を示す語句を使用することができる。As described above, according to the present invention, since the means for distinguishing the tag information from the text is provided, it is possible to use the word or phrase indicating the start of the tag information in the text.

【００７９】また、単語の読み方指定のタグ情報や単語
の読みを禁止するタグ情報を使用することにより、文章
作成者の意図した読み方を行う文章読み上げ方式を提供
することができる。Further, by using the tag information for designating the reading of the word and the tag information for prohibiting the reading of the word, it is possible to provide a sentence reading system for performing the reading intended by the sentence creator.

【００８０】また、ページ単位で文章の読み上げを行う
場合、ページ間に単語がまたがった場合でも、ページの
読みの区切りを指定するタグ情報を使用することによ
り、少ないメモリで制御でき、かっ読み間違いをせず、
間き取りやすい文章読み上げ方式を提供することができ
る。Further, when reading a sentence page by page, even if words are spread over pages, it is possible to control with a small memory by using the tag information for designating a page reading delimiter. Without
It is possible to provide a text-to-speech system that is easy to catch.

[Brief description of drawings]

【図１】本発明の一実施例のシステム構成図である。FIG. 1 is a system configuration diagram of an embodiment of the present invention.

【図２】図１に示した実施例の流れ図である。2 is a flow chart of the embodiment shown in FIG.

【図３】図１に示した実施例の流れ図である。FIG. 3 is a flow chart of the embodiment shown in FIG.

【図４】図１に示した実施例の流れ図である。FIG. 4 is a flow chart of the embodiment shown in FIG.

【図５】図１に示した実施例におけるタグ情報の説明図
である。5 is an explanatory diagram of tag information in the embodiment shown in FIG.

【図６】図１に示した実施例における単語の読み方指定
のタグ情報の一例を示す図である。6 is a diagram showing an example of tag information for designating the reading of words in the embodiment shown in FIG.

【図７】図１に示した実施例における単語の読み方指定
のタグ情報の一例を示す図である。FIG. 7 is a diagram showing an example of tag information for designating word reading in the embodiment shown in FIG. 1;

【図８】図１に示した実施例における単語の読み方指定
のタグ情報の一例を示す図である。8 is a diagram showing an example of tag information for designating a reading method of words in the embodiment shown in FIG.

【図９】図１に示した実施例における単語の読み上げ禁
止のタグ情報の一例を示す図である。9 is a diagram showing an example of tag information for prohibiting reading of words in the embodiment shown in FIG.

【図１０】図１に示した実施例におけるページの読みの
区切り指定のタグ情報の一例を示す図である。FIG. 10 is a diagram showing an example of tag information for designating page reading breaks in the embodiment shown in FIG. 1;

【図１１】従来例のシステム構成図である。FIG. 11 is a system configuration diagram of a conventional example.

[Explanation of symbols]

１１漢字仮名混じり文章１２タグ情報検索部１３タグ情報解析部１４表示ソフト１５表示バッファ１６テキスト整形手段１７ドットデータ作成手段１８フォントデータ１９文章表示部２０読み上げソフト２１読み上げバッファ２２テキスト処理部２３音声合成処理部２４日本語辞書２５音声データ２６音声出力部２７持ち越しバッファ 11 Texts mixed with Kanji and Kana 12 Tag information search section 13 Tag information analysis section 14 Display software 15 Display buffer 16 Text shaping means 17 Dot data creation means 18 Font data 19 Text display section 20 Reading software 21 Reading buffer 22 Text processing section 23 Speech synthesis Processing unit 24 Japanese dictionary 25 Voice data 26 Voice output unit 27 Carry-over buffer

Claims

[Claims]

1. A kanji-kana mixed sentence to be processed is one in which tag information indicating that a predetermined processing operation is to be inserted is used, and the kanji-kana mixed sentence input every one page is read out. A sentence reading method, comprising: tag information searching means for searching tag information in a sentence containing the kanji and kana; tag information analyzing means for analyzing processing operation indicated by tag information detected as a result of the search; A text-to-speech system, comprising: a text-to-speech execution means that reads out the text containing the kanji and kana input according to the processing operation indicated by the tag information analyzed by the means.

2. The sentence reading system according to claim 1, wherein the tag information is sandwiched between a first symbol indicating the start of the tag information, a second symbol indicating the end of the tag information, and each symbol. The tag information searching means searches the tag information by searching the first and second symbols, and when the first symbol is detected, the tag information is continuously searched. The sentence reading system is characterized in that the tag information analyzing means analyzes the processing operation indicated by the tag information by the identifier when it is continuous.

3. The text-to-speech system according to claim 2, wherein there is a process of designating a reading as the type of processing operation indicated by the tag information, and the tag information includes an identifier for designating the reading and a phoneme / phonetic temperament. And the second tag information in which the identifier attached to the first tag information is further attached with a third symbol indicating the end of the processing range. A text-to-speech system characterized in that a character string between the first and second tag information is read as a phoneme / temperal in the first tag information.

4. The sentence reading method according to claim 2 or 3, wherein there is a process of prohibiting reading as a type of processing operation indicated by the tag information, and the tag information has an identifier indicating that reading is prohibited. And the fourth tag information in which the identifier attached to the third tag information is further attached with a third symbol indicating the end of the processing range. A text-to-speech system, which does not read a character string between the third and fourth tag information.

5. The sentence reading system according to claim 2, wherein there is tag information indicating a reading break as a type of processing operation indicated by the tag information, and the reading break is included as the tag information. Fifth tag information with an identifier indicating that is used is used, and the reading execution means does not continuously read the character string after the fifth tag information, and the next input kanji-kana mixed sentence. A text-to-speech system characterized by reading aloud along with it.