JPH11194790A

JPH11194790A - Voice recognition actuator

Info

Publication number: JPH11194790A
Application number: JP9368279A
Authority: JP
Inventors: Atsushi Yamauchi; 敦史山内
Original assignee: Kyocera Corp
Current assignee: Kyocera Corp
Priority date: 1997-12-29
Filing date: 1997-12-29
Publication date: 1999-07-21
Anticipated expiration: 2017-12-29
Also published as: JP3519259B2

Abstract

(57)【要約】【課題】音声認識部にとって最も認識し易い発声の仕
方で音声入力することを学習できる音声認識作動装置を
提供する。【解決手段】話者の音声を入力して音声信号を出力す
る音声入力部１２と、音声信号を入力して話者の音声を
認識する音声認識部１８と、名義名の音声パターンを登
録する音声パターン登録部２２と、予め学習メッセージ
が登録された学習メッセージ登録部４８と、相手方から
の通話音声を出力する音声出力部２６と、音声認識部１
８により認識された名義名及びこれに対応するダイヤル
番号を表示する表示部３０と、音声認識部１８からの信
号に基づいて所定の処理動作を制御する制御部２０とを
備えた音声認識作動装置１０において、名義名の音声入
力時は、音声出力部２６から学習メッセージが音声出力
されるようにした。 (57) [Summary] [PROBLEMS] To provide a speech recognition operation device that can learn to input a speech in a manner of utterance that is most easily recognized by a speech recognition unit. SOLUTION: A voice input unit 12 which inputs a voice of a speaker and outputs a voice signal, a voice recognition unit 18 which receives a voice signal and recognizes a voice of the speaker, and registers a voice pattern of a nominal name. A voice pattern registration unit 22, a learning message registration unit 48 in which a learning message is registered in advance, a voice output unit 26 for outputting a voice of a call from the other party, and a voice recognition unit 1.
A voice recognition actuating device including a display unit 30 for displaying the nominal name recognized by the control unit 8 and a dial number corresponding to the name, and a control unit 20 for controlling a predetermined processing operation based on a signal from the voice recognition unit 18 In 10, at the time of voice input of the nominal name, the voice output unit 26 outputs a learning message by voice.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、人間が入力した音
声を認識してこれに基づいて適宜作動する音声認識作動
装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice recognition operating device that recognizes voice input by a human and operates appropriately based on the voice.

【０００２】[0002]

【従来の技術】従来の音声認識作動装置としては、例え
ば、特開昭６１−１４４１５７号公報に記載されたよう
な、ダイヤル番号に対応する電話加入者の名前（名義
名）をマイクロフォンに向かって発声することにより、
電話の音声認識部がその名義名を認識して、それに対応
するダイヤル番号を自動的に呼び出すことができる電話
機に係る音声ダイヤル装置がある。2. Description of the Related Art As a conventional voice recognition operating device, for example, a telephone subscriber's name (name) corresponding to a dialed number is directed to a microphone as described in Japanese Patent Application Laid-Open No. 61-144157. By speaking,
2. Description of the Related Art There is a voice dial device related to a telephone capable of recognizing a name by a voice recognition unit of the telephone and automatically calling a corresponding dial number.

【０００３】このような電話機に係る音声ダイヤル装置
は、自動車電話機や携帯型電話機に用いることができ、
手動でダイヤルやテンキーを操作しなくとも電話をかけ
ることが可能となり、他の作業で手がふさがっている場
合でも電話をかけることが可能となる。[0003] Such a voice dial device relating to a telephone can be used for an automobile telephone or a portable telephone.
It is possible to make a call without manually operating the dial or the numeric keypad, and it is possible to make a call even if the hand is full for other work.

【０００４】[0004]

【発明が解決しようとする課題】しかしながら、このよ
うな従来の音声ダイヤル装置においては、登録時にダイ
ヤル番号の名義名を音声入力する際、どのように発声し
て入力すれば音声認識部にとって最も認識し易いかをユ
ーザーがまだ知らない場合、或いは慣れていない場合、
音声認識部にとって認識しにくい発声の仕方で音声入力
することにより、発呼時にうまく音声認識部が認識でき
ないでスムーズにダイヤル番号を呼び出すことができな
いおそれがある。However, in such a conventional voice dialing device, when the name of the dialed number is voice-inputted during registration, how to utter and input the name is the most recognized by the voice recognition unit. If the user does not know yet or is unfamiliar with it,
By inputting a voice in a manner of utterance that is difficult for the voice recognition unit to recognize, there is a possibility that the voice recognition unit may not be able to recognize the voice when calling and the dial number may not be smoothly called.

【０００５】そこで本発明は、上記問題点に鑑みて、登
録や音声認識の動作における音声入力についてまだ習熟
していないユーザーであっても、音声認識部にとって最
も認識し易い発声の仕方で音声入力することを学習でき
る音声認識作動装置を提供することを課題とするもので
ある。[0005] In view of the above problems, the present invention provides a speech inputting method that is most easily recognized by the speech recognizing unit even if the user is not yet proficient in speech input in the registration and speech recognition operations. It is an object of the present invention to provide a speech recognition operating device capable of learning to do.

【０００６】[0006]

【課題を解決するための手段】上記課題を解決するため
に、本発明による音声認識作動装置は、話者の音声を入
力して電気的な音声信号を出力する音声入力部と、前記
音声信号を入力して前記話者の音声を認識する音声認識
部と、相手方の電話加入者の名義名の音声パターンを登
録する音声パターン登録部と、予め学習メッセージが登
録された学習メッセージ登録部と、相手方からの通話音
声を出力する音声出力部と、前記音声認識部により認識
された前記名義名及び／又はこれに対応するダイヤル番
号を表示する表示部と、前記音声認識部からの信号に基
づいて登録、認識、読み取り、表示、その他の所定の処
理動作を制御する制御部とを備えた音声認識作動装置に
おいて、前記名義名の音声入力時は、前記音声出力部か
ら前記学習メッセージが音声出力される構成としたもの
である。In order to solve the above-mentioned problems, a voice recognition operating device according to the present invention comprises a voice input section for inputting a voice of a speaker and outputting an electrical voice signal; , A voice recognition unit that recognizes the voice of the speaker by inputting, a voice pattern registration unit that registers a voice pattern of the name of the other party's telephone subscriber, a learning message registration unit in which a learning message is registered in advance, A voice output unit for outputting a call voice from the other party, a display unit for displaying the nominal name recognized by the voice recognition unit and / or a dial number corresponding thereto, and a signal from the voice recognition unit. A control unit for controlling registration, recognition, reading, display, and other predetermined processing operations, wherein when the nominal name is input by voice, the learning message is sent from the voice output unit. It is obtained by a configuration in which di is the audio output.

【０００７】このような構成の音声認識作動装置によれ
ば、名義名の音声入力時は学習メッセージ登録部に登録
された学習メッセージが音声出力部から音声出力される
ため、前もってユーザーが発声の仕方の学習をすること
ができるので、登録や音声認識の動作における音声入力
についてまだ習熟していないユーザーであっても、ユー
ザーは音声認識部にとって最も認識し易い発声の仕方で
音声入力をすることが可能となる。[0007] According to the voice recognition operating device having such a configuration, the learning message registered in the learning message registration unit is output from the voice output unit by voice when the nominal name is input, so that the user can speak in advance. Therefore, even if the user has not yet mastered the voice input in the registration and voice recognition operations, the user can input the voice in the most recognizable way for the voice recognition unit. It becomes possible.

【０００８】[0008]

【発明の実施の形態】以下、本発明の実施の形態につい
て、図面に基づいて具体的に説明する。図１及び図２
は、本発明による音声認識作動装置の第１の実施の形態
に係る携帯型電話機１０を説明するために参照する図で
ある。Embodiments of the present invention will be specifically described below with reference to the drawings. 1 and 2
FIG. 1 is a diagram referred to for describing a portable telephone 10 according to a first embodiment of a voice recognition operating device according to the present invention.

【０００９】図１に示す携帯型電話機１０においては、
相手方と通話するときは音声が入力された音声入力部１
２（マイクロフォン）から音声信号が通話回路３４に送
られ、さらに音声信号は通話回路３４から高周波部２４
に送られてそのアンテナ２４ａから相手方に向けて最寄
りの基地局に無線が発信される。In the portable telephone 10 shown in FIG.
Voice input unit 1 to which voice is input when talking with the other party
2 (microphone) transmits a voice signal to the communication circuit 34, and further the voice signal is transmitted from the communication circuit 34 to the high frequency unit 24.
Is transmitted to the nearest base station from the antenna 24a toward the other party.

【００１０】相手方からの応答が基地局から無線でアン
テナ２４ａに受信されると、その受信信号は高周波部２
４から通話回路３４を経て音声出力部２６（スピーカ
ー）に送られ、音声出力部２６からそれを耳に当ててい
る話者（ユーザー）に応答の音声が聞こえてくるように
なっている。When a response from the other party is received by the antenna 24a from the base station by radio, the received signal is
4 is sent to the voice output unit 26 (speaker) via the communication circuit 34, and the voice of the response is heard from the voice output unit 26 to the speaker (user) wearing the ear.

【００１１】また相手方に発呼するときは、操作部３６
のキーでダイヤル番号を押すことにより発呼することも
できるが、相手方の名義名が名義名パターン記憶部（音
声パターン登録部）２２に登録されているときは、その
名義名をユーザーが音声入力部１２に向かって発声する
ことによって、自動的に通話回路３４が有するダイヤル
回路により相手方に発呼することができる。When calling the other party, the operation unit 36
A call can be made by pressing the dial number with the key of, but when the other party's nominal name is registered in the nominal name pattern storage unit (voice pattern registration unit) 22, the user inputs the nominal name by voice. By speaking toward the unit 12, a call can be automatically made to the other party by the dial circuit included in the communication circuit 34.

【００１２】この場合は、音声入力部１２からの音声信
号はアンプ部１４によって増幅され、Ａ／Ｄ部１６（ア
ナログ／デジタル変換装置）によってアナログ信号から
デジタル信号に変換され、そのデジタル信号を入力した
音声認識部１８が入力された音声を認識して、名義名パ
ターン記憶部２２に登録されている名義名の中から同一
の音声パターンを照合して選び出す。In this case, the audio signal from the audio input unit 12 is amplified by the amplifier unit 14, converted from an analog signal to a digital signal by the A / D unit 16 (analog / digital converter), and the digital signal is input. The recognized voice recognition unit 18 recognizes the input voice and checks and selects the same voice pattern from the nominal names registered in the nominal name pattern storage unit 22.

【００１３】そして音声認識部１８がその名義名に係る
信号を制御部２０に出力すると、制御部２０はダイヤル
番号記憶部３８に登録されたダイヤル番号の中から、予
めその名義名に対応するよう関係付けられたダイヤル番
号を呼び出して、通話回路３４のダイヤル回路に送るこ
とにより自動的に発呼されるようになっている。When the voice recognition unit 18 outputs a signal related to the nominal name to the control unit 20, the control unit 20 selects one of the dial numbers registered in the dial number storage unit 38 in advance so as to correspond to the nominal name. By calling the associated dial number and sending it to the dial circuit of the call circuit 34, a call is automatically made.

【００１４】携帯型電話機１０はＩＣメモリー等により
構成される録音再生部４０を有しており、通話中又は留
守番モードにしたときの音声を録音し、後で再生するこ
とができる。また携帯型電話機１０は温度測定センサー
４２を有しており、この温度測定センサー４２は環境の
温度を測定して制御部２０に知らせることにより、制御
部２０は表示部３０を構成するＬＣＤの輝度が劣化しな
いように制御することができる。The portable telephone 10 has a recording / reproducing section 40 composed of an IC memory or the like, and can record a voice during a call or in an answering machine mode, and reproduce it later. The portable telephone 10 has a temperature measurement sensor 42. The temperature measurement sensor 42 measures the temperature of the environment and notifies the control unit 20 of the temperature. Can be controlled so as not to deteriorate.

【００１５】そして携帯型電話機１０は、表示部３０に
日時及び時刻表示が可能なようにタイマー４４が制御部
２０に常にクロック信号を出力して、表示部３０に表示
される月日と時刻を進行させるようになっている。In the portable telephone 10, the timer 44 always outputs a clock signal to the control unit 20 so that the date and time can be displayed on the display unit 30 so that the date and time displayed on the display unit 30 can be changed. It is going to progress.

【００１６】本実施の形態においては、図１に示す構成
に追加して、図２に示すように練習用音声合成部４８
（学習メッセージ登録部）が設けられている。この練習
用音声合成部４８は、練習用音声が音声入力部１２に入
力される前に、音声が合成された模範用のモデル音声を
音声出力部２６から出力させるようになっている。In the present embodiment, in addition to the configuration shown in FIG. 1, a practice voice synthesizer 48 as shown in FIG.
(Learning message registration unit) is provided. The practice voice synthesis unit 48 causes the model output model voice synthesized with the voice to be output from the voice output unit 26 before the practice voice is input to the voice input unit 12.

【００１７】この練習用音声合成部４８は音声出力部２
６からそのモデル音声を出力させるが、ＩＣメモリーの
ように抑揚もアクセントもない不自然な音声に比べて、
抑揚もアクセントも考慮に入れた、より自然な音声を音
声出力部２６から出力させることができるような機能を
有している。The practice voice synthesizer 48 includes a voice output unit 2
6 to output the model voice, but compared to unnatural voice without inflection and accent like IC memory,
The voice output unit 26 has a function of outputting a more natural voice that takes into account both intonation and accent.

【００１８】次に、名義名とそのダイヤル番号の登録手
順について、図３のフローチャートに基づいて説明す
る。操作部３６のキーを操作して登録モードを選択する
と、表示部３０には最初に登録初期画面が出て（ステッ
プＳ１）、その次には例えば「名前をキーで入力してく
ださい」と表示される（ステップＳ２）。その表示を見
て例えば「山田」とユーザーが操作部３６のキーで名義
名を入力すると、表示部３０に「ヤマダ」と表示される
（ステップＳ３）と共に、その名義名がダイヤル番号記
憶部３８に登録される。Next, a procedure for registering a name and its dial number will be described with reference to the flowchart of FIG. When the registration mode is selected by operating the keys of the operation unit 36, an initial registration screen is first displayed on the display unit 30 (step S1), and then, for example, "Please input a name with a key" is displayed. Is performed (step S2). When the user sees the display and inputs, for example, "Yamada" with the key of the operation unit 36, "Yamada" is displayed on the display unit 30 (step S3), and the nominal name is stored in the dial number storage unit 38. Registered in.

【００１９】次に表示部３０に「ダイヤル番号をキーで
入力してください」と表示され（ステップＳ４）、ユー
ザーがダイヤル番号を入力すると、表示部３０に「ヤマ
ダ０３−○○○○−△△△△」と表示される（ステップ
Ｓ５）と共に、そのダイヤル番号がダイヤル番号記憶部
３８に登録される。Next, "Enter the dial number with the key" is displayed on the display unit 30 (step S4). When the user inputs the dial number, "Yamada 03-OOOO- ○" is displayed on the display unit 30. Is displayed (step S5), and the dialed number is registered in the dialed number storage unit 38.

【００２０】次に表示部３０には「出力音声のように練
習で発音してください」のような表示が出され（ステッ
プＳ６）、その表示に続いて音声出力部２６から予め練
習用音声合成部４８に記憶された、“ヤマダ”という自
然で明瞭な発音が音声出力部２６から出力されて聞こえ
てくる。ユーザーはその発音の仕方をまねして“ヤマ
ダ”と練習で発声してみる。Next, a display such as "Please practice like an output voice in practice" is displayed on the display unit 30 (step S6). The natural and clear pronunciation of “Yamada” stored in the unit 48 is output from the sound output unit 26 and is heard. The user mimics the pronunciation and says "Yamada" in practice.

【００２１】それから表示部３０に「マイクに名義名を
発音してください」と表示され（ステップＳ７）、ここ
で初めて名義名パターン記憶部２２に音声パターンが登
録される名義名として、改めて“ヤマダ”と先に練習し
た発声の仕方でマイクに向かって発音する。Then, "Please pronounce the nominal name on the microphone" is displayed on the display unit 30 (step S7). Here, "Yamada" is newly registered as the nominal name in which the voice pattern is registered in the nominal name pattern storage unit 22 for the first time. "Speak into the microphone in the way you practiced earlier.

【００２２】すると音声入力部１２から音声信号がアン
プ部１４及びＡ／Ｄ部１６を経て音声認識部１８に入力
され、音声認識部１８からの信号により名義名パターン
記憶部２２には“ヤマダ”という音声パターンが正式に
登録される。そして表示部３０には「名義名の登録を終
了しました」と表示されて、一連の登録のための作業を
終了する（ステップＳ８）。Then, a voice signal is input from the voice input unit 12 to the voice recognition unit 18 via the amplifier unit 14 and the A / D unit 16, and “Yamada” is stored in the nominal name pattern storage unit 22 by the signal from the voice recognition unit 18. Is officially registered. Then, a message "Registration of the nominal name has been completed" is displayed on the display unit 30, and a series of operations for registration are completed (step S8).

【００２３】図４，図５は、本発明の第２の実施の形態
について説明するために参照する図である。この第２の
実施の形態においては、図４に示すように、名義名パタ
ーン記憶部２２には名義名パターン記憶部の他に、不特
定パターン記憶部が設けられていて、その不特定パター
ン記憶部には例えば“警察”の“けいさつ”という音声
パターンが登録されている。FIGS. 4 and 5 are views referred to for describing a second embodiment of the present invention. In the second embodiment, as shown in FIG. 4, the nominal name pattern storage unit 22 is provided with an unspecified pattern storage unit in addition to the nominal name pattern storage unit. For example, a voice pattern of "Keisatsu" of "police" is registered in the section.

【００２４】また、ダイヤル番号記憶部３８にはその
“けいさつ”に対応するダイヤル番号“１１０”が登録
されている。さらに録音再生部４０には、登録用音声記
憶部と、練習用音声記憶部が設けられている。In the dial number storage section 38, a dial number "110" corresponding to the "key" is registered. Further, the recording / reproducing unit 40 is provided with a registration voice storage unit and a practice voice storage unit.

【００２５】次に、音声認識の動作手順について、図５
のフローチャートに基づいて説明する。操作部３６を操
作して認識モードを選択すると、まず練習モードの初期
表示画面が表示部３０に出てくる（ステップＳ１）。音
声認識の動作における音声入力について既に習熟してい
るユーザーは、特に練習する必要はないので、パスボタ
ン（キー）を押すことにより練習モードを省略して直ち
に認識モードに移り、音声入力により相手方を発呼する
ことができる。Next, the operation procedure of the speech recognition will be described with reference to FIG.
A description will be given based on the flowchart of FIG. When the recognition mode is selected by operating the operation unit 36, an initial display screen of the practice mode first appears on the display unit 30 (step S1). A user who is already proficient in voice input in the voice recognition operation does not need to practice in particular. By pressing the pass button (key), the user skips the practice mode and immediately shifts to the recognition mode. You can make a call.

【００２６】音声認識の動作における音声入力ついてま
だ習熟していないユーザーは、練習モードを実行するこ
とができる。上記練習モードの表示が表示部３０に出て
くると、次に「音声入力の発音の練習をしましょう」と
いうような表示が表示部３０に出て（ステップＳ２）、
次に例えば「“けいさつ”（著名な学習用名義名）を呼
び出してみましょう」というメッセージが表示部３０に
表示される（ステップＳ３）。A user who has not yet mastered the voice input in the voice recognition operation can execute the practice mode. When the display of the practice mode appears on the display unit 30, a display such as "Let's practice the pronunciation of voice input" appears on the display unit 30 (step S2).
Next, for example, a message “Let's call“ Keisatsu ”(famous learning name)” is displayed on the display unit 30 (step S3).

【００２７】次に「出力音声のように練習で発音してく
ださい」と表示部３０に表示されると共に（ステップＳ
４）、それに続いて録音再生部４０の練習用音声記憶部
に記憶された音声が音声出力部２６から“けいさつ”と
出力されてユーザーに聞こえてくる。Next, a message "Please practice like an output voice" is displayed on the display unit 30 (step S).
4) Subsequently, the voice stored in the practice voice storage unit of the recording / reproducing unit 40 is output as "Keisatsu" from the voice output unit 26 and is heard by the user.

【００２８】その後表示部３０に、「どうぞ発声してく
ださい」と音声の発声待ちの画面が表示され（ステップ
Ｓ５）、それを見たユーザーが音声入力部１２に向かっ
て先に聞こえてきた“ケイサツ”の発音の真似をして発
声する。After that, the display unit 30 displays a screen for waiting for the voice to say "Please say this" (step S5), and the user who has seen it hears the voice input unit 12 earlier. I simulate the pronunciation of "Keisatsu."

【００２９】これにより音声入力部１２から音声信号が
出力されて、アンプ部１４，Ａ／Ｄ部１６を経て音声認
識部１８に送られ、音声認識部１８が名義名パターン記
憶部２２の不特定パターン記憶部内の音声パターンと照
合して、“ケイサツ”の音声パターンを認識して制御部
２０にその“ケイサツ”に係る信号を出力する。制御部
２０はこの信号に基づいてダイヤル番号記憶部３８から
“ケイサツ”に係るダイヤル番号１１０番を呼び出し、
このダイヤル番号１１０番の情報を表示部３０に出力す
る。As a result, a voice signal is output from the voice input unit 12 and sent to the voice recognition unit 18 via the amplifier unit 14 and the A / D unit 16. By collating with the voice pattern in the pattern storage unit, the voice pattern of “Katsu” is recognized, and a signal related to the “Katsu” is output to the control unit 20. The control unit 20 calls the dial number 110 related to “Keisatsu” from the dial number storage unit 38 based on this signal,
The information of the dial number 110 is output to the display unit 30.

【００３０】このため表示部３０に「けいさつ１１０
番」と表示されることにより（ステップＳ６）、ユーザ
ーの発声の仕方が音声認識部１８に適切に認識されたこ
とが分かり、練習モードが終了する（ステップＳ７）。
このようにしてユーザーは発声の仕方を学習してから音
声入力することができ、携帯型電話機１０の音声認識機
能を使いこなすことが可能となる。For this reason, the display unit 30 displays "Keisetsu 110
Is displayed (step S6), it is understood that the user's utterance has been appropriately recognized by the voice recognition unit 18, and the practice mode ends (step S7).
In this way, the user can learn how to make a speech and then input a voice, thereby making it possible to use the voice recognition function of the mobile phone 10.

【００３１】図６は、本発明の第３の実施の形態を説明
するためのフローチャートである。この第３の実施の形
態は、名義名を登録するときにそのテンポを誘導しよう
とするものである。すなわち操作部３６を操作して登録
モードを選択すると、まず登録モードの初期画面が表示
部３０に表示される（ステップＳ１）。FIG. 6 is a flowchart for explaining the third embodiment of the present invention. In the third embodiment, when a nominal name is registered, the tempo is to be derived. That is, when the registration mode is selected by operating the operation unit 36, an initial screen of the registration mode is first displayed on the display unit 30 (step S1).

【００３２】次に表示部３０には、「登録したい名義名
を入力してください」と表示され（ステップＳ２）、ユ
ーザーが操作部３６のキーによりその名義名をカタカナ
で入力すると、次に表示部３０には、「出力音声のテン
ポに合わせて発音してください」と表示される（ステッ
プＳ３）。Next, the display section 30 displays "Please input the name to be registered" (step S2). When the user inputs the name in katakana using the keys of the operation section 36, the display is continued. The unit 30 displays "Please sound according to the tempo of the output sound" (step S3).

【００３３】この後すぐ録音再生部４０の登録用音声記
憶部からの情報が音声出力部２６から音声出力されて、
例えば「ピッ・ピッ・ピッ・・・」というような音声が
出力される。このため、まだ名義名の登録時の音声入力
について習熟していないユーザーは、その「ピッ・ピッ
・ピッ・・・」という音声のテンポに合わせて、名義名
を上手に音声入力部１２に入力することができ、その名
義名を名義名パターン記憶部２２に無事に登録させるこ
とができる（ステップＳ４）。Immediately thereafter, information from the registration voice storage unit of the recording / playback unit 40 is output as voice from the voice output unit 26.
For example, a sound such as “beep, beep,...” Is output. For this reason, a user who has not yet mastered the voice input at the time of registration of the nominal name inputs the nominal name to the voice input unit 12 well according to the tempo of the voice “pip, pip, pip, ...”. And the nominal name can be safely registered in the nominal name pattern storage unit 22 (step S4).

【００３４】図７は、本発明の第４の実施の形態を説明
するためのフローチャートである。この第４の実施の形
態も名義名を登録するときのテンポを誘導しようとする
ものであるが、前記第３の実施の形態のように音声によ
りテンポを誘導するのと異なり、表示部３０に表示され
るドット（丸点）の点滅により発音のテンポの誘導を行
うものである。FIG. 7 is a flowchart for explaining a fourth embodiment of the present invention. The fourth embodiment also seeks to derive a tempo when registering a nominal name, but unlike the third embodiment in which a tempo is induced by voice, the display unit 30 The displayed tempo of the sound is guided by the blinking of the displayed dot (circle).

【００３５】すなわち登録モードの初期画面が表示部３
０に表示された（ステップＳ１）後、次の表示（ステッ
プＳ２）を見て登録したい名義名をキーによりカタカナ
で入力すると、表示部３０にドットの図形Ｄが表示され
ると共に、「下のドットの点滅のテンポに合わせて発音
してしてください」と表示される（ステップＳ３）。ユ
ーザーはその表示部３０のドットの点滅のテンポに合わ
せて名義名を上手に発音して音声入力部１２に入力する
ことができ、その名義名を名義名パターン記憶部２２に
無事に登録させることができる（ステップＳ４）。That is, the initial screen of the registration mode is displayed on the display unit 3.
0 (step S1), and after inputting the name to be registered in katakana using the keys while looking at the next display (step S2), a dot graphic D is displayed on the display unit 30 and " Please sound according to the flashing tempo of the dot "(step S3). The user can well pronounce the nominal name in accordance with the blinking tempo of the dot on the display unit 30 and input the nominal name to the voice input unit 12, and register the nominal name in the nominal name pattern storage unit 22 safely. (Step S4).

【００３６】図８は、本発明の第５の実施の形態を説明
するためのフローチャートである。この第５の実施の形
態も名義名を登録するときのテンポを誘導しようとする
ものであるが、前記実施の形態のように音声やドットの
点滅により行うのと異なり、表示部３０に帯状に連続し
て並んだ複数の四角形の枠Ｓに囲まれた文字を端から順
次変色させていくことにより、その変色のスピードに合
わせて発音のテンポを誘導するものである。FIG. 8 is a flowchart for explaining the fifth embodiment of the present invention. The fifth embodiment also seeks to derive the tempo when registering a nominal name. However, unlike the fifth embodiment, which is performed by flashing a sound or a dot as in the above-described embodiment, the display unit 30 has a band shape. By sequentially changing the color of a character surrounded by a plurality of square frames S arranged in a row from the end, the tempo of sound generation is induced in accordance with the speed of the color change.

【００３７】すなわち登録モードの初期画面が表示部３
０に表示された（ステップＳ１）後、次の表示（ステッ
プＳ２）を見て登録したい名義名をキーで入力すると、
表示部３０に帯状に連続して並んだ、その名義名の文字
の数の四角形の枠Ｓで囲まれた文字が表示されると共
に、「下の四角形の枠の文字の変色のスピードに合わせ
て発音してください」と表示される（ステップＳ３）。
ユーザーはその表示部３０の複数の文字の変色のテンポ
に合わせて、名義名を上手に発音して音声入力部１２に
入力することができ、その名義名を名義名パターン記憶
部２２に無事に登録させることができる（ステップＳ
４）。That is, the initial screen of the registration mode is displayed on the display unit 3.
0 is displayed (Step S1), and the next display (Step S2) is entered by inputting the nominal name to be registered with the key.
The display unit 30 displays the characters surrounded by the rectangular frame S of the number of the characters with the nominal name, which are continuously arranged in a band shape, and “according to the speed of the color change of the characters in the lower rectangular frame. "Please pronounce" is displayed (step S3).
The user can skillfully pronounce the nominal name and input it to the voice input unit 12 in accordance with the discoloration tempo of the plurality of characters on the display unit 30, and the nominal name can be safely stored in the nominal name pattern storage unit 22. Can be registered (Step S
4).

【００３８】以上、本発明の実施の形態について具体的
に述べてきたが、本発明は上記の実施の形態に限定され
るものではなく、本発明の技術的思想に基づいて、その
他にも各種の変更が可能なものである。Although the embodiments of the present invention have been specifically described above, the present invention is not limited to the above-described embodiments, and may be variously modified based on the technical idea of the present invention. Can be changed.

【００３９】[0039]

【発明の効果】以上説明したように、本発明の音声認識
作動装置によれば、登録や音声認識の動作における音声
入力についてまだ習熟していないユーザーであっても、
音声認識部にとって最も認識し易い発声の仕方で音声入
力することを学習できるため、名義名の登録や発呼時の
音声認識のための音声入力をスムーズに行うことが可能
となる。As described above, according to the voice recognition operating device of the present invention, even if the user has not yet mastered the voice input in the registration and voice recognition operations,
Since the voice recognition unit can learn to input a voice in a manner of utterance that is most easily recognized, it is possible to smoothly perform voice input for registration of a nominal name and voice recognition at the time of calling.

[Brief description of the drawings]

【図１】本発明による音声認識作動装置の第１の実施の
形態に係る携帯型電話機１０の構成を示すブロック回路
図である。FIG. 1 is a block circuit diagram showing a configuration of a portable telephone 10 according to a first embodiment of a voice recognition operation device according to the present invention.

【図２】図１における携帯型電話機１０の要部を示すブ
ロック回路図である。FIG. 2 is a block circuit diagram showing a main part of the portable telephone 10 in FIG.

【図３】第１の実施の形態の動作時の表示部３０の表示
内容の変化の流れを示すフローチャートである。FIG. 3 is a flowchart illustrating a flow of a change in display content of a display unit 30 during an operation according to the first exemplary embodiment.

【図４】本発明の第２の実施の形態に係る携帯型電話機
の要部を示すブロック回路図である。FIG. 4 is a block circuit diagram showing a main part of a portable telephone according to a second embodiment of the present invention.

【図５】第２の実施の形態の動作時の表示部３０の表示
内容の変化の流れを示すフローチャートである。FIG. 5 is a flowchart illustrating a flow of a change in display content of a display unit during operation according to the second embodiment;

【図６】本発明の第３の実施の形態の動作時の表示部３
０の表示内容の変化の流れを示すフローチャートであ
る。FIG. 6 shows a display unit 3 during operation according to the third embodiment of the present invention.
11 is a flowchart showing a flow of change of display contents of 0.

【図７】本発明の第４の実施の形態の動作時の表示部３
０の表示内容の変化の流れを示すフローチャートであ
る。FIG. 7 shows a display unit 3 during operation according to the fourth embodiment of the present invention.
11 is a flowchart showing a flow of change of display contents of 0.

【図８】本発明の第５の実施の形態の動作時の表示部３
０の表示内容の変化の流れを示すフローチャートであ
る。FIG. 8 shows a display unit 3 during operation according to the fifth embodiment of the present invention.
11 is a flowchart showing a flow of change of display contents of 0.

[Explanation of symbols]

１０携帯型電話機１２音声入力部１４アンプ部１６Ａ／Ｄ部１８音声認識装置２０制御部２２名義名パターン記憶部２３ノイズ除去部２４高周波部２４ａアンテナ２６音声出力部３０表示部３４通話回路３６操作部３８ダイヤル番号記憶部４０録音再生部４２温度測定センサー４４タイマー４８練習用音声合成部 DESCRIPTION OF SYMBOLS 10 Mobile telephone 12 Audio input part 14 Amplifier part 16 A / D part 18 Speech recognition device 20 Control part 22 Nominal name pattern storage part 23 Noise removal part 24 High frequency part 24a Antenna 26 Audio output part 30 Display part 34 Communication circuit 36 Operation Unit 38 dial number storage unit 40 recording and playback unit 42 temperature measurement sensor 44 timer 48 practice voice synthesis unit

Claims

[Claims]

A voice input unit for inputting a voice of a speaker and outputting an electrical voice signal; a voice recognition unit for inputting the voice signal and recognizing the voice of the speaker; A voice pattern registration unit for registering the voice pattern of the name of the person, a learning message registration unit in which a learning message has been registered in advance, a voice output unit for outputting a call voice from the other party, and the voice recognition unit. A display unit that displays the nominal name and / or a dial number corresponding thereto, and a control unit that controls registration, recognition, reading, display, and other predetermined processing operations based on a signal from the voice recognition unit. The voice recognition operation device according to claim 1, wherein the voice output unit outputs the learning message when the name is input by voice.

2. A voice input unit for inputting a voice of a speaker and outputting an electrical voice signal; a voice recognition unit for inputting the voice signal and recognizing the voice of the speaker; A voice pattern registration unit for registering a voice pattern of a nominal name of a person, a recording / reproducing unit in which a prominent learning nominal name is registered in advance, a voice output unit for outputting a call voice from the other party, and the voice recognition unit. A display unit for displaying the recognized nominal name and / or a dial number corresponding thereto, and a control for controlling registration, recognition, reading, display, and other predetermined processing operations based on a signal from the voice recognition unit. And a voice recognition actuating device comprising: a voice input of the famous learning nominal name at the time of voice input of the nominal name, and a prominent dial number corresponding to the nominal name is displayed on the display unit for learning. Speech recognition actuation device, characterized in that it is.

3. The voice output unit outputs the pre-registered famous learning nominal name as a model before inputting the famous learning nominal name at the time of registration of the nominal voice pattern. The voice recognition operating device according to claim 2, wherein

4. A voice input unit for inputting a voice of a speaker and outputting an electrical voice signal; a voice recognition unit for receiving the voice signal and recognizing the voice of the speaker; A voice pattern registration unit for registering a voice pattern of a nominal name of a person, a voice output unit for outputting a call voice from the other party, and displaying the nominal name recognized by the voice recognition unit and a dial number corresponding thereto. A display unit, and a control unit that controls registration, recognition, reading, display, and other predetermined processing operations based on a signal from the voice recognition unit. A voice recognition operating device, wherein a display for guidance for adjusting the tempo of utterance at the time of voice input is displayed on the display unit.

5. A voice input unit for inputting a voice of a speaker and outputting an electrical voice signal; a voice recognition unit for inputting the voice signal and recognizing the voice of the speaker; A voice pattern registration unit for registering a voice pattern of a nominal name of a person, a voice output unit for outputting a call voice from the other party, and displaying the nominal name recognized by the voice recognition unit and a dial number corresponding thereto. A display unit, and a control unit that controls registration, recognition, reading, display, and other predetermined processing operations based on a signal from the voice recognition unit. A voice recognition operating device, wherein at the time, a voice for guidance for adjusting the tempo of utterance at the time of voice input is output from the voice output unit.