JP7547857B2

JP7547857B2 - Work vehicle voice operation system, work vehicle voice operation method and program

Info

Publication number: JP7547857B2
Application number: JP2020140251A
Authority: JP
Inventors: 友理白川; 俊彦三木; 清孝河合
Original assignee: Tadano Ltd
Current assignee: Tadano Ltd
Priority date: 2020-08-21
Filing date: 2020-08-21
Publication date: 2024-09-10
Anticipated expiration: 2040-08-21
Also published as: JP2022035731A

Description

The present invention relates to a work vehicle voice operation system, a work vehicle voice operation method, and a program.

Some work vehicles allow their various devices to be operated by voice.

For example, Patent Document 1 discloses that the devices of a crane, such as the boom and the hoisting device, are operated by voice using a voice operation device that recognizes voice instructions uttered by an operator.

Patent Document 1: JP 2017-160007 A

In a work vehicle, however, operations may be restricted depending on the attitude or state of the vehicle. For example, while the jacks of the outriggers are not set, boom operation is restricted and the boom cannot be operated.

The voice operation device described in Patent Document 1 merely performs voice recognition and voice analysis on a voice instruction to extract operation information, and then performs the operation based on the extracted operation information. Therefore, if the operator gives a voice instruction for a restricted operation without noticing that the operation is restricted by the attitude or state of the work vehicle, the voice operation device described in Patent Document 1 may attempt the operation using the operation information extracted from that voice instruction.

To ensure safety, such an operation by the voice operation device is blocked by the controller of the work vehicle, but in that case the operator's voice instruction itself is wasted. Moreover, from the operator's point of view, voice operations that are executed are mixed with voice operations that are not, which makes voice operation difficult.

The present invention has been made to solve the above problem, and its object is to provide a work vehicle voice operation system, a work vehicle voice operation method, and a program that allow easy voice operation even when operation of the work vehicle is restricted.

In order to achieve the above object, a work vehicle voice operation system according to a first aspect of the present invention comprises:
a voice recognition result acquisition unit that acquires a voice recognition result from a voice recognition device that recognizes voice instructions uttered by an operator to operate a work vehicle;
an executable operation selection unit that selects, based on the attitude or state of the work vehicle detected by an attitude detector or state detector provided in the work vehicle, an executable operation that can be executed under the attitude or the state;
a response generation unit that determines whether the voice recognition result acquired by the voice recognition result acquisition unit includes an instruction for operating the work vehicle; if it determines that the instruction is not included, generates a response by referring to a response database that stores responses to vocabulary, other than the instruction, included in the voice recognition result; if it determines that the instruction is included, further determines whether the instruction is an instruction for a restricted operation that is restricted under the attitude or the state; if it determines that the instruction is an instruction for the restricted operation, generates a response recommending the executable operation instead of the instruction; and if it determines that the instruction is not an instruction for the restricted operation, generates a response to the effect that the operation of the instruction is to be executed; and
a presentation unit that presents the response generated by the response generation unit.
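The three-way branching performed by the response generation unit can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and does not come from the patent: the `COMMANDS` vocabulary, the `RESPONSE_DB` contents, the substring matching, the operation names, and the wording of the replies.

```python
from dataclasses import dataclass
from typing import List, Optional, Set

# Hypothetical vocabulary: spoken phrase -> operation name (illustrative only).
COMMANDS = {
    "raise the boom": "boom_raise",
    "extend the outriggers": "outrigger_extend",
}

# Hypothetical response database for utterances that carry no instruction.
RESPONSE_DB = {
    "hello": "Hello. Which operation would you like?",
}


@dataclass
class Response:
    text: str                        # what the presentation unit would present
    operation: Optional[str] = None  # operation to execute, if any


def find_instruction(recognized: str) -> Optional[str]:
    """Return the operation named in the recognized text, or None."""
    lowered = recognized.lower()
    for phrase, operation in COMMANDS.items():
        if phrase in lowered:
            return operation
    return None


def generate_response(recognized: str, restricted: Set[str],
                      executable: List[str]) -> Response:
    """Branch on: no instruction / restricted instruction / allowed instruction."""
    operation = find_instruction(recognized)
    if operation is None:
        # No instruction: answer from the response database.
        lowered = recognized.lower()
        for word, reply in RESPONSE_DB.items():
            if word in lowered:
                return Response(reply)
        return Response("Sorry, I did not understand that.")
    if operation in restricted:
        # Restricted instruction: recommend an executable operation instead.
        if executable:
            return Response(
                f"{operation} is restricted now. How about {executable[0]}?")
        return Response(f"{operation} is restricted now.")
    # Allowed instruction: announce execution and carry the operation along.
    return Response(f"Executing {operation}.", operation)
```

Only the last branch returns an operation, so a caller would generate an operation signal only for instructions that are not restricted.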

A work vehicle voice operation system according to a second aspect of the present invention comprises:
a voice recognition result acquisition unit that acquires a voice recognition result from a voice recognition device that recognizes voice instructions uttered by an operator to operate a work vehicle;
an executable operation selection unit that selects, based on the attitude or state of the work vehicle detected by an attitude detector or state detector provided in the work vehicle, an executable operation that can be executed under the attitude or the state;
a response generation unit that, when the voice recognition result acquired by the voice recognition result acquisition unit includes an instruction for operating the work vehicle and the instruction is an instruction for a restricted operation that is restricted under the attitude or the state, generates a response recommending the executable operation instead of the instruction, and, when the voice recognition result includes an instruction for operating the work vehicle and the instruction is not an instruction for the restricted operation, generates a response to the effect that the operation of the instruction is to be executed;
a presentation unit that presents the response generated by the response generation unit; and
a storage unit that stores an operation history by storing the operation content of the voice instruction each time the response generation unit generates a response to the effect that an operation instructed by the operator by voice is to be executed,
wherein, when the voice recognition result acquired by the voice recognition result acquisition unit includes an instruction for operating the work vehicle, the instruction is an instruction for the restricted operation, and the executable operation selection unit has selected a plurality of executable operations, the response generation unit generates a response recommending, from among the plurality of executable operations, the executable operation that corresponds to the most recent operation or the most frequent operation in the history data stored in the storage unit.
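The history-based choice among several executable operations can be sketched as follows. This is a minimal Python illustration under stated assumptions: the function name `recommend`, the `mode` parameter, the operation names, and the fallback used when no executable operation appears in the history are all hypothetical and are not specified in the patent.

```python
from collections import Counter
from typing import List, Optional


def recommend(executable: List[str], history: List[str],
              mode: str = "recent") -> Optional[str]:
    """Pick one executable operation using the stored operation history.

    history is ordered from oldest to newest.
    mode is "recent" (most recent operation) or "frequent" (most frequent).
    """
    if not executable:
        return None
    # Keep only past operations that are executable right now.
    candidates = [op for op in history if op in executable]
    if not candidates:
        # Assumption: with no matching history, fall back to the first candidate.
        return executable[0]
    if mode == "recent":
        return candidates[-1]
    if mode == "frequent":
        return Counter(candidates).most_common(1)[0][0]
    raise ValueError(f"unknown mode: {mode}")
```

For example, with a history in which a jack operation occurs three times and an outrigger beam operation was performed last, `mode="recent"` would recommend the beam operation and `mode="frequent"` the jack operation.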

Furthermore, a voice operation method for a work vehicle according to a third aspect of the present invention comprises:
a voice recognition step in which a first computer or a second computer performs voice recognition on voice instructions uttered by an operator to operate a work vehicle;
an executable operation selection step in which the second computer selects, based on the attitude or state of the work vehicle detected by an attitude detector or state detector provided in the work vehicle, an executable operation that can be executed under the attitude or the state; and
a response generation step in which the second computer determines whether the voice recognition result obtained in the voice recognition step includes an instruction for operating the work vehicle; if it determines that the instruction is not included, generates a response by referring to a response database that stores responses to vocabulary, other than the instruction, included in the voice recognition result; if it determines that the instruction is included, further determines whether the instruction is an instruction for a restricted operation that is restricted under the attitude or the state; if it determines that the instruction is an instruction for the restricted operation, generates a response recommending the executable operation instead of the instruction; and if it determines that the instruction is not an instruction for the restricted operation, generates a response to the effect that the operation of the instruction is to be executed.

A program for operating a work vehicle according to a fourth aspect of the present invention is a program for voice-operating the work vehicle using a microphone that converts voice instructions uttered by an operator to operate the work vehicle into electrical signals and outputs them as voice data, and a voice recognition device that performs voice recognition on the voice data, the program causing a computer to execute:
a voice recognition result acquisition step of acquiring a voice recognition result from the voice recognition device;
an executable operation selection step of selecting, based on the attitude or state of the work vehicle detected by an attitude detector or state detector provided in the work vehicle, an executable operation that can be executed under the attitude or the state; and
a response generation step of determining whether the voice recognition result acquired in the voice recognition result acquisition step includes an instruction for operating the work vehicle; if it is determined that the instruction is not included, generating a response by referring to a response database that stores responses to vocabulary, other than the instruction, included in the voice recognition result; if it is determined that the instruction is included, further determining whether the instruction is an instruction for a restricted operation that is restricted under the attitude or the state; if it is determined that the instruction is an instruction for the restricted operation, generating a response recommending the executable operation instead of the instruction; and if it is determined that the instruction is not an instruction for the restricted operation, generating a response to the effect that the operation of the instruction is to be executed.

A voice operation method for a work vehicle according to a fifth aspect of the present invention comprises:
a voice recognition step in which a first computer or a second computer performs voice recognition on voice instructions uttered by an operator to operate a work vehicle;
an executable operation selection step in which the second computer selects, based on the attitude or state of the work vehicle detected by an attitude detector or state detector provided in the work vehicle, an executable operation that can be executed under the attitude or the state;
a response generation step in which, when the voice recognition result obtained in the voice recognition step includes an instruction for operating the work vehicle and the instruction is an instruction for a restricted operation that is restricted under the attitude or the state, the second computer generates a response recommending the executable operation instead of the instruction, and, when the voice recognition result includes an instruction for operating the work vehicle and the instruction is not an instruction for the restricted operation, the second computer generates a response to the effect that the operation of the instruction is to be executed; and
a storage step in which, each time a response to the effect that an operation instructed by the operator by voice is to be executed is generated in the response generation step, the second computer stores the operation content of the voice instruction in a storage device, thereby storing an operation history,
wherein, in the response generation step, when the voice recognition result obtained in the voice recognition step includes an instruction for operating the work vehicle, the instruction is an instruction for the restricted operation, and a plurality of executable operations have been selected in the executable operation selection step, a response is generated recommending, from among the plurality of executable operations, the executable operation that corresponds to the most recent operation or the most frequent operation in the history data stored in the storage device.

Furthermore, a program for operating a work vehicle according to a sixth aspect of the present invention is a program for voice-operating the work vehicle using a microphone that converts voice instructions uttered by an operator to operate the work vehicle into electrical signals and outputs them as voice data, a voice recognition device that performs voice recognition on the voice data, and a storage device, the program causing a computer to execute:
a voice recognition result acquisition step of acquiring a voice recognition result from the voice recognition device;
an executable operation selection step of selecting, based on the attitude or state of the work vehicle detected by an attitude detector or state detector provided in the work vehicle, an executable operation that can be executed under the attitude or the state;
a response generation step of generating a response recommending the executable operation instead of the instruction when the voice recognition result acquired in the voice recognition result acquisition step includes an instruction for operating the work vehicle and the instruction is an instruction for a restricted operation that is restricted under the attitude or the state, and generating a response to the effect that the operation of the instruction is to be executed when the voice recognition result includes an instruction for operating the work vehicle and the instruction is not an instruction for the restricted operation; and
a storage step of storing an operation history by storing the operation content of the voice instruction in the storage device each time a response to the effect that an operation instructed by the operator by voice is to be executed is generated in the response generation step,
wherein, in the response generation step, when the voice recognition result acquired in the voice recognition result acquisition step includes an instruction for operating the work vehicle, the instruction is an instruction for the restricted operation, and a plurality of executable operations have been selected in the executable operation selection step, a response is generated recommending, from among the plurality of executable operations, the executable operation that corresponds to the most recent operation or the most frequent operation in the history data stored in the storage device.

According to the configuration of the present invention, when the voice recognition result includes an instruction for operating the work vehicle and that instruction is an instruction for a restricted operation, the response generation unit generates a response recommending an executable operation instead of the instruction, and the presentation unit presents the generated response. Therefore, even if the operator mistakenly gives a voice instruction for a restricted operation, an executable operation is presented as an alternative, so voice operation can proceed smoothly. As a result, voice operation is easy.

FIG. 1 is a system configuration diagram of a work vehicle voice operation system according to Embodiment 1 of the present invention.
FIG. 2 is a side view of an aerial work vehicle that is the operation target of the work vehicle voice operation system.
FIG. 3 is a block diagram of a control unit of a mobile terminal provided in the work vehicle voice operation system.
FIG. 4 is an example of a data structure diagram of restricted operation information acquired by an executable operation selection unit of the control unit.
FIG. 5 is an example of a data structure diagram of a work vehicle operation database stored in a storage unit.
FIG. 6 is an example of a data structure diagram of a command dictionary stored in the storage unit.
FIG. 7 is a flowchart of a work vehicle voice operation process performed by the work vehicle voice operation system.
FIG. 8 is a block diagram of a control unit of a mobile terminal provided in a work vehicle voice operation system according to Embodiment 2 of the present invention.

A work vehicle voice operation system, a work vehicle voice operation method, and a program according to embodiments of the present invention are described in detail below with reference to the drawings. In the drawings, identical or equivalent parts are given the same reference numerals.

(Embodiment 1)
The work vehicle voice operation system according to Embodiment 1 is a system for voice-operating an aerial work vehicle using a mobile terminal. First, the configuration of the entire system is described with reference to FIGS. 1 and 2.

FIG. 1 is a system configuration diagram of a work vehicle voice operation system 1A according to Embodiment 1 of the present invention. FIG. 2 is a side view of an aerial work vehicle 5 that is the operation target of the work vehicle voice operation system 1A.

As shown in FIG. 1, the work vehicle voice operation system 1A comprises a mobile terminal 2A into which voice instructions for operating the aerial work vehicle 5 are input, a server 3 that performs voice recognition on the input voice, and a controller 4A that controls the aerial work vehicle 5 based on the result of the voice recognition.

The mobile terminal 2A is a small, lightweight information terminal that an operator can carry, for example a smartphone or a tablet computer. The mobile terminal 2A includes a microphone 21, a control unit 22, a speaker 23, a storage unit 24, and a communication unit 25.

The microphone 21 converts sound into an electrical signal and outputs it as voice data. When the operator carrying the mobile terminal 2A utters a voice instruction to operate the aerial work vehicle 5, the microphone 21 outputs that voice instruction as voice data and transmits the voice data to the control unit 22.

The control unit 22 includes a microcomputer having a CPU (Central Processing Unit) that performs arithmetic processing and memory including a ROM (Read Only Memory) and a RAM (Random Access Memory). The CPU controls each part of the mobile terminal 2A by reading a program stored in the ROM or the storage unit 24 into the RAM and executing it. For example, the control unit 22 performs a work vehicle voice operation process when the CPU executes a work vehicle voice operation program.

Specifically, the control unit 22 controls the communication unit 25 to transmit the voice data of the operator's voice instruction input to the microphone 21 to the server 3, and receives the voice recognition result transmitted from the server 3. Based on that voice recognition result, the control unit 22 generates a response to the operator's voice instruction and outputs the generated response data to the speaker 23. The control unit 22 also generates, based on the voice recognition result from the server 3, an operation signal for operating the aerial work vehicle 5, and passes the generated operation signal to the communication unit 25.
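The data flow through the control unit 22 described above can be sketched as a small Python function. This is only an illustration: the function name `handle_utterance` and the four injected callables are hypothetical stand-ins for the server 3, the response generation, the speaker 23, and the communication unit 25, and the text does not prescribe this interface.

```python
from typing import Callable, Optional, Tuple


def handle_utterance(
    audio: bytes,
    recognize: Callable[[bytes], str],
    respond: Callable[[str], Tuple[str, Optional[str]]],
    speak: Callable[[str], None],
    send_operation: Callable[[str], None],
) -> None:
    """Process one voice instruction from microphone input to operation signal."""
    text = recognize(audio)            # stand-in for voice recognition by server 3
    reply, operation = respond(text)   # response generation in control unit 22
    speak(reply)                       # presentation through speaker 23
    if operation is not None:
        # Only an executable instruction yields an operation signal,
        # which communication unit 25 would forward to controller 4A.
        send_operation(operation)
```

Passing the collaborators in as callables keeps the sketch runnable without a network or audio hardware; in a real terminal they would wrap the wireless link and the audio output.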

The speaker 23 converts the electrical signal of the response data into sound and outputs it. When the speaker 23 receives the response data from the control unit 22, it converts the response data into sound and outputs it, thereby presenting the generated response to the operator.

The storage unit 24 has a storage device such as an EEPROM (Electrically Erasable Programmable Read-Only Memory) or flash memory, and stores the work vehicle voice operation program described above.

The communication unit 25 includes a wireless communication circuit and an antenna (neither shown), and connects to a network 100 such as the Internet, a local area network, or an in-vehicle network to perform wireless data communication with the server 3 and the controller 4A. For example, the communication unit 25 transmits the voice data of a voice instruction received from the control unit 22 to the server 3, and transmits an operation signal received from the control unit 22 to the controller 4A.

The server 3 includes a control unit 31, a storage unit 32, and a communication unit 33. The control unit 31 has a CPU, a ROM, and a RAM (none shown); the CPU reads programs and data stored in the ROM and uses the RAM as a work area to perform overall control of the server 3. The storage unit 32 has an EEPROM, flash memory, or the like (not shown), and stores the programs for the various processes, their data, and the various data that are generated and acquired. The communication unit 33 connects to the network 100 and communicates with the mobile terminal 2A and the controller 4A.

The server 3 performs voice recognition processing when the control unit 31 executes a voice recognition program stored in the storage unit 32. Specifically, the server 3 receives the voice data of the operator's voice instruction from the communication unit 25 of the mobile terminal 2A and performs voice recognition on that voice data. The server 3 then transmits the voice recognition result to the communication unit 25 of the mobile terminal 2A.

The controller 4A is provided in the aerial work vehicle 5 and controls the devices of the aerial work vehicle 5.

Specifically, although not shown, the controller 4A includes a computer having a CPU that performs arithmetic processing and memory including a ROM and a RAM. In the controller 4A, the CPU reads an operating program stored in the storage device 61 shown in FIG. 1 or in the ROM (not shown) into the RAM and executes it. The controller 4A thereby drives the devices shown in FIG. 2, such as the outriggers 52A-52D, the swivel base 53, the boom 54, and the bucket 55.

More specifically, as shown in FIG. 1, the aerial work vehicle 5 includes an operating device 56 for operating these devices and a drive unit 57 for driving them. When the CPU executes the operating program, the controller 4A acquires the operation information output from the levers, switches, and the like (not shown) of the operating device 56 and actuates the drive unit 57 based on that operation information. The controller 4A thus drives the devices of the aerial work vehicle 5 in response to lever operations, switch on/off operations, and the like on the operating device 56.

The controller 4A also determines the attitude or state of the aerial work vehicle 5 based on the detection results of the detectors 58A-58E. To ensure safety, the controller 4A then determines, from the determined attitude or state, the operations that should be restricted (hereinafter referred to as restricted operations).

Specifically, as shown in FIGS. 1 and 2, the aerial work vehicle 5 includes a detector 58A that detects the extension and retraction of the beams and jacks of the outriggers 52A-52D, a detector 58B that detects the swing angle of the swivel base 53, a detector 58C that detects the raising/lowering angle of the boom 54 relative to the vehicle 51, a detector 58D that detects the length of the boom 54, which changes as the boom 54 extends and retracts, and a detector 58E that detects the swing angle of the bucket 55 relative to the boom 54.

For example, the controller 4A determines the set state of the jacks of the outriggers 52A-52D based on the detection result of the detector 58A. If the jacks are not set (that is, the floats are not on the ground), the controller 4A designates the swing, raising/lowering, and extension/retraction operations of the boom 54 (hereinafter referred to as boom operations) as restricted operations in order to prevent the vehicle from tipping over.

The controller 4A also determines the attitude of the boom 54 based on the detection results of the detectors 58B-58E. If the determined attitude of the boom 54 is one in which swinging the boom 54 would cause the bucket 55 to interfere with the vehicle 51, the controller 4A designates boom operations as restricted operations in order to prevent such interference.

In addition, if the determined attitude of the boom 54 is a working attitude, that is, an unstowed attitude, the controller 4A designates the extension and stowing of the beams and the extension and retraction of the jacks of the outriggers 52A-52D (hereinafter referred to as outrigger operations) as restricted operations in order to prevent the vehicle from tipping over.

Furthermore, the controller 4A determines the on/off state of a stop switch (not shown) from the output of the operating device 56. For example, if the stop switch is on, the controller 4A treats the vehicle as having been switched to a state in which it should be stopped, such as an emergency, and designates both boom operations and outrigger operations as restricted operations.

コントローラ４Ａは、このような規制操作とされた操作がある場合、操作装置５６のレバーに連動する駆動部５７の作動を制限する。これにより、コントローラ４Ａは、高所作業車５が規制操作で操作されることを防ぐ。その結果、安全を確保する。 When any operation has been designated as a restricted operation in this way, the controller 4A limits the operation of the drive unit 57 linked to the lever of the operating device 56. This prevents the aerial work vehicle 5 from being operated with a restricted operation, thereby ensuring safety.
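The four restriction rules described above can be summarized in a short Python sketch. This is only an illustration under stated assumptions: the `VehicleState` fields and the function name are hypothetical, and the source does not say how the controller 4A represents these conditions internally.

```python
from dataclasses import dataclass

# Operation groups named in the text.
BOOM_OPERATION = "boom operation"            # slewing, raising/lowering, telescoping of boom 54
OUTRIGGER_OPERATION = "outrigger operation"  # beam extension/stowing, jack extension/retraction


@dataclass
class VehicleState:
    """Hypothetical summary of what the controller derives from its inputs."""
    jacks_installed: bool         # from detector 58A
    bucket_would_interfere: bool  # from detectors 58B-58E (bucket 55 vs. vehicle 51)
    boom_stowed: bool             # from detectors 58B-58E
    stop_switch_on: bool          # from the operating device 56


def restricted_operations(state: VehicleState) -> set:
    """Return the operation groups that are restricted in the given state."""
    restricted = set()
    if not state.jacks_installed:
        restricted.add(BOOM_OPERATION)       # prevent tipping over
    if state.bucket_would_interfere:
        restricted.add(BOOM_OPERATION)       # prevent bucket/vehicle interference
    if not state.boom_stowed:
        restricted.add(OUTRIGGER_OPERATION)  # prevent tipping over
    if state.stop_switch_on:
        restricted.update({BOOM_OPERATION, OUTRIGGER_OPERATION})
    return restricted
```

For instance, a state with the jacks not installed and the boom stowed yields only the boom operation as restricted.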

一方、コントローラ４Ａは、図１に示すように、上述した携帯端末２Ａとデータ通信を行うため、高所作業車５に設けられた通信装置６２と接続されている。これにより、コントローラ４Ａは、通信装置６２を介して、ネットワーク１００に接続される。そして、携帯端末２Ａとデータ通信を行う。 As shown in FIG. 1, the controller 4A is connected to a communication device 62 provided on the aerial work platform vehicle 5 in order to perform data communication with the above-mentioned portable terminal 2A. As a result, the controller 4A is connected to the network 100 via the communication device 62. Then, data communication is performed with the portable terminal 2A.

例えば、コントローラ４Ａは、高所作業車５を操作するため、携帯端末２Ａの制御部２２が生成した操作信号を受信する。コントローラ４Ａは、操作信号を受信した場合、その操作信号に基づいて、駆動部５７を作動させる。これにより、アウトリガ５２Ａ－５２Ｄ、旋回台５３、ブーム５４及び、バケット５５等の高所作業車５の各部装置を駆動する。その結果、携帯端末２Ａの操作信号によって、高所作業車５が操作される。 For example, the controller 4A receives an operation signal generated by the control unit 22 of the mobile terminal 2A to operate the vehicle for aerial work 5. When the controller 4A receives an operation signal, it activates the drive unit 57 based on the operation signal. This drives each device of the vehicle for aerial work 5, such as the outriggers 52A-52D, the swivel base 53, the boom 54, and the bucket 55. As a result, the vehicle for aerial work 5 is operated by the operation signal of the mobile terminal 2A.

なお、コントローラ４Ａは、過負荷防止装置としても機能する。すなわち、コントローラ４Ａは、検出器５８Ａ－５８Ｅの出力に基づいて旋回台５３の支点回りのモーメントを求め、求めたモーメントが限界モーメントを超えている場合に、駆動部５７を停止して、過負荷を防止する。 The controller 4A also functions as an overload prevention device. That is, the controller 4A calculates the moment around the fulcrum of the swivel base 53 based on the output of the detectors 58A-58E, and if the calculated moment exceeds the limit moment, it stops the drive unit 57 to prevent overload.
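The overload check amounts to comparing a computed moment with a limit moment. The Python sketch below illustrates only that comparison. The moment formula (load times horizontal reach) is a simplifying assumption made for this example and is not taken from the source, which does not state how the controller 4A derives the moment from the detector outputs; all names are hypothetical.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2


def boom_moment(load_kg: float, boom_length_m: float, elevation_deg: float) -> float:
    """Assumed simplified moment about the slewing fulcrum, in N*m.

    Treats the load as a point mass at the boom tip and ignores the boom's own weight.
    """
    reach_m = boom_length_m * math.cos(math.radians(elevation_deg))
    return load_kg * G * reach_m


def drive_permitted(moment_nm: float, limit_moment_nm: float) -> bool:
    """The drive unit is stopped when the moment exceeds the limit moment."""
    return moment_nm <= limit_moment_nm
```

Under this assumption, lowering the boom at constant length increases the horizontal reach and therefore the moment, so the same load can pass the check at a steep angle and fail it at a shallow one.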

このように、作業車音声操作システム１Ａでは、オペレータの音声指示が携帯端末２Ａのマイクロフォン２１に入力されると、サーバー３がその音声指示の音声データを音声認識し、携帯端末２Ａの制御部２２が音声認識結果に基づいて音声指示に対する応答を生成する。また、音声認識結果に基づいて操作信号を生成する。そして、コントローラ４Ａがその操作信号に基づいて高所作業車５の各部装置を作動させる。 In this way, in the work vehicle voice operation system 1A, when the operator's voice instructions are input to the microphone 21 of the mobile terminal 2A, the server 3 performs voice recognition on the voice data of the voice instructions, and the control unit 22 of the mobile terminal 2A generates a response to the voice instructions based on the voice recognition result. It also generates an operation signal based on the voice recognition result. The controller 4A then operates each device of the aerial work vehicle 5 based on the operation signal.

しかし、上述したように、コントローラ４Ａは、安全を確保するため、高所作業車５の姿勢又は状態に応じた規制操作によって高所作業車５が操作されることを防いでいる。このため、オペレータが規制操作に気付かないで規制操作を実行させる音声指示を行ってしまうと、その音声指示をした操作が実行されず、オペレータが混乱してしまうことがある。また、オペレータが規制操作に気付かないと、実行される音声操作と実行されない音声操作が混在していることになってしまい、音声操作が難しくなってしまう。 However, as described above, in order to ensure safety, the controller 4A prevents the aerial work vehicle 5 from being operated with operations that are restricted according to its posture or state. For this reason, if the operator, unaware of the restriction, issues a voice instruction for a restricted operation, the instructed operation is not executed, which may confuse the operator. Furthermore, if the operator is unaware of which operations are restricted, some voice operations are executed while others are not, which makes voice operation difficult.

このような問題を解決するため、携帯端末２Ａの制御部２２は、オペレータが音声指示した操作が規制操作である場合に、その音声指示した操作の代わりとなる操作がある旨のメッセージを音声で知らせる作業車音声操作処理を行う。制御部２２は、この作業車音声操作処理を行うために、ソフトウエアとして構成される処理ブロックを有する。続いて、図３－図６を参照して、制御部２２の処理ブロックについて説明する。また、あわせて、サーバー３の制御部３１の処理ブロックについても説明する。 To solve this problem, the control unit 22 of the mobile terminal 2A performs work vehicle voice operation processing which, when the operation instructed by the operator's voice is a restricted operation, announces by voice a message that an alternative to the instructed operation is available. To perform this processing, the control unit 22 has processing blocks implemented in software. Next, the processing blocks of the control unit 22 are described with reference to Figures 3 to 6, together with the processing blocks of the control unit 31 of the server 3.

図３は、作業車音声操作システム１Ａが備える携帯端末２Ａの制御部２２と記憶部２４のブロック図である。図４は、制御部２２が備える実行可能操作選定部２２２が取得する規制操作情報７１のデータ構成図の一例である。図５は、記憶部２４に格納された作業車操作データベース２４１のデータ構成図の一例である。図６は、記憶部２４に格納されたコマンド辞書２４２のデータ構成図の一例である。なお、図３では、作業車音声操作処理の理解を容易にするため、サーバー３の制御部３１と記憶部３２の処理ブロックも示している。 Figure 3 is a block diagram of the control unit 22 and memory unit 24 of the mobile terminal 2A provided in the work vehicle voice operation system 1A. Figure 4 is an example of a data structure diagram of restricted operation information 71 acquired by the executable operation selection unit 222 provided in the control unit 22. Figure 5 is an example of a data structure diagram of the work vehicle operation database 241 stored in the memory unit 24. Figure 6 is an example of a data structure diagram of the command dictionary 242 stored in the memory unit 24. Note that Figure 3 also shows processing blocks of the control unit 31 and memory unit 32 of the server 3 to make it easier to understand the work vehicle voice operation processing.

図３に示すように、携帯端末２Ａの制御部２２は、音声認識結果取得部２２１、実行可能操作選定部２２２及び、応答生成部２２３を備える。 As shown in FIG. 3, the control unit 22 of the mobile terminal 2A includes a voice recognition result acquisition unit 221, an executable operation selection unit 222, and a response generation unit 223.

音声認識結果取得部２２１は、音声データをサーバー３に送信し、そのサーバー３から音声データの音声認識結果を取得する。 The voice recognition result acquisition unit 221 transmits voice data to the server 3 and acquires the voice recognition results of the voice data from the server 3.

詳細には、音声認識結果取得部２２１は、音声認識させる音声データを得るため、携帯端末２Ａに設けられたマイクロフォン２１から、オペレータが発した声の音声データを取得する。具体的には、音声認識結果取得部２２１は、オペレータが発声すると、音声データの取得を開始し、オペレータの発声が途切れるまで音声データを取得する。これにより、音声認識結果取得部２２１は、一続きの音声データを取得する。音声認識結果取得部２２１は、一続きの音声データを取得する毎に、取得した音声データを、図１に示す通信部２５、３３を介してサーバー３の制御部３１に送信する。 In detail, the voice recognition result acquisition unit 221 acquires voice data of the voice uttered by the operator from the microphone 21 provided in the mobile terminal 2A to obtain voice data for voice recognition. Specifically, when the operator speaks, the voice recognition result acquisition unit 221 starts acquiring voice data, and acquires voice data until the operator's speech is interrupted. In this way, the voice recognition result acquisition unit 221 acquires a series of voice data. Each time the voice recognition result acquisition unit 221 acquires a series of voice data, it transmits the acquired voice data to the control unit 31 of the server 3 via the communication units 25 and 33 shown in FIG. 1.

一方、サーバー３の制御部３１は、図３に示すように、音声認識を行う音声認識部３１１を備える。その音声認識部３１１は、音声認識結果取得部２２１からの音声データを受信すると、記憶部３２に格納された音声認識辞書３２１を参照して、音声データの音声を音響的又は言語的に尤もらしい語彙をもとに語彙の抽出を行い、音声認識する。音声認識部３１１は、図１に示す通信部３３、２５を介して、認識結果を図３に示す携帯端末２Ａの音声認識結果取得部２２１に送信する。 On the other hand, the control unit 31 of the server 3 includes a voice recognition unit 311 that performs voice recognition, as shown in FIG. 3. When the voice recognition unit 311 receives voice data from the voice recognition result acquisition unit 221, it refers to the voice recognition dictionary 321 stored in the memory unit 32, extracts vocabulary based on acoustically or linguistically plausible words from the voice data, and performs voice recognition. The voice recognition unit 311 transmits the recognition result to the voice recognition result acquisition unit 221 of the mobile terminal 2A shown in FIG. 3 via the communication units 33 and 25 shown in FIG. 1.

音声認識結果取得部２２１は、音声認識部３１１から音声認識結果を受信すると、その音声認識結果を応答生成部２２３に送信する。 When the voice recognition result acquisition unit 221 receives the voice recognition result from the voice recognition unit 311, it transmits the voice recognition result to the response generation unit 223.

これに対して、実行可能操作選定部２２２は、オペレータの音声指示の操作が規制操作である場合の、その音声指示の操作の代わりとなる操作を選定する。実行可能操作選定部２２２は、そのような操作を選定するため、オペレータの音声を音声認識した段階で、高所作業車５が実行することができる実行可能操作を特定する。 In response to this, the executable operation selection unit 222 selects an operation to replace the voice-instructed operation when the operation instructed by the operator's voice is a restricted operation. In order to select such an operation, the executable operation selection unit 222 identifies executable operations that can be performed by the aerial work vehicle 5 at the stage of performing voice recognition of the operator's voice.

詳細には、実行可能操作選定部２２２は、音声認識結果取得部２２１がマイクロフォン２１から音声データを取得する毎に、規制操作が何であるかを特定するため、コントローラ４Ａから図４に示す規制操作情報７１を取得する。また、記憶部２４には、図３に示すように、作業車操作データベース２４１が格納されているところ、実行可能操作選定部２２２は、規制操作の代わりとなる操作を選定するデータを得るため、その記憶部２４から作業車操作データベース２４１を読み出す。 In detail, each time the voice recognition result acquisition unit 221 acquires voice data from the microphone 21, the executable operation selection unit 222 acquires restricted operation information 71 shown in FIG. 4 from the controller 4A in order to identify the restricted operation. In addition, the storage unit 24 stores a work vehicle operation database 241 as shown in FIG. 3, and the executable operation selection unit 222 reads out the work vehicle operation database 241 from the storage unit 24 in order to obtain data for selecting an operation to replace the restricted operation.

上記の規制操作情報７１では、図４に示すように、高所作業車５の姿勢又は状態と、それら姿勢又は状態のときの、規制操作の規制される対象（規制対象という）及び規制される操作項目が対応付けられている。また、作業車操作データベース２４１では、図５に示すように、高所作業車５の姿勢又は状態とその姿勢又は状態で実行することができる操作（以下、実行可能操作という）の規制対象、操作項目が対応付けられている。 In the above-mentioned restricted operation information 71, as shown in FIG. 4, the posture or state of the aerial work vehicle 5 is associated with the restricted objects (referred to as regulated objects) and restricted operation items of the restricted operation in that posture or state. In addition, in the work vehicle operation database 241, as shown in FIG. 5, the posture or state of the aerial work vehicle 5 is associated with the restricted objects and operation items of the operations that can be performed in that posture or state (hereinafter referred to as executable operations).

図３に示す実行可能操作選定部２２２は、読み出した作業車操作データベース２４１を用いて、上記のコントローラ４Ａから取得した規制操作情報７１の姿勢又は状態に対応する実行可能操作を選定する。これにより、実行可能操作選定部２２２は、マイクロフォン２１から音声データを取得した段階での規制操作に対する実行可能操作を選定する。そして、実行可能操作選定部２２２は、高所作業車５の姿勢又は状態のデータと規制操作のデータと共に、選定した実行可能操作のデータを応答生成部２２３に送信する。 The executable operation selection unit 222 shown in FIG. 3 uses the read work vehicle operation database 241 to select an executable operation corresponding to the posture or state of the restricted operation information 71 acquired from the controller 4A. In this way, the executable operation selection unit 222 selects an executable operation for the restricted operation at the stage when the voice data is acquired from the microphone 21. The executable operation selection unit 222 then transmits data on the selected executable operation to the response generation unit 223 together with data on the posture or state of the aerial work vehicle 5 and data on the restricted operation.

応答生成部２２３は、音声認識結果が高所作業車５を操作するための指示（以下、コマンドという）であるか否か、また、音声認識結果が規制操作の指示であるか否かに応じてオペレータに対する応答を生成する。 The response generation unit 223 generates a response to the operator depending on whether the voice recognition result is an instruction (hereinafter referred to as a command) for operating the aerial work vehicle 5, and whether the voice recognition result is an instruction for a regulated operation.

詳細には、応答生成部２２３は、まず、音声認識結果取得部２２１から音声認識結果を受信する。また、応答生成部２２３は、応答を生成するときのデータを得るため、実行可能操作選定部２２２から高所作業車５の姿勢又は状態のデータ、規制操作のデータ及び、実行可能操作のデータを受信する。さらに、記憶部２４には、図６に示す高所作業車５を操作するための指示（以下、コマンドという）のコマンド対象物、コマンド内容と、そのコマンドに対する応答、そのコマンドに対する高所作業車５の操作信号とが対応付けられたコマンド辞書２４２が格納されているところ、図３に示す応答生成部２２３は、そのコマンド辞書２４２を読み出す。 In detail, the response generation unit 223 first receives the voice recognition result from the voice recognition result acquisition unit 221. To obtain the data needed for generating a response, it also receives data on the posture or state of the aerial work vehicle 5, data on restricted operations, and data on executable operations from the executable operation selection unit 222. Furthermore, the memory unit 24 stores the command dictionary 242 shown in FIG. 6, in which the command object and command content of each instruction for operating the aerial work vehicle 5 (hereinafter referred to as a command) are associated with the response to that command and the operation signal of the aerial work vehicle 5 for that command; the response generation unit 223 shown in FIG. 3 reads out this command dictionary 242.

応答生成部２２３は、受信した音声認識結果がコマンドに関するか否かを判定するため、音声認識結果に、コマンド辞書２４２のコマンド対象物とコマンド内容のワード又はそれらに類似したワードが含まれているか否かを判定する。 The response generation unit 223 determines whether the received voice recognition result is related to a command by determining whether the voice recognition result contains words of the command object and command content in the command dictionary 242 or words similar to them.

応答生成部２２３は、受信した音声認識結果に、コマンド辞書２４２のコマンド対象物とコマンド内容のワード又はそれらに類似したワードが含まれていないと判定した場合、音声認識結果がコマンドに関するものでないと扱う。そして、記憶部２４に格納された応答データベース２４３を読み出す。図示しないが、応答データベース２４３では、音声認識結果がコマンドに関するものでないときの、その音声認識結果に含まれる語彙と応答とが対応付けられている。応答生成部２２３は、読み出した応答データベース２４３を参照して応答を生成する。応答生成部２２３は、生成した応答のデータをスピーカに出力し、スピーカ２３が応答データの電気信号を音声に変換することにより、オペレータにその応答を伝える。 When the response generation unit 223 determines that the received voice recognition result does not contain the words of the command object and command content in the command dictionary 242 or words similar thereto, it treats the voice recognition result as not related to a command. It then reads out the response database 243 stored in the storage unit 24. Although not shown, the response database 243 associates the vocabulary contained in the voice recognition result with a response when the voice recognition result is not related to a command. The response generation unit 223 generates a response by referring to the read response database 243. The response generation unit 223 outputs the generated response data to the speaker, and the speaker 23 converts the electrical signal of the response data into sound, thereby conveying the response to the operator.

一方、応答生成部２２３は、コマンド対象物とコマンド内容のワード又はそれらに類似したワードが含まれていると判定した場合、音声認識結果がコマンドに関するものであると扱う。そして、その場合、応答生成部２２３は、音声認識結果が規制操作に関するか否かを判定するため、音声認識結果に、実行可能操作選定部２２２から受信した規制操作のデータの規制対象と規制項目のワード又は、それらに類似したワードが含まれているか否かを判定する。 On the other hand, if the response generation unit 223 determines that the voice recognition result contains words for the command object and command content or words similar thereto, it treats the voice recognition result as relating to a command. In that case, the response generation unit 223 determines whether the voice recognition result contains words for the restricted object and restricted items of the data for the restricted operation received from the executable operation selection unit 222 or words similar thereto, in order to determine whether the voice recognition result is related to a restricted operation.

応答生成部２２３は、受信した音声認識結果に規制対象と規制項目のワード又は、それらに類似したワードが含まれていると判定した場合、音声認識結果が規制操作に関するものであるとする。そして、応答生成部２２３は、実行可能操作選定部２２２から受信した姿勢又は状態のデータと実行可能操作のデータを用いて、音声指示の操作の代わりとなる操作として実行可能操作を提案する応答文を作成する。 When the response generation unit 223 determines that the received voice recognition result contains words of restricted objects and restricted items or words similar thereto, it determines that the voice recognition result is related to a restricted operation. Then, using the posture or state data and the executable operation data received from the executable operation selection unit 222, the response generation unit 223 creates a response sentence that suggests an executable operation as an operation to replace the voice-instructed operation.

詳細には、応答生成部２２３は、高所作業車５が、実行可能操作選定部２２２から受信した姿勢又は状態のデータにあること、そのため、音声指示ができないこと、その代わりに、実行可能操作選定部２２２から受信した実行可能操作のデータの操作ができること、という内容の応答文を作成する。 In detail, the response generation unit 223 creates a response sentence stating that the aerial work vehicle 5 is in the posture or state indicated by the data received from the executable operation selection unit 222, that the voice-instructed operation therefore cannot be performed, and that the operation indicated by the executable operation data received from the executable operation selection unit 222 can be performed instead.

応答生成部２２３は、生成した応答文の応答データをスピーカ２３に出力する。応答生成部２２３は、スピーカ２３が応答データの電気信号を音声に変換することにより、オペレータに、音声で指示した操作の代わりとなる実行可能操作があることを提示する。 The response generation unit 223 outputs the response data of the generated response sentence to the speaker 23. The response generation unit 223 presents to the operator that there is an executable operation that can be used instead of the operation instructed by voice, by the speaker 23 converting the electrical signal of the response data into voice.

一方、応答生成部２２３は、受信した音声認識結果に規制対象と規制項目のワード又は、それらに類似したワードが含まれていないと判定した場合、音声認識結果が規制操作に関するものでないとする。応答生成部２２３は、上述した、音声認識結果にコマンド辞書２４２のコマンド対象物とコマンド内容のワード又は、それらに類似したワードが含まれているという判定の、そのコマンド対象物とコマンド内容に対応付けられた、図６に示すコマンド辞書２４２のコマンドに対する応答を用いて応答を生成する。そして、生成した応答をスピーカ２３に出力する。 On the other hand, if the response generation unit 223 determines that the received voice recognition result does not contain the words for the restricted object and restricted items, or similar words, it treats the voice recognition result as not relating to a restricted operation. In this case, the response generation unit 223 generates a response using the response in the command dictionary 242 shown in FIG. 6 that is associated with the command object and command content matched in the command determination described above. It then outputs the generated response to the speaker 23.

その生成された応答では、そのコマンド対象物をコマンド内容どおりに実行する旨の内容が示されている。応答生成部２２３は、スピーカ２３がこの応答文の電気信号を音声に変換することにより、オペレータに音声認識したコマンドどおりに操作を実行することを知らせる。また、応答生成部２２３は、上述した、コマンド対象物とコマンド内容のワードに対応付けられたコマンド辞書２４２の操作信号を、図１に示す通信部２５に出力する。これにより、通信装置６２を介して操作信号をコントローラ４Ａに送信する。その結果、応答生成部２２３は、音声認識したコマンドどおりに高所作業車５を作動させる。 The generated response indicates that the command object will be executed according to the command content. The response generation unit 223 notifies the operator that the operation will be executed according to the voice-recognized command by having the speaker 23 convert the electrical signal of this response sentence into sound. The response generation unit 223 also outputs an operation signal from the command dictionary 242 associated with the command object and the words of the command content described above to the communication unit 25 shown in FIG. 1. This causes the operation signal to be sent to the controller 4A via the communication device 62. As a result, the response generation unit 223 operates the aerial work vehicle 5 according to the voice-recognized command.
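The three-way branching of the response generation unit 223 described above (not a command, a restricted command, an executable command) can be sketched in Python as follows. This is a minimal illustration, not the patent's implementation: the dictionary entries, the signal names, the `SIMILAR_WORDS` table, the fallback sentence standing in for the response database 243, and the use of plain substring matching for "similar words" are all assumptions made for the example.

```python
# Hypothetical excerpt of the command dictionary 242: command object, command
# content, response to the command, and operation signal.
COMMAND_DICTIONARY = [
    {"object": "boom", "content": "extend",
     "response": "Extending the boom.", "signal": "BOOM_EXTEND"},
    {"object": "outrigger", "content": "extend",
     "response": "Extending the outriggers.", "signal": "OUTRIGGER_EXTEND"},
]

# Hypothetical table of words treated as similar to a dictionary word.
SIMILAR_WORDS = {"extend": ["extend", "stretch out"]}


def contains(text, word):
    """True if the text contains the word or one of its similar words."""
    return any(variant in text for variant in SIMILAR_WORDS.get(word, [word]))


def generate_response(text, state, restricted, executable):
    """Return the spoken response and the operation signal (None if nothing is sent).

    restricted: None, or {"object": str, "items": [str, ...]}
    executable: list of (operation object, operation item) pairs
    """
    text = text.lower()
    command = next((c for c in COMMAND_DICTIONARY
                    if contains(text, c["object"]) and contains(text, c["content"])), None)
    if command is None:
        # Stand-in for a lookup in the response database 243.
        return {"speech": "Sorry, that is not an operation command.", "signal": None}
    if (restricted is not None and contains(text, restricted["object"])
            and any(contains(text, item) for item in restricted["items"])):
        speech = f"The vehicle is in the '{state}' state, so that operation cannot be performed."
        if executable:
            obj, item = executable[0]
            speech += f" Instead, you can {item} the {obj}."
        return {"speech": speech, "signal": None}
    return {"speech": command["response"], "signal": command["signal"]}
```

Only the last branch returns an operation signal, which mirrors the text: a signal is sent to the controller 4A only when the recognized command is not a restricted operation.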

図３に戻って、記憶部２４には、送信時刻と送信した操作信号が対応付けられた操作履歴データベース２４４が格納されている。応答生成部２２３は、操作信号をコントローラ４Ａに出力する毎に、計時した時刻と共に、操作信号を操作履歴データベース２４４に送信して記憶させる。換言すると、計時した時刻と共に、操作内容を操作履歴データベース２４４に送信して記憶させる。 Returning to FIG. 3, the memory unit 24 stores an operation history database 244 in which the transmission time is associated with the transmitted operation signal. Each time the response generation unit 223 outputs an operation signal to the controller 4A, the response generation unit 223 transmits the operation signal together with the clocked time to the operation history database 244 for storage. In other words, the operation content is transmitted together with the clocked time to the operation history database 244 for storage.

上述した実行可能操作は、規制操作情報７１の姿勢又は状態によっては、それらに対応する実行可能操作が複数個存在することがある。そして、上述したように、応答生成部２２３は、受信した音声認識結果に規制対象と規制項目のワード又は、それらに類似したワードが含まれていると判定した場合に、音声指示の操作の代わりに実行可能操作を提案する応答文を作成する。このため、この場合、提案する実行可能操作が複数個存在することになってしまい、どの操作が望ましいのか、オペレータが迷ってしまうおそれがある。 Depending on the posture or state in the restricted operation information 71, multiple executable operations may correspond to that posture or state. As described above, when the response generation unit 223 determines that the received voice recognition result contains the words for the restricted object and restricted items, or similar words, it creates a response sentence that proposes an executable operation in place of the voice-instructed operation. In such a case, multiple executable operations would be proposed, and the operator may be unsure which one is preferable.

そこで、応答生成部２２３は、操作をより容易にするため、実行可能操作が複数個存在する場合、上記の操作履歴データベース２４４を用いて、より望ましい実行可能操作を１つ提案する。 Therefore, to make operation easier, when there are multiple executable operations, the response generation unit 223 uses the above-mentioned operation history database 244 to propose the single most desirable executable operation.

詳細には、応答生成部２２３は、操作履歴データベース２４４を読み出して、それら複数個の実行可能操作のうち、直近に操作された実行可能操作を、望ましい実行可能操作とする。なお、応答生成部２２３は、それら複数個の実行可能操作のうち、最も頻度が高い実行可能操作を望ましい実行可能操作としてもとよい。これにより、応答生成部２２３は、音声指示の代わりとなる、より望ましい実行可能操作を１つ提案する応答を生成する。その結果、音声操作を容易にする。 In detail, the response generation unit 223 reads the operation history database 244, and among the multiple executable operations, sets the most recently performed executable operation as the desired executable operation. Note that the response generation unit 223 may set the most frequently performed executable operation among the multiple executable operations as the desired executable operation. In this way, the response generation unit 223 generates a response that suggests one more desirable executable operation as an alternative to a voice instruction. As a result, voice operation is made easier.
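Both selection rules mentioned above (most recently performed, or most frequently performed) can be sketched in Python over a list of history records. This is an illustrative sketch only: the record layout `(timestamp, operation)` is an assumed representation of the operation history database 244, and the fallback to the first candidate when no candidate appears in the history is an assumption the source does not address.

```python
from collections import Counter


def most_recent(candidates, history):
    """Pick the candidate operation that was performed most recently.

    history: iterable of (timestamp, operation) records.
    """
    for _, operation in sorted(history, key=lambda record: record[0], reverse=True):
        if operation in candidates:
            return operation
    return candidates[0] if candidates else None  # assumed fallback


def most_frequent(candidates, history):
    """Pick the candidate operation that appears most often in the history."""
    counts = Counter(op for _, op in history if op in candidates)
    if not counts:
        return candidates[0] if candidates else None  # assumed fallback
    return counts.most_common(1)[0][0]
```

The two rules can disagree: an operation performed twice long ago is the most frequent, while a different operation performed once just now is the most recent.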

次に、図７を参照して、作業車音声操作システム１Ａが実施する作業車音声操作処理について説明する。以下の説明では、携帯端末２Ａを起動したときに、携帯端末２Ａの表示装置に、記憶部２４に記憶された作業車音声操作プログラムを起動するためのアイコンが表示されているものとする。また、作業車音声操作プログラムが起動された後、表示装置に作業車音声操作プログラムを終了させる終了ボタンが常時表示されるものとする。さらに、サーバー３では、予め音声認識プログラムが起動され、音声認識処理が行われているものとする。そして、高所作業車５のキャビンには、図示しない遠隔操作選択ボタンが設けられているものとする。 Next, referring to FIG. 7, the work vehicle voice operation processing implemented by the work vehicle voice operation system 1A will be described. In the following description, it is assumed that when the mobile terminal 2A is started, an icon for starting the work vehicle voice operation program stored in the memory unit 24 is displayed on the display device of the mobile terminal 2A. In addition, after the work vehicle voice operation program is started, an end button for ending the work vehicle voice operation program is constantly displayed on the display device. Furthermore, it is assumed that a voice recognition program has been started in advance in the server 3, and voice recognition processing is being performed. And it is assumed that a remote operation selection button (not shown) is provided in the cabin of the aerial work vehicle 5.

図７は、作業車音声操作システム１Ａが実施する作業車音声操作処理のフローチャートである。 Figure 7 is a flowchart of the work vehicle voice operation process performed by the work vehicle voice operation system 1A.

まず、高所作業車５のオペレータは、上記の図示しない遠隔操作選択ボタンをオン状態に切り換える。オペレータは、さらに、携帯端末２Ａを起動する。続いて、表示装置に表示された作業車音声操作プログラムのアイコンをタップする。これにより、制御部２２のマイクロコンピュータによって作業車音声操作プログラムが実行される。その結果、作業車音声操作処理のフローが開始される。 First, the operator of the aerial work vehicle 5 switches the remote operation selection button (not shown) to the on state. The operator then starts the mobile terminal 2A. Next, the operator taps the icon of the work vehicle voice operation program displayed on the display device. This causes the microcomputer of the control unit 22 to execute the work vehicle voice operation program. As a result, the work vehicle voice operation processing flow starts.

作業車音声操作処理のフローが開始されると、図７に示すように、制御部２２は、まず、マイクロフォン２１に音声入力があったか否かを判定する（ステップＳ１）。この判定では、マイクロフォン２１に一定の音量よりも大きい音が入力された場合に、音声の入力があったものとする。 When the flow of the work vehicle voice operation processing is started, as shown in FIG. 7, the control unit 22 first determines whether or not there has been voice input to the microphone 21 (step S1). In this determination, if a sound that is louder than a certain volume is input to the microphone 21, it is determined that there has been voice input.

次に、制御部２２は、音声入力があったと判定した場合（ステップＳ１のＹｅｓ）、オペレータが発した声が入力されたものとして、その音声のデータを取得する（ステップＳ２）。この音声データの取得では、上記の一定の音量よりも大きい音が一定時間、途切れるまで行い、一続きの発声の音声データを取得する。一方、制御部２２は、音声入力がないと判定した場合（ステップＳ１のＮｏ）、ステップＳ１に戻り、ステップＳ１を繰り返す。 Next, if the control unit 22 determines that there has been voice input (Yes in step S1), it treats the input as the operator's voice and acquires the voice data (step S2). Acquisition continues until sound louder than the above-mentioned volume threshold has been absent for a certain period of time, so that the voice data of one continuous utterance is acquired. On the other hand, if the control unit 22 determines that there has been no voice input (No in step S1), it returns to step S1 and repeats step S1.
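Steps S1 and S2 describe a volume-threshold approach to cutting one continuous utterance out of the microphone input. A minimal Python sketch of that idea follows, assuming the input arrives as a sequence of per-frame volume levels. The function name, the frame representation, and measuring the silence period as a frame count are assumptions made for illustration.

```python
def capture_utterance(frames, threshold, max_silent_frames):
    """Return the frames of the first continuous utterance.

    Capture starts at the first frame louder than `threshold` (step S1) and
    ends once `max_silent_frames` consecutive frames are at or below it (step S2).
    """
    utterance = []
    silent = 0
    started = False
    for level in frames:
        if not started:
            if level > threshold:  # voice input detected
                started = True
                utterance.append(level)
            continue
        utterance.append(level)
        if level > threshold:
            silent = 0             # speech resumed; reset the silence counter
        else:
            silent += 1
            if silent >= max_silent_frames:
                break              # silence lasted long enough: utterance ended
    # Drop the trailing silent frames that were collected while waiting.
    return utterance[:len(utterance) - silent] if silent else utterance
```

A short pause inside the utterance (fewer silent frames than `max_silent_frames`) does not end the capture, which is what lets a multi-word instruction be sent to the server 3 as one piece of voice data.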

制御部２２は、音声の取得に続いて、取得した音声データをサーバー３に送信する（ステップＳ３）。この送信は、携帯端末２Ａの通信部２５によってネットワーク１００に接続することより行う。 After acquiring the voice, the control unit 22 transmits the acquired voice data to the server 3 (step S3). This transmission is performed by connecting the communication unit 25 of the mobile terminal 2A to the network 100.

サーバー３では、図１に示す制御部３１が音声認識プログラムを実行することにより、音声認識処理を行っている。制御部３１は、通信部３３を介して音声データを受信し、受信した音声データを音声認識する。そして、制御部３１は、音声認識結果を、通信部３３を介して携帯端末２Ａの制御部２２に送信する。 In the server 3, the control unit 31 shown in FIG. 1 executes a voice recognition program to perform voice recognition processing. The control unit 31 receives voice data via the communication unit 33 and performs voice recognition on the received voice data. The control unit 31 then transmits the voice recognition result to the control unit 22 of the mobile terminal 2A via the communication unit 33.

図７に戻って、制御部２２は、音声データをサーバー３に送信した後、サーバー３から送信された音声認識結果を受信する（ステップＳ４）。 Returning to FIG. 7, after transmitting the voice data to the server 3, the control unit 22 receives the voice recognition result transmitted from the server 3 (step S4).

続いて、制御部２２は、実行可能操作を選定する（ステップＳ５）。詳細には、制御部２２は、図３に示すコントローラ４Ａから規制操作情報７１を取得し、さらに記憶部２４から作業車操作データベース２４１を読み出す。上述したように、規制操作情報７１では、高所作業車５の現在の姿勢又は現在の状態に、その姿勢又はその状態のときの規制操作が対応付けられており、作業車操作データベース２４１では、高所作業車５の各姿勢又は各状態に、それら各姿勢又は各状態のときの実行可能操作が対応付けられている。制御部２２は、規制操作情報７１と作業車操作データベース２４１を用いて、取得した規制操作情報７１の規制操作がかかっているときの、実行可能操作を選定する。 Next, the control unit 22 selects executable operations (step S5). In detail, the control unit 22 acquires the restricted operation information 71 from the controller 4A shown in FIG. 3 and reads out the work vehicle operation database 241 from the memory unit 24. As described above, the restricted operation information 71 associates the current posture or state of the aerial work vehicle 5 with the operations restricted in that posture or state, and the work vehicle operation database 241 associates each posture or state of the aerial work vehicle 5 with the operations executable in that posture or state. Using both, the control unit 22 selects the operations that are executable while the restricted operations in the acquired restricted operation information 71 are in effect.

具体的な例で説明すると、例えば、図４に示す規制操作情報７１の例では、高所作業車５の現在の姿勢又は現在の状態が「ジャッキ未設置」の状態にあり、その状態に、規制対象「ブーム」、規制項目「起こし、倒し、伸長、縮小、右旋回、左旋回」という各規制操作が対応付けられている。一方、図５に示す作業車操作データベース２４１の例では、高所作業車５が「ジャッキ未設置」という姿勢又は状態に、操作対象「アウトリガ」、操作項目「張り出し」という実行可能操作と、操作対象「ジャッキ」、操作項目「伸長」という実行可能操作が対応付けられている。この場合、制御部２２は、規制操作情報７１の規制対象「ブーム」、規制項目「起こし、倒し、伸長、縮小、右旋回、左旋回」という規制操作のときの、実行可能操作として、操作対象「アウトリガ」、操作項目「張り出し」という操作と、操作対象「ジャッキ」、操作項目「伸長」という操作を選定する。 To give a specific example, in the restricted operation information 71 shown in Figure 4, the current posture or state of the aerial work vehicle 5 is "jack not installed", and this state is associated with the restricted operations given by the restricted object "boom" and the restricted items "raise, lower, extend, retract, turn right, turn left". Meanwhile, in the work vehicle operation database 241 shown in Figure 5, the posture or state "jack not installed" is associated with two executable operations: the operation object "outrigger" with the operation item "extend", and the operation object "jack" with the operation item "extend". In this case, as the executable operations available while the restricted object "boom" and the restricted items "raise, lower, extend, retract, turn right, turn left" are restricted, the control unit 22 selects the operation with the operation object "outrigger" and operation item "extend" and the operation with the operation object "jack" and operation item "extend".
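The selection in step S5 is essentially a table lookup keyed by the vehicle's posture or state. The Python sketch below encodes the "jack not installed" example from Figures 4 and 5 as described in the text. The dictionary layout and field names are assumptions for illustration, and only the one state given in the description is filled in.

```python
# Restricted operation information 71 (example of Figure 4, as described in the text).
RESTRICTED_OPERATION_INFO = {
    "state": "jack not installed",
    "restricted_object": "boom",
    "restricted_items": ["raise", "lower", "extend", "retract", "turn right", "turn left"],
}

# Work vehicle operation database 241 (example of Figure 5): posture or state
# mapped to the (operation object, operation item) pairs executable in it.
WORK_VEHICLE_OPERATION_DB = {
    "jack not installed": [("outrigger", "extend"), ("jack", "extend")],
}


def select_executable_operations(restricted_info, operation_db):
    """Return the operations executable in the state named by the restricted info."""
    return list(operation_db.get(restricted_info["state"], []))


print(select_executable_operations(RESTRICTED_OPERATION_INFO, WORK_VEHICLE_OPERATION_DB))
```

Keeping the executable operations in a separate table keyed by state means the controller only has to report the current state and its restrictions; the alternatives to propose are looked up on the terminal side.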

図７に戻って、制御部２２は、ステップＳ５に続いて、音声認識結果にコマンド辞書２４２のコマンドが含まれているか否かを判定する（ステップＳ６）。詳細には、記憶部２４にコマンド辞書２４２が格納されているところ、制御部２２は、コマンド辞書２４２を読み出し、音声認識結果に、コマンド辞書２４２のコマンド内容のワード又は類似するワードが含まれているか否かを判定する。 Returning to FIG. 7, following step S5, the control unit 22 determines whether the voice recognition result includes a command in the command dictionary 242 (step S6). In detail, the command dictionary 242 is stored in the storage unit 24, and the control unit 22 reads the command dictionary 242 and determines whether the voice recognition result includes a word of the command content in the command dictionary 242 or a similar word.

制御部２２は、音声認識結果にコマンド辞書２４２のコマンドが含まれていると判定した場合（ステップＳ６のＹｅｓ）、続いて、音声認識結果に規制操作のワードが含まれているか否かを判定する（ステップＳ７）。詳細には、制御部２２は、ステップＳ５の実行可能操作の選定で、規制操作情報７１から規制操作のデータを得ているところ、その規制操作の規制対象と規制項目のワード又は類似するワードが含まれているか否かを判定する。 When the control unit 22 determines that the voice recognition result includes a command in the command dictionary 242 (Yes in step S6), it then determines whether or not the voice recognition result includes a word for a restricted operation (step S7). In detail, the control unit 22 has already obtained the restricted operation data from the restricted operation information 71 when selecting executable operations in step S5, and it determines whether the voice recognition result includes the words for the restricted object and restricted items of that restricted operation, or similar words.

例えば、図４の規制操作情報７１の例の場合、制御部２２は、規制対象「ブーム」と規制項目「起こし、倒し、伸長、縮小、右旋回、左旋回」のそれぞれのワード又はそれらに類似するワードが音声認識結果に含まれているか否かを判定する。 For example, in the case of the restricted operation information 71 in FIG. 4, the control unit 22 determines whether the voice recognition result includes the word for the restricted object "boom" and any of the words for the restricted items "raise, lower, extend, retract, turn right, turn left", or words similar to them.

制御部２２は、図７に示すように、音声認識結果に規制操作のワードが含まれていると判定した場合（ステップＳ７のＹｅｓ）、オペレータの音声指示が規制操作の指示であると扱い、その音声指示の操作に代えて、ステップＳ５で選定した実行可能操作を提案する応答を生成する（ステップＳ８）。 As shown in FIG. 7, when the control unit 22 determines that the voice recognition result contains a word for a restricted operation (Yes in step S7), it treats the operator's voice instruction as an instruction for a restricted operation, and generates a response proposing the executable operation selected in step S5 in place of the operation for which the voice instruction is made (step S8).

この応答の生成では、実行可能操作が複数個存在する場合、制御部２２は、記憶部２４から操作履歴データベース２４４を読み出し、操作履歴データベース２４４を用いて、それら複数個の実行可能操作のうち、直近に操作された実行可能操作を提案する実行可能操作とする。そして、制御部２２は、その実行可能操作を提案する応答を生成する。なお、提案する実行可能操作は、複数個の実行可能操作のうち、操作履歴データベース２４４で最も頻度が高い実行可能操作であってもよい。 When generating this response, if there are multiple executable operations, the control unit 22 reads out the operation history database 244 from the storage unit 24, and uses the operation history database 244 to set the most recently performed executable operation among the multiple executable operations as the proposed executable operation. The control unit 22 then generates a response proposing the executable operation. Note that the proposed executable operation may be the executable operation that is most frequently found in the operation history database 244 among the multiple executable operations.

For example, suppose the control unit 22 determines that the voice recognition result includes the word for the restricted target "boom" from the example of the restricted operation information 71 shown in FIG. 4, together with the word "extend it" (伸ばして), which is similar to the restricted item "extend" (伸長). In that case, the control unit 22 selects, from the work vehicle operation database 241 shown in FIG. 5, the operation with operation target "outrigger" and operation item "extend" and the operation with operation target "jack" and operation item "extend" as the executable operations to propose in the response. Referring to the operation history database 244, the control unit 22 then generates a response that proposes the most recently performed of the selected operations as the preferred executable operation in place of the voice-instructed operation, and proposes the other executable operation as the next preferred one.

Although not shown in the figure, once the control unit 22 has generated the response proposing an executable operation in step S8, it outputs the generated response data to the speaker 23. The operator is thereby prompted to perform the executable operation instead of the voice-instructed operation.

On the other hand, as shown in FIG. 7, when the control unit 22 determines that the voice recognition result does not include a word for a restricted operation (No in step S7), it generates a response using the response sentence in the command dictionary 242 that corresponds to the command identified in step S6 (step S9).

Specifically, when the control unit 22 determines in step S6 whether the voice recognition result includes a command in the command dictionary 242, it reads from the command dictionary 242 the response associated with the command found in the voice recognition result. It then generates a response using the response it read. Because the read response states that the command will be executed, the response generated by the control unit 22 states that the operation instructed by the operator's voice will be executed. The control unit 22 outputs this response data to the speaker 23, which notifies the operator that the voice-instructed operation will be executed.

For example, if the voice recognition result is "Extend the jack" and the control unit 22 determines that it does not include the words for the restricted target "boom" and the restricted item "extend" in the example of the restricted operation information 71 shown in FIG. 4, or words similar to them, the control unit 22 generates a response using the response "Yes, I will extend the jack", which corresponds to the command object "jack" and the command content "extend" in the command dictionary 242 shown in FIG. 6, and outputs the generated response to the speaker 23.

Although not shown in the figure, the control unit 22 also reads from the command dictionary 242 the operation signal associated with the command found in the voice recognition result and outputs that operation signal to the communication unit 25. The operation signal is thereby transmitted to the controller 4A via the communication device 62. As a result, the aerial work platform 5 operates as instructed by the operator's voice.
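The command dictionary lookup described above, in which one entry yields both a spoken response and an operation signal, can be sketched as follows. The record layout, the signal strings, and the substring matching are hypothetical; the source only states that a response and an operation signal are associated with each command.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CommandEntry:
    command_object: str    # e.g. "jack"
    command_content: str   # e.g. "extend"
    response: str          # sentence announcing that the command will be executed
    operation_signal: str  # hypothetical signal name sent to the controller


# Illustrative entries; only the jack response comes from the FIG. 6 example.
COMMAND_DICTIONARY = [
    CommandEntry("jack", "extend", "Yes, I will extend the jack", "JACK_EXTEND"),
    CommandEntry("boom", "raise", "Yes, I will raise the boom", "BOOM_RAISE"),
]


def find_command(recognized_text: str) -> Optional[CommandEntry]:
    """Return the first entry whose object and content both appear in the text."""
    text = recognized_text.lower()
    for entry in COMMAND_DICTIONARY:
        if entry.command_object in text and entry.command_content in text:
            return entry
    return None


entry = find_command("Extend the jack")
if entry is not None:
    spoken = entry.response          # would be output to the speaker
    signal = entry.operation_signal  # would be output to the communication unit
```

Keeping the response and the operation signal in the same record means a single lookup serves both the announcement to the operator and the transmission to the controller.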

Returning to FIG. 7, once the control unit 22 has generated a response in step S8 or S9, it returns to step S1 to prepare for the operator's next voice input.

On the other hand, when the control unit 22 determines in step S6 that the voice recognition result does not include a command in the command dictionary 242 (No in step S6), it generates a response by referring to the response database 243, which stores responses to voice inputs other than commands (step S10). It then outputs the generated response data to the speaker 23 to respond to the operator.

After generating a response in step S10, the control unit 22 returns to step S1, as it does after steps S8 and S9, to prepare for the operator's next voice input.
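The branching across steps S6 to S10 can be summarized in the sketch below. It is a simplified illustration under stated assumptions: the dictionary layouts, the wording of the responses, the fallback reply, and the plain substring matching are all hypothetical, and the proposal is passed in as if step S5 had already selected it.

```python
def generate_response(text, command_dict, restricted, proposal, response_db):
    """Return (step, response) following the S6 to S10 branching.

    command_dict: {(command_object, command_content): response sentence}
    restricted:   (restricted_target, [restricted_items])
    proposal:     (target, item) of the executable operation chosen in step S5
    response_db:  {keyword: response} for inputs that are not commands
    """
    t = text.lower()

    # S6: does the recognition result contain a command?
    command = next(
        ((obj, content) for (obj, content) in command_dict if obj in t and content in t),
        None,
    )
    if command is None:
        # S10: answer from the response database.
        for keyword, reply in response_db.items():
            if keyword in t:
                return "S10", reply
        return "S10", "Sorry, I did not catch that."  # hypothetical fallback

    # S7: does it contain the words of a restricted operation?
    target, items = restricted
    if target in t and any(item in t for item in items):
        # S8: propose the executable operation instead.
        return "S8", f"That operation is restricted. How about {proposal[1]} the {proposal[0]}?"

    # S9: announce that the command will be executed.
    return "S9", command_dict[command]


commands = {
    ("boom", "extend"): "Yes, I will extend the boom",
    ("jack", "extend"): "Yes, I will extend the jack",
}
restricted = ("boom", ["raise", "lower", "extend", "retract"])
small_talk = {"hello": "Hello. Ready for your instruction."}

step_a, _ = generate_response("Extend the boom", commands, restricted, ("jack", "extend"), small_talk)
step_b, reply_b = generate_response("Extend the jack", commands, restricted, ("jack", "extend"), small_talk)
step_c, _ = generate_response("Hello", commands, restricted, ("jack", "extend"), small_talk)
```

In this sketch the three utterances take the S8, S9, and S10 branches respectively; in each case the flow would then return to step S1 to wait for the next voice input.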

As described above, after the work vehicle voice operation program is started, an end button for terminating the program is always displayed on the display device of the mobile terminal 2A. When the operator wishes to end the work vehicle voice operation process, the operator presses the end button. This forcibly terminates the flow of the work vehicle voice operation process.

Of the configurations of the mobile terminal 2A and the server 3 described above, the speaker 23 is an example of the presentation unit referred to in this specification and the claims. The computer including the CPU provided in the control unit 31 of the server 3 is an example of the first computer referred to in this specification and the claims. The microcomputer provided in the control unit 22 of the mobile terminal 2A is an example of the second computer. The detectors 58A-58E are examples of the attitude detector or state detector referred to in this specification and the claims. Furthermore, the step in which the server 3 performs voice recognition, step S4, step S5, and steps S6-S10 are examples of, respectively, the voice recognition step, the voice recognition result acquisition step, the executable operation selection step, and the response generation step referred to in the claims.

As described above, in the work vehicle voice operation system 1A according to the first embodiment, when the voice recognition result includes a command for operating the work vehicle and also includes a word for a restricted operation, the response generation unit 223 generates a response recommending an executable operation in place of the operation instructed by the operator's voice. The speaker 23 then converts the generated response data into voice and presents the response to the operator. Even if the operator mistakenly instructs a restricted operation by voice, the work vehicle voice operation system 1A presents the operator with an executable operation in place of the voice-instructed operation, so voice operation proceeds smoothly. Voice operation is also easy.

When the operator mistakenly instructs a restricted operation by voice, the work vehicle voice operation system 1A does not execute the restricted operation and instead presents the operator with an executable operation in its place, so safety can be ensured.

In addition, in the work vehicle voice operation system 1A, the voice recognition unit that recognizes the operator's voice instructions is provided in the server 3. Having the server 3 execute the computationally heavy voice recognition processing reduces the computational load on the mobile terminal 2A and also speeds up processing.

(Embodiment 2)
In the first embodiment, the control unit 22 of the mobile terminal 2A includes the executable operation selection unit 222. However, the present invention is not limited to this. In the present invention, the controller 4A of the aerial work platform 5 may include the executable operation selection unit 222.

In the work vehicle voice operation system 1B according to the second embodiment, as described above, the controller 4B includes an executable operation selection unit 41. The work vehicle voice operation system 1B is described below with reference to FIG. 8. The description of the second embodiment covers only the configuration that differs from the first embodiment.

FIG. 8 is a block diagram of the controller 4B and of the control unit 22 of the mobile terminal 2B provided in the work vehicle voice operation system 1B according to the second embodiment.

As shown in FIG. 8, in the work vehicle voice operation system 1B, the mobile terminal 2B includes a voice recognition result acquisition unit 221, a voice recognition result analysis unit 224, and a response generation unit 223. The controller 4B includes the executable operation selection unit 41.

As in the first embodiment, the voice recognition result acquisition unit 221 transmits the operator's voice data obtained from the microphone 21 to the voice recognition unit 311 of the server 3 and then acquires the voice recognition result from the voice recognition unit 311. The voice recognition result acquisition unit 221 then sends the voice recognition result to the voice recognition result analysis unit 224 and the response generation unit 223.

The voice recognition result analysis unit 224 reads the command dictionary 242 from the storage unit 24 and determines whether the voice recognition result received from the voice recognition result acquisition unit 221 includes the words for a command object and command content in the command dictionary 242, or words similar to them. If the voice recognition result analysis unit 224 determines that those words or similar words are included, it sends that command object and command content to the executable operation selection unit 41 of the controller 4B. If it determines that they are not included, it sends the response generation unit 223 an analysis result indicating that the voice recognition result does not relate to a command.

As described in the first embodiment, the controller 4B determines the attitude or state of the aerial work platform 5 from the detection results of the detectors 58A-58E and, to ensure safety, determines the restricted operations from that attitude or state. At this time, although this was not described in the first embodiment, the controller 4B turns on a flag in the data of each operation that is a restricted operation, so that it can identify whether an operation is restricted. For example, when the jacks are not set, it turns on the flag in the boom operation data.

When the executable operation selection unit 41 receives the command object and command content data from the voice recognition result analysis unit 224, it determines whether the operation specified by that command object and command content is a restricted operation by checking whether the above flag is on. The executable operation selection unit 41 then sends the result of this determination to the response generation unit 223 of the mobile terminal 2B.

The executable operation selection unit 41 also extracts the operations whose flag is not on, that is, whose flag is off. In this way, it selects the operations that are executable at the time the data is received from the voice recognition result analysis unit 224. The executable operation selection unit 41 sends the data of the selected executable operations to the response generation unit 223 of the mobile terminal 2B.
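The flag-based handling in the controller 4B can be sketched as follows. The class, its method names, and the single "jacks not set" rule are assumptions made for illustration; the source only states that a flag is turned on for each restricted operation and that operations with the flag off are treated as executable.

```python
class ControllerSketch:
    """Illustrative stand-in for the flag handling described for the controller 4B."""

    def __init__(self, operations):
        # One restriction flag per (target, item) operation; False means executable.
        self.restricted_flag = {op: False for op in operations}

    def update_state(self, jacks_set):
        # Example rule from the text: boom operations are restricted
        # while the jacks are not set.
        for op in list(self.restricted_flag):
            if op[0] == "boom":
                self.restricted_flag[op] = not jacks_set

    def is_restricted(self, op):
        """Check the flag of the operation named by the command object and content."""
        return self.restricted_flag[op]

    def executable_operations(self):
        """Extract every operation whose flag is off."""
        return [op for op, flag in self.restricted_flag.items() if not flag]


controller = ControllerSketch([
    ("boom", "raise"),
    ("boom", "extend"),
    ("outrigger", "extend"),
    ("jack", "extend"),
])
controller.update_state(jacks_set=False)

boom_restricted = controller.is_restricted(("boom", "extend"))
still_executable = controller.executable_operations()
```

In this sketch both results would be sent to the response generation unit of the mobile terminal: the restricted-or-not determination for the requested operation, and the list of operations that remain executable.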

When the response generation unit 223 receives from the voice recognition result analysis unit 224 an analysis result indicating that the voice recognition result does not relate to a command, it reads the response database 243 from the storage unit 24. As described in the first embodiment, the response database 243 stores responses in association with the vocabulary contained in voice recognition results that do not relate to commands. The response generation unit 223 generates a response by referring to the response database 243 and conveys it to the operator by outputting the generated response data to the speaker 23.

On the other hand, when the voice recognition result analysis unit 224 has determined that the voice recognition result relates to a command and the executable operation selection unit 41 has consequently determined whether that command is a restricted operation, the response generation unit 223 receives that determination result from the executable operation selection unit 41.

When the response generation unit 223 receives a determination result indicating that the operation is a restricted operation, it uses the executable operation data received from the executable operation selection unit 41 to generate a response proposing an executable operation in place of the voice-instructed operation. By outputting the generated response data to the speaker 23, it informs the operator that there is an executable operation that can be performed instead of the operation instructed by voice.

When the response generation unit 223 receives a determination result indicating that the operation is not a restricted operation, it reads the command dictionary 242 from the storage unit 24. In the command dictionary 242, as described in the first embodiment, the response to each command is associated with a command object and command content. The response to a command states that the operation will be executed as commanded. The response generation unit 223 generates a response using that response to the command. It then outputs the generated response data to the speaker 23, informing the operator that the operation will be executed as instructed by voice.

As described above, in the work vehicle voice operation system 1B according to the second embodiment, as in the first embodiment, when the voice recognition result includes a command for operating the work vehicle and also includes a word for a restricted operation, the response generation unit 223 generates a response recommending an executable operation in place of the operation instructed by the operator's voice. The response is then presented to the operator. As a result, even if the operator mistakenly instructs a restricted operation by voice, the operator is not left confused by the voice instruction going unexecuted. In addition, because an executable operation in place of the voice-instructed operation is presented to the operator, voice operation is smooth and easy.

Although embodiments of the present invention have been described above, the present invention is not limited to these embodiments. For example, in the first and second embodiments, the speaker 23 converts the generated response data into voice and presents the response to the operator. However, the present invention is not limited to this. In the present invention, it suffices that the presentation unit presents the response generated by the response generation unit 223. The mobile terminals 2A and 2B include a display device (not shown) such as a liquid crystal display or an organic EL (electroluminescence) display, and this display device may present the response in addition to the speaker 23. In this case, the response generated by the response generation unit 223 is preferably displayed on the display device as text.

In the first and second embodiments, the examples of restricted operations are boom operation when the jacks are not set, outrigger operation when the boom 54 is in a working attitude, and boom operation and outrigger operation when the stop switch is on, but the present invention is not limited to these. In the present invention, a restricted operation may be any operation that is restricted under a certain attitude or state of the aerial work platform 5. For example, a restricted operation may be boom operation when the boom 54 is about to exceed its working range.

In the first and second embodiments, the server 3 includes the voice recognition unit 311. However, the present invention is not limited to this. In the present invention, the control unit 22 of the mobile terminals 2A and 2B may include the voice recognition unit 311. In other words, the voice recognition processing may be performed by the microcomputer provided in the control unit 22 of the mobile terminals 2A and 2B rather than by the computer provided in the control unit 31 of the server 3.

Although the server 3 includes the voice recognition unit 311, the server 3 may be a device dedicated to voice recognition processing, that is, a voice recognition device.

In the first embodiment, the control unit 22 of the mobile terminal 2A includes the voice recognition result acquisition unit 221, the executable operation selection unit 222, and the response generation unit 223. In the second embodiment, the control unit 22 of the mobile terminal 2B includes the voice recognition result acquisition unit 221, the voice recognition result analysis unit 224, and the response generation unit 223. However, the present invention is not limited to this. In the present invention, it suffices that the system made up of the mobile terminals 2A and 2B, the server 3, the aerial work platform 5, and so on includes the voice recognition result acquisition unit 221, the executable operation selection unit 41 or 222, and the response generation unit 223. For example, the server 3 may include the voice recognition result acquisition unit 221, the executable operation selection unit 41 or 222, and the response generation unit 223. In this case, the control unit 22 of the mobile terminals 2A and 2B preferably transmits the input data from the microphone 21 to the server 3, receives the response data transmitted from the server 3, and drives the speaker 23.

The mobile terminals 2A and 2B may also include the voice recognition unit 311 in addition to the voice recognition result acquisition unit 221, the executable operation selection unit 41 or 222, and the response generation unit 223. Alternatively, a server separate from the server 3 may include the voice recognition result acquisition unit 221, the executable operation selection unit 41 or 222, and the response generation unit 223, and that server, the server 3, and the mobile terminals 2A and 2B may communicate with one another to carry out the series of processes from voice recognition to response generation.

Furthermore, in the first and second embodiments, the work vehicle operation database 241, the command dictionary 242, the response database 243, and the operation history database 244 are stored in the storage unit 24 of the mobile terminals 2A and 2B, but they may instead be stored in, for example, a storage unit of the server 3 or of a server separate from the server 3. In other words, the work vehicle operation database 241, the command dictionary 242, the response database 243, and the operation history database 244 may be stored in a storage unit connected via the network 100.

1A, 1B... Work vehicle voice operation system, 2A, 2B... Mobile terminal, 3... Server, 4A, 4B... Controller, 5... Aerial work platform, 21... Microphone, 22... Control unit, 23... Speaker, 24... Storage unit, 25... Communication unit, 31... Control unit, 32... Storage unit, 33... Communication unit, 41... Executable operation selection unit, 51... Vehicle, 52A-52D... Outrigger, 53... Swing base, 54... Boom, 55... Bucket, 56... Operation device, 57... Drive unit, 58A-58E... Detector, 61... Storage device, 62... Communication device, 71... Restricted operation information, 100... Network, 221... Voice recognition result acquisition unit, 222... Executable operation selection unit, 223... Response generation unit, 224... Voice recognition result analysis unit, 241... Work vehicle operation database, 242... Command dictionary, 243... Response database, 244... Operation history database, 311... Voice recognition unit, 321... Voice recognition dictionary

Claims

A work vehicle voice operation system comprising:
a voice recognition result acquisition unit that acquires a voice recognition result from a voice recognition device that recognizes voice instructions uttered by an operator to operate the work vehicle;
an executable operation selection unit that selects an executable operation that can be executed under the attitude or state of the work vehicle based on the attitude or state detected by an attitude detector or state detector equipped in the work vehicle;
a response generation unit that determines whether the voice recognition result acquired by the voice recognition result acquisition unit includes an instruction for operating the work vehicle, and if it is determined that the instruction is not included, generates a response by referring to a response database in which responses to vocabulary other than the instruction included in the voice recognition result are stored, and if it is determined that the instruction is included, further determines whether the instruction is an instruction for a restricted operation that is restricted under the attitude or state, and if it is determined that the instruction is an instruction for the restricted operation, generates a response recommending the executable operation instead of the instruction, and if it is determined that the instruction is not an instruction for the restricted operation, generates a response to the effect that the operation of the instruction will be executed; and
a presentation unit that presents the response generated by the response generation unit.

A work vehicle voice operation system comprising:
a voice recognition result acquisition unit that acquires a voice recognition result from a voice recognition device that recognizes voice instructions uttered by an operator to operate the work vehicle;
an executable operation selection unit that selects an executable operation that can be executed under the attitude or state of the work vehicle based on the attitude or state detected by an attitude detector or state detector equipped in the work vehicle;
a response generation unit that, when the voice recognition result acquired by the voice recognition result acquisition unit includes an instruction to operate the work vehicle and the instruction is an instruction for a restricted operation that is restricted under the attitude or state, generates a response recommending the executable operation instead of the instruction, and, when the voice recognition result acquired by the voice recognition result acquisition unit includes an instruction to operate the work vehicle and the instruction is not an instruction for the restricted operation, generates a response to execute the operation of the instruction;
a presentation unit that presents the response generated by the response generation unit; and
a storage unit that stores an operation history by storing the operation content of the voice instruction every time the response generation unit generates a response to execute the operation instructed by the operator by voice,
wherein the response generation unit, when the voice recognition result acquired by the voice recognition result acquisition unit includes an instruction to operate the work vehicle, the instruction is an instruction for the restricted operation, and there are a plurality of executable operations selected by the executable operation selection unit, generates a response recommending, from among the plurality of executable operations, an executable operation that corresponds to a most recent operation or a most frequently performed operation in the history data stored in the storage unit.

A voice operation method for a work vehicle, comprising:
a voice recognition step in which a first computer or a second computer recognizes voice instructions uttered by an operator to operate the work vehicle;
an executable operation selection step in which the second computer selects an executable operation that can be executed under the attitude or state of the work vehicle based on the attitude or state detected by an attitude detector or state detector equipped in the work vehicle; and
a response generation step in which the second computer determines whether the voice recognition result obtained by voice recognition in the voice recognition step includes an instruction to operate the work vehicle, and if it determines that the instruction is not included, generates a response by referring to a response database in which responses to vocabulary other than the instruction included in the voice recognition result are stored, and if it determines that the instruction is included, further determines whether the instruction is an instruction for a restricted operation that is restricted under the attitude or state, and if it determines that the instruction is an instruction for the restricted operation, generates a response recommending the executable operation instead of the instruction, and if it determines that the instruction is not an instruction for the restricted operation, generates a response to the effect that the operation of the instruction is to be executed.

A program for voice-operating a work vehicle using a microphone that converts a voice instruction uttered by an operator to operate the work vehicle into an electrical signal and outputs the electrical signal as voice data, and a voice recognition device that performs voice recognition on the voice data, the program causing a computer to execute:
a voice recognition result acquisition step of acquiring a voice recognition result from the voice recognition device;
an executable operation selection step of selecting, based on an attitude or state of the work vehicle detected by an attitude detector or state detector provided on the work vehicle, an executable operation that can be executed under that attitude or state; and
a response generating step of determining whether the voice recognition result acquired in the voice recognition result acquisition step includes an instruction to operate the work vehicle; if it is determined that the instruction is not included, generating a response by referring to a response database that stores responses to vocabulary, other than the instruction, included in the voice recognition result; if it is determined that the instruction is included, further determining whether the instruction is an instruction for a restricted operation that is restricted under the attitude or state; if it is determined that the instruction is an instruction for the restricted operation, generating a response recommending the executable operation in place of the instruction; and if it is determined that the instruction is not an instruction for the restricted operation, generating a response indicating that the instructed operation is to be executed.
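The executable operation selection step can be sketched as a rule table evaluated against the detected state. This is an illustration under stated assumptions: the `VehicleState` type, the operation names, and the rule table are hypothetical. Only the rule that the boom cannot be operated until the outrigger jacks are set comes from the description.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class VehicleState:
    """Hypothetical snapshot of what the state detector reports."""
    jacks_set: bool


# Each operation maps to a predicate that says whether the state permits it.
# The boom rules mirror the outrigger example in the description; the jack
# rule is an assumption added so that something is executable in both states.
OPERATION_RULES: Dict[str, Callable[[VehicleState], bool]] = {
    "set outrigger jacks": lambda s: not s.jacks_set,
    "raise boom": lambda s: s.jacks_set,
    "lower boom": lambda s: s.jacks_set,
}


def select_executable_operations(state: VehicleState) -> List[str]:
    """Operations that can be executed under the detected state."""
    return [op for op, allowed in OPERATION_RULES.items() if allowed(state)]


def select_restricted_operations(state: VehicleState) -> List[str]:
    """Operations that are restricted under the detected state."""
    return [op for op, allowed in OPERATION_RULES.items() if not allowed(state)]
```

Keeping the rules in a table rather than in branching code means the executable and restricted lists are always derived from the same state, so they cannot disagree.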

A voice operation method for a work vehicle, comprising:
a voice recognition step in which a first computer or a second computer recognizes a voice instruction uttered by an operator to operate the work vehicle;
an executable operation selection step in which the second computer selects, based on an attitude or state of the work vehicle detected by an attitude detector or state detector provided on the work vehicle, an executable operation that can be executed under that attitude or state;
a response generating step in which, when the voice recognition result obtained in the voice recognition step includes an instruction to operate the work vehicle and the instruction is an instruction for a restricted operation that is restricted under the attitude or state, the second computer generates a response recommending the executable operation in place of the instruction, and, when the voice recognition result includes an instruction to operate the work vehicle and the instruction is not an instruction for the restricted operation, the second computer generates a response indicating that the instructed operation is to be executed; and
a storage step in which the second computer stores a history of operations by storing the operation content of the voice instruction in a storage device each time the response generating step generates a response indicating that the operation instructed by the operator by voice is to be executed,
wherein, in the response generating step, when the voice recognition result obtained in the voice recognition step includes an instruction to operate the work vehicle, the instruction is an instruction for the restricted operation, and a plurality of executable operations have been selected in the executable operation selection step, the second computer generates a response recommending, from among the plurality of executable operations, the executable operation that corresponds to the most recent operation or the most frequently performed operation in the history data stored in the storage device.
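Choosing one recommendation from several executable operations by consulting the stored history might look like the following sketch. The function name, the `policy` parameter, and the list-based history are hypothetical. The claim does not say what happens on a tie or when no executable operation appears in the history, so the fallback to the first executable operation is an assumption.

```python
from collections import Counter
from typing import Optional, Sequence


def recommend_from_history(executable: Sequence[str],
                           history: Sequence[str],
                           policy: str = "recent") -> Optional[str]:
    """Pick the executable operation favoured by the operation history.

    history is ordered oldest to newest. policy is "recent" or "frequent".
    """
    if not executable:
        return None
    if policy == "recent":
        # Walk the history backwards and stop at the first executable entry.
        for op in reversed(history):
            if op in executable:
                return op
        return executable[0]  # assumed fallback: nothing relevant in history
    if policy == "frequent":
        counts = Counter(op for op in history if op in executable)
        # max() keeps the earliest candidate on ties (assumed tie-break);
        # with no relevant history every count is 0, so executable[0] wins.
        return max(executable, key=lambda op: counts[op])
    raise ValueError(f"unknown policy: {policy}")
```

With `history = ["set jack", "raise boom", "raise boom", "lower boom"]` and `executable = ["raise boom", "lower boom"]`, the "recent" policy selects "lower boom" and the "frequent" policy selects "raise boom".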

A program for voice-operating a work vehicle using a microphone that converts a voice instruction uttered by an operator to operate the work vehicle into an electrical signal and outputs the electrical signal as voice data, a voice recognition device that performs voice recognition on the voice data, and a storage device, the program causing a computer to execute:
a voice recognition result acquisition step of acquiring a voice recognition result from the voice recognition device;
an executable operation selection step of selecting, based on an attitude or state of the work vehicle detected by an attitude detector or state detector provided on the work vehicle, an executable operation that can be executed under that attitude or state;
a response generating step of generating a response recommending the executable operation in place of the instruction when the voice recognition result acquired in the voice recognition result acquisition step includes an instruction to operate the work vehicle and the instruction is an instruction for a restricted operation that is restricted under the attitude or state, and generating a response indicating that the instructed operation is to be executed when the voice recognition result includes an instruction to operate the work vehicle and the instruction is not an instruction for the restricted operation; and
a storage step of storing a history of operations by storing the operation content of the voice instruction in the storage device each time a response indicating that the operation instructed by the operator by voice is to be executed is generated in the response generating step,
wherein, in the response generating step, when the voice recognition result acquired in the voice recognition result acquisition step includes an instruction to operate the work vehicle, the instruction is an instruction for the restricted operation, and a plurality of executable operations have been selected in the executable operation selection step, a response is generated recommending, from among the plurality of executable operations, the executable operation that corresponds to the most recent operation or the most frequently performed operation in the history data stored in the storage device.