CN108763329A - Evaluating method, device and the computer equipment of voice interactive system IQ level - Google Patents

Info

Publication number
CN108763329A
Authority
CN
China
Prior art keywords
instruction
level
cycles
voice interaction
interaction system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810433832.0A
Other languages
Chinese (zh)
Inventor
徐鹏飞
黄林轶
黄登甲
刘嘉祁
徐华伟
彭琦
刘群兴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Electronic Product Reliability and Environmental Testing Research Institute
Original Assignee
China Electronic Product Reliability and Environmental Testing Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Electronic Product Reliability and Environmental Testing Research Institute
Priority to CN201810433832.0A
Publication of CN108763329A
Legal status: Pending

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The present invention relates to a method, device, and computer equipment for evaluating the IQ level of a voice interaction system. The method includes: obtaining a test question, the test question being a voice question; obtaining the number of instruction cycles the voice interaction system takes to successfully respond to the test question, where one instruction cycle is one dialogue round between the voice interaction system and the outside world; and determining the evaluation result of the IQ level of the voice interaction system from the number of instruction cycles. Embodiments of the invention address the low reliability of evaluation results for the IQ level of voice interaction systems; the evaluation process is simple to implement and the results are more objective.

Description

Evaluation method, device, and computer equipment for the IQ level of a voice interaction system

Technical field

The present invention relates to the technical field of evaluation, and in particular to a method, device, computer equipment, and storage medium for evaluating the IQ level of a voice interaction system.

Background

With the development of mobile Internet technology and smart hardware devices, people are no longer satisfied with human-computer interaction methods such as keyboard input and handwriting input. Voice interaction technology has been widely applied in fields such as electronic information, the Internet, medical care, education, and office work, covering products such as intelligent voice input systems, intelligent voice assistants, smart speakers, in-vehicle voice systems, intelligent voice-assisted medical systems, intelligent spoken-language evaluation systems, and intelligent conference systems.

A voice interaction system gives a smart device listening and recognition abilities similar to a human's, automatically converts the natural semantics expressed in human language into structured semantics that a computer can understand and act on, and thereby completes the human-computer interaction. To evaluate the performance of a voice interaction system, the common industry practice is to measure how closely the system's recognition results match standard speech recognition results.

In the course of realizing the present invention, the inventors found the following problem in the prior art. Traditional performance evaluation methods require a test question library and a database of standard speech recognition results to be built in advance. Because the samples in the database are limited, the given standard speech recognition results cannot account for practical factors such as language expression habits, the diversity of valid answers, and subjective arbitrariness. As a result, the evaluation results for the IQ level (also called comprehension ability or degree of intelligence) of a voice interaction system have low reliability.

Summary of the invention

Based on this, and in view of the low reliability of existing approaches to evaluating the IQ level of a voice interaction system, it is necessary to provide a method, device, computer equipment, and storage medium for evaluating the IQ level of a voice interaction system.

A first aspect of the embodiments of the present invention provides a method for evaluating the IQ level of a voice interaction system, including:

obtaining a test question, the test question being a voice question;

obtaining the number of instruction cycles the voice interaction system takes to successfully respond to the test question, where one instruction cycle is one dialogue round between the voice interaction system and the outside world;

determining the evaluation result of the IQ level of the voice interaction system from the number of instruction cycles.

In one embodiment, determining the evaluation result of the IQ level of the voice interaction system from the number of instruction cycles includes:

determining the instruction cycle cost corresponding to the test question from the number of instruction cycles;

determining the evaluation result of the IQ level of the voice interaction system from the instruction cycle cost.

In one embodiment, obtaining a test question includes: obtaining a plurality of preset test questions;

and determining the evaluation result of the IQ level of the voice interaction system from the instruction cycle cost includes:

determining the evaluation result score of the IQ level of the voice interaction system according to the instruction cycle cost corresponding to each of the plurality of test questions.

In one embodiment, determining the evaluation result score of the IQ level of the voice interaction system according to the instruction cycle cost corresponding to each of the plurality of test questions includes:

obtaining the weight corresponding to each of the plurality of test questions;

determining the evaluation result score of the IQ level of the voice interaction system according to the instruction cycle cost and the weight corresponding to each of the plurality of test questions.

In one embodiment, obtaining the weight corresponding to each of the plurality of test questions includes:

obtaining the type corresponding to each of the plurality of test questions;

obtaining the weight corresponding to each type and using it as the weight of the corresponding test question.

In one embodiment, determining the instruction cycle cost corresponding to the test question from the number of instruction cycles includes:

obtaining a preset cycle limit;

determining the instruction cycle cost corresponding to the test question from the number of instruction cycles and the cycle limit.

In one embodiment, determining the instruction cycle cost corresponding to the test question from the number of instruction cycles and the cycle limit includes:

if the number of instruction cycles is N1, determining that the instruction cycle cost corresponding to the test question is cost score S1;

if the number of instruction cycles is N2, determining that the instruction cycle cost corresponding to the test question is cost score S2;

if the number of instruction cycles is N3, determining that the instruction cycle cost corresponding to the test question is cost score S3;

or,

if the number of instruction cycles is N1, determining that the instruction cycle cost corresponding to the test question is cost level L1;

if the number of instruction cycles is N2, determining that the instruction cycle cost corresponding to the test question is cost level L2;

if the number of instruction cycles is N3, determining that the instruction cycle cost corresponding to the test question is cost level L3;

where 0<N1<N2<M≤N3, N1 is a preset number of cycles, and M is the cycle limit; S3<S2<S1, and the levels corresponding to L1, L2, and L3 decrease in that order.

In one embodiment, when the number of instruction cycles is between N1 and M, the cost score S2 decreases as the number of instruction cycles increases.

In one embodiment, the decrease of the cost score S2 includes at least one of linear decrease, exponential decay, and logarithmic decay.

A second aspect of the embodiments of the present invention provides a device for evaluating the IQ level of a voice interaction system, including:

a question acquisition module, configured to obtain a test question, the test question being a voice question;

a cycle counting module, configured to obtain the number of instruction cycles the voice interaction system takes to successfully respond to the test question, where one instruction cycle is one dialogue round between the voice interaction system and the outside world;

and,

a result determination module, configured to determine the evaluation result of the IQ level of the voice interaction system from the number of instruction cycles.

A third aspect of the embodiments of the present invention provides computer equipment, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor, when executing the computer program, implements the above method for evaluating the IQ level of a voice interaction system.

A fourth aspect of the embodiments of the present invention provides a computer storage medium storing a computer program which, when executed by a processor, implements the above method for evaluating the IQ level of a voice interaction system.

By implementing the embodiments provided by the present invention, the number of dialogue rounds the voice interaction system takes to successfully respond to a preset test question is obtained. This number reflects the instruction cycle cost, and the evaluation result of the IQ level of the voice interaction system is then determined from it. Because the core reference of the evaluation is the number of dialogue rounds needed for a successful response, the method accommodates practical factors such as varied language expression habits, the diversity of valid answers, and subjective arbitrariness, which helps improve the reliability of the evaluation results.

Brief description of the drawings

FIG. 1 is a diagram of the application environment of the method for evaluating the IQ level of a voice interaction system in one embodiment;

FIG. 2 is a schematic flowchart of the method for evaluating the IQ level of a voice interaction system according to one embodiment;

FIG. 3 is a schematic flowchart of the method for evaluating the IQ level of a voice interaction system according to another embodiment;

FIG. 4 is a schematic structural diagram of the device for evaluating the IQ level of a voice interaction system according to one embodiment;

FIG. 5 is a diagram of the internal structure of the computer equipment in one embodiment.

Detailed description

To make the purpose, technical solution, and advantages of the present invention clearer, the invention is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the invention and do not limit it.

Reference herein to an "embodiment" means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one embodiment of the present application. Occurrences of this phrase in various places in the specification do not necessarily all refer to the same embodiment, nor do they refer to separate or alternative embodiments that are mutually exclusive with other embodiments. Those skilled in the art understand, explicitly and implicitly, that the embodiments described herein can be combined with other embodiments.

The method for evaluating the IQ level of a voice interaction system provided in this application can be applied in the application environment shown in FIG. 1. The smart terminal 102 is a terminal equipped with a voice interaction system, for example a smart speaker, a driverless vehicle, an intelligent voice assistant, or another terminal with a voice interaction function. The evaluation terminal 104 evaluates the performance of the smart terminal 102. In one embodiment, the evaluation terminal 104 outputs voice information and evaluates the IQ level of the voice interaction system of the smart terminal 102 according to how the smart terminal 102 responds to that voice information. In other embodiments, the step of outputting voice information may instead be performed by another third-party terminal or by a user.

In one embodiment, as shown in FIG. 2, a method for evaluating the IQ level of a voice interaction system is provided. Taking its application to the evaluation terminal in FIG. 1 as an example, the method includes the following steps:

S110: obtain a test question, the test question being a voice question.

Here, a test question is voice information to be sent to the voice interaction system, for example "Call XXX" or "Check today's weather".

S120: obtain the number of instruction cycles the voice interaction system takes to successfully respond to the test question, where one instruction cycle is one dialogue round between the voice interaction system and the outside world.

In an embodiment of the invention, consider the following scenario:

User: Call XXX. (user instruction)

System: OK, call contact XXX now? (system execution)

User: Yes. (user instruction)

System: Calling. (system execution)

An instruction cycle corresponds to one human-machine dialogue round, so the interaction above contains 2 instruction cycles.

In this scenario the test question is "Call XXX", and the number of instruction cycles the voice interaction system takes to successfully respond to it is 2.

In another scenario:

User: Call XXX. (user instruction)

System: Sorry, I didn't hear that clearly. (system execution)

User: Call XXX. (user instruction)

System: OK, call contact XXX now? (system execution)

User: Yes. (user instruction)

System: Calling. (system execution)

In this scenario, for the same test question ("Call XXX"), the number of instruction cycles the voice interaction system takes to successfully respond is 3.

As these examples show, the number of instruction cycles for a successful response is the number of dialogue rounds from the moment the voice interaction system receives the voice test question to the moment it executes the instruction corresponding to that test question.
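The counting rule illustrated by the two scenarios can be sketched in a few lines. This is only an illustration: the transcript format (a list of speaker/utterance pairs) and the function name `count_instruction_cycles` are assumptions, and the sketch presumes the transcript ends when the system executes the requested instruction.

```python
from typing import List, Tuple

def count_instruction_cycles(turns: List[Tuple[str, str]]) -> int:
    """Count dialogue rounds: a user instruction answered by the system is one cycle.

    `turns` is a hypothetical transcript of (speaker, utterance) pairs, where
    speaker is "user" or "system".
    """
    cycles = 0
    awaiting_reply = False
    for speaker, _utterance in turns:
        if speaker == "user":
            awaiting_reply = True
        elif speaker == "system" and awaiting_reply:
            cycles += 1
            awaiting_reply = False
    return cycles

scenario_1 = [
    ("user", "Call XXX."),
    ("system", "OK, call contact XXX now?"),
    ("user", "Yes."),
    ("system", "Calling."),
]
scenario_2 = [
    ("user", "Call XXX."),
    ("system", "Sorry, I didn't hear that clearly."),
] + scenario_1

print(count_instruction_cycles(scenario_1))  # 2
print(count_instruction_cycles(scenario_2))  # 3
```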

S130: determine the evaluation result of the IQ level of the voice interaction system from the number of instruction cycles.

In the embodiments of the present invention, a voice interaction system with a lower IQ level will very likely need more instruction cycles to successfully respond to a given test question, and a system with a higher IQ level will very likely need fewer.

With the method of the above embodiment, the number of instruction cycles the voice interaction system takes to successfully respond to a preset test question is obtained. This number reflects the instruction cycle cost, and the evaluation result of the IQ level of the voice interaction system is then determined from it. Because the core reference of the evaluation is the number of instruction cycles needed for a successful response, the method accommodates practical factors such as varied language expression habits, the diversity of valid answers, and subjective arbitrariness, which helps improve the reliability of the evaluation results.

Moreover, evaluating by the number of instruction cycles spent on a successful response assesses the system's speech recognition, semantic understanding, and semantic processing abilities together. The evaluation dimensions are comprehensive, which from another angle also improves the reliability of the evaluation results.

In addition, no database of standard speech recognition results needs to be built in advance, which reduces the complexity of the evaluation process.

In one embodiment, step S130 includes: determining the instruction cycle cost corresponding to the test question from the number of instruction cycles, and then determining the evaluation result of the IQ level of the voice interaction system from the instruction cycle cost.

The more instruction cycles the voice interaction system needs to successfully respond to a given test question, the more time it takes and the more tasks the system executes, so the corresponding instruction cycle cost is higher. The number of instruction cycles therefore reflects the instruction cycle cost, and the cost in turn gives a more intuitive basis for determining the evaluation result of the IQ level of the voice interaction system.

In one embodiment, multiple test questions may take part in the evaluation and are stored in a database in advance. When the voice interaction system is evaluated, the test questions may be fetched from the database one after another, or a subset may be selected according to a certain rule, which may also be random selection.

In other embodiments, the test questions may be entirely random, with no need to preset and store them: the user, the evaluation terminal, or a third-party terminal issues voice test questions at random, and the performance of the voice interaction system is then evaluated comprehensively from the evaluation information for each test question.

Further, the evaluation result of the IQ level of the voice interaction system can be determined according to the instruction cycle cost corresponding to each of the multiple test questions. For example: obtain the weight corresponding to each test question, then determine the evaluation result from the instruction cycle cost and the weight of each test question. The weight of a test question may be taken from its type. Types may be divided by how complex the question is to understand, by the minimum number of instruction cycles required for a successful response, or by the language or accent of the question's voice (for example English and Chinese, or Mandarin, Cantonese, and other dialects). Setting a weight for each type of test question shows how the voice interaction system responds to different types of speech, and on that basis the IQ level of the system can be evaluated comprehensively.
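As a rough illustration of taking each question's weight from its type, the sketch below resolves weights through a lookup table. The type names, the weight values, and the names `TypedQuestion`, `TYPE_WEIGHTS`, and `question_weights` are all hypothetical; the text does not prescribe specific types or weights.

```python
from typing import Dict, List, NamedTuple

class TypedQuestion(NamedTuple):
    text: str
    qtype: str  # hypothetical type label, e.g. the language or accent of the question

# Hypothetical per-type weights; real values would depend on the test design.
TYPE_WEIGHTS: Dict[str, float] = {"mandarin": 5.0, "cantonese": 3.0, "english": 2.0}

def question_weights(questions: List[TypedQuestion],
                     type_weights: Dict[str, float]) -> List[float]:
    """Return one weight per question, taken from the weight of the question's type."""
    missing = {q.qtype for q in questions} - set(type_weights)
    if missing:
        raise KeyError(f"no weight configured for types: {sorted(missing)}")
    return [type_weights[q.qtype] for q in questions]

questions = [
    TypedQuestion("Call XXX.", "mandarin"),
    TypedQuestion("Check today's weather.", "english"),
]
print(question_weights(questions, TYPE_WEIGHTS))  # [5.0, 2.0]
```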

Evaluating the voice interaction system with multiple test questions further improves the reliability of the evaluation results.

In one embodiment, determining the instruction cycle cost corresponding to the test question from the number of instruction cycles may specifically include: obtaining a preset cycle limit, and determining the instruction cycle cost corresponding to the test question from the number of instruction cycles and the cycle limit. For example:

If the number of instruction cycles is N1, the instruction cycle cost corresponding to the test question is cost score S1; if the number of instruction cycles is N2, the instruction cycle cost is cost score S2; if the number of instruction cycles is N3, the instruction cycle cost is cost score S3.

Here 0<N1<N2<M≤N3, N1 is a preset number of cycles, M is the cycle limit, and S3<S2<S1. In the embodiments of the present invention, a higher cost score indicates a lower cost for the voice interaction system to successfully respond to the test question. The preset number of cycles is determined by the minimum number of instruction cycles needed to successfully respond to a test question; N1 may be 1 or another integer, depending on the actual situation.
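The three-way mapping from cycle count to cost score can be sketched as a small piecewise function. The default values (N1 = 1, M = 5, S1 = 100, S2 = 60, S3 = 0) and the function name `cost_score` are illustrative assumptions, as is the choice to treat any count at or below N1 as the best case.

```python
def cost_score(cycles: int, n1: int = 1, limit: int = 5,
               s1: float = 100.0, s2: float = 60.0, s3: float = 0.0) -> float:
    """Map an instruction cycle count to a cost score, with 0 < N1 < N2 < M <= N3.

    n1 is the preset (minimum) number of cycles and limit is the cycle limit M.
    All default values are hypothetical.
    """
    if not (0 < n1 < limit):
        raise ValueError("expected 0 < n1 < limit")
    if not (s3 < s2 < s1):
        raise ValueError("expected s3 < s2 < s1")
    if cycles < 1:
        raise ValueError("cycles must be a positive integer")
    if cycles <= n1:      # answered in the minimum number of rounds: score S1
        return s1
    if cycles < limit:    # N1 < cycles < M: score S2
        return s2
    return s3             # cycles >= M: score S3

print([cost_score(n) for n in range(1, 7)])
# [100.0, 60.0, 60.0, 60.0, 0.0, 0.0]
```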

Correspondingly, the evaluation result of the IQ level of the voice interaction system can be determined from the cost score of the instruction cycle cost of the test question.

In other embodiments, the instruction cycle cost corresponding to the test question may be expressed in a form other than a cost score, for example as a level:

If the number of instruction cycles is N1, the instruction cycle cost corresponding to the test question is cost level L1; if the number of instruction cycles is N2, the instruction cycle cost is cost level L2; if the number of instruction cycles is N3, the instruction cycle cost is cost level L3. The levels corresponding to L1, L2, and L3 decrease in that order. In the embodiments of the present invention, a higher cost level indicates a lower cost for the voice interaction system to successfully respond to the test question. Which level counts as higher may be determined from the level numbers, for example level 1 lower than level 2 or level 1 higher than level 2, depending on the actual situation.
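The level-based variant can be sketched with an ordered enumeration. The numeric values behind the levels and the names `CostLevel` and `cost_level` are assumptions; here a larger enum value is taken to mean a higher level, which is one of the two conventions the text allows.

```python
from enum import IntEnum

class CostLevel(IntEnum):
    """Hypothetical cost levels; a larger value means a higher level (lower cost)."""
    L3 = 1
    L2 = 2
    L1 = 3

def cost_level(cycles: int, n1: int = 1, limit: int = 5) -> CostLevel:
    """Map an instruction cycle count to a cost level, with 0 < N1 < N2 < M <= N3."""
    if not (0 < n1 < limit):
        raise ValueError("expected 0 < n1 < limit")
    if cycles < 1:
        raise ValueError("cycles must be a positive integer")
    if cycles <= n1:
        return CostLevel.L1
    if cycles < limit:
        return CostLevel.L2
    return CostLevel.L3

print(cost_level(1).name, cost_level(3).name, cost_level(5).name)  # L1 L2 L3
```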

Correspondingly, the evaluation result of the IQ level of the voice interaction system can be determined from the cost level of the instruction cycle cost of the test question.

On the basis of the above embodiments and with reference to FIG. 3, the method for evaluating the IQ level of a voice interaction system is illustrated further below, again taking its application to the evaluation terminal in FIG. 1 as an example. Multiple test questions take part in the evaluation, and the method includes the following steps:

Step S301: set multiple test questions and store them in a question database.

Step S302: divide the test questions into types, set a weight for the test questions of each type, and store this correspondence.

Step S303: obtain a test question from the question database and issue the voice corresponding to that test question.

The voice for a test question may be issued by the evaluation terminal, by a test user, or by a third-party device, for example by playing a recorded voice on the user's mobile phone.

步骤S304,获取所述语音交互系统成功响应该测试问题的指令周期数。Step S304, obtaining the number of instruction cycles for which the voice interaction system successfully responds to the test question.

针对不同的语音交互系统,由于智商水平的差异,会造成语音交互系统在识别、理解和处理语音指令时无法准确完成任务。这种情况下语音交互系统就会给出错误的答案、执行错误的操作或无法执行操作等情况。这时需要用户重新下达语音命令,如降低语速、改变语音指令表达方式等,使语音交互系统更好地识别和理解用户的语音。将使解决一个测试问题所需的指令周期数增加,加大了指令周期成本。因此,在语音交互系统成功响应一个语音指令时,所耗费的指令周期数越多,则反映出该语音交互系统的智商水平较低。For different voice interaction systems, due to differences in IQ levels, voice interaction systems cannot accurately complete tasks when recognizing, understanding, and processing voice commands. In this case, the voice interaction system will give wrong answers, perform wrong operations, or fail to perform operations. At this time, the user needs to re-issue the voice command, such as reducing the speech rate, changing the expression mode of the voice command, etc., so that the voice interaction system can better recognize and understand the user's voice. It will increase the number of instruction cycles required to solve a test problem, increasing the instruction cycle cost. Therefore, when the voice interaction system successfully responds to a voice command, the more command cycles it consumes, it reflects the lower IQ level of the voice interaction system.

Step S305: determine the instruction cycle cost corresponding to the test question from the number of instruction cycles.

For example, assume that the minimum number of instruction cycles the voice interaction system needs to respond successfully to the test question is 1, and that M instruction cycles is the cycle-number limit. If the system cannot provide a correct answer or a reasonable response within M instruction cycles, the cost score for the task is 0. If it provides a correct answer or reasonable response in the first instruction cycle, the cost score is the full score S_0. If it provides a correct answer or reasonable response in the N-th instruction cycle (1 < N < M), the cost score is S_N (0 < S_N < S_0). The score decreases from S_0 to S_N as N grows. The decreasing relationship includes but is not limited to linear decrease, exponential decay, logarithmic decay, or other types of decay, and can be chosen according to the needs of the system under test and the characteristics of the question set.
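One way to realize the scoring rule in this example is sketched below. The function name `cycle_cost_score`, the default full score of 100, the decay-rate parameter, and the three concrete formulas are assumptions chosen for illustration; the patent only requires that the score decrease between the first cycle and the limit M. Treating a success at exactly M cycles as a score of 0 follows the boundary 0 < N1 < N2 < M ≤ N3 given later in the document.

```python
import math
from typing import Optional


def cycle_cost_score(
    cycles: Optional[int],
    limit: int,
    full_score: float = 100.0,
    decay: str = "linear",
    rate: float = 0.5,
) -> float:
    """Map a number of instruction cycles to a cost score.

    cycles: cycle in which the system succeeded, or None if it never did.
    limit:  the cycle-number limit M (assumed to be at least 2).
    The formulas below are illustrative choices, not the patent's.
    """
    if limit < 2:
        raise ValueError("limit must be at least 2")
    if cycles is None or cycles >= limit:
        return 0.0  # no success before reaching the limit
    if cycles < 1:
        raise ValueError("cycles must be a positive integer")
    if cycles == 1:
        return full_score  # success in the first cycle: full score S_0
    if decay == "linear":
        return full_score * (limit - cycles) / (limit - 1)
    if decay == "exponential":
        return full_score * math.exp(-rate * (cycles - 1))
    if decay == "logarithmic":
        return full_score * (1.0 - math.log(cycles) / math.log(limit))
    raise ValueError(f"unknown decay: {decay}")
```

All three variants return the full score at one cycle and a value strictly between 0 and the full score for 1 < N < M, so any of them satisfies the stated constraint. Which one fits best depends on how sharply repeated attempts should be penalized.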

Step S306: check whether the question database still contains test questions that have not been evaluated. If so, return to step S303 and obtain the next test question for evaluation. If not, proceed to the next step.

Step S307: obtain the weight corresponding to each evaluated test question, and determine the evaluation result score of the voice interaction system's IQ level from the instruction cycle cost and weight of each test question.

That is, the final evaluation result score of the voice interaction system's IQ level is obtained by a combined calculation over the cost scores of the various question types and their weights. The combined calculation includes but is not limited to summation and multiplication.
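A minimal sketch of the combined calculation in step S307, assuming the simplest choice of a weighted sum with per-type weights. The function name `overall_score` and the example question types are hypothetical; the patent leaves the exact combination open.

```python
from typing import Dict, Iterable, Tuple


def overall_score(
    results: Iterable[Tuple[str, float]],
    type_weights: Dict[str, float],
) -> float:
    """Combine per-question cost scores into one evaluation score.

    results:      (question_type, cost_score) pairs, one per test question.
    type_weights: weight assigned to each question type (step S302).
    Uses a weighted sum; this is one assumed form of the combined
    calculation, not the only one the text allows.
    """
    return sum(type_weights[q_type] * score for q_type, score in results)


# Hypothetical example: three questions of three different types.
example_results = [("weather", 100.0), ("music", 50.0), ("navigation", 0.0)]
example_weights = {"weather": 0.2, "music": 0.3, "navigation": 0.5}
example_total = overall_score(example_results, example_weights)
```

If the weights sum to 1, the result stays on the same scale as the individual cost scores, which makes scores from different question sets easier to compare.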

The method of the above embodiment introduces the concept of instruction cycle cost: the instruction cycle cost that a voice interaction system consumes to successfully execute a task (that is, to successfully respond to a test question) is used as an indicator of its IQ level, and that IQ level is quantified. Because the instruction cycle cost is easy to compute and easy to express numerically, it can accurately distinguish the IQ levels of different voice interaction systems. It also avoids the limitations of evaluation methods based on matching standard answers, making the evaluation result more reasonable, objective, and reliable.

It can be understood that the evaluation method of the above embodiment applies to, but is not limited to, terminals with intelligent voice interaction functions, such as smart speakers, driverless vehicles, and intelligent voice assistants.

It should be understood that, for the foregoing method embodiments, although the steps in the flowcharts are shown in the order indicated by the arrows, they are not necessarily executed in that order. Unless explicitly stated herein, there is no strict restriction on the order of execution, and the steps may be executed in other orders. Moreover, at least some of the steps in the flowcharts may include multiple sub-steps or stages. These sub-steps or stages are not necessarily completed at the same moment but may be executed at different times, and they are not necessarily executed sequentially: they may be executed in turn or alternately with other steps, or with at least some of the sub-steps or stages of other steps.

Based on the same idea as the evaluation method in the above embodiments, this document also provides a device for evaluating the IQ level of a voice interaction system.

In one embodiment, as shown in FIG. 4, the device for evaluating the IQ level of a voice interaction system includes a question acquisition module 410, a cycle counting module 420, and a result determination module 430. The modules are described in detail as follows:

The question acquisition module 410 is configured to acquire a test question, the test question being a speech question.

The cycle counting module 420 is configured to obtain the number of instruction cycles the voice interaction system takes to respond successfully to the test question, where one instruction cycle is one dialogue round between the voice interaction system and the outside world.

The result determination module 430 is configured to determine the evaluation result of the voice interaction system's IQ level from the number of instruction cycles.

In one embodiment, the result determination module 430 includes:

A cost accounting submodule, configured to determine the instruction cycle cost corresponding to the test question from the number of instruction cycles; and a result determination submodule, configured to determine the evaluation result of the voice interaction system's IQ level from the instruction cycle cost.

In one embodiment, the question acquisition module 410 is configured to acquire multiple preset test questions. Correspondingly, the result determination module 430 is configured to determine the evaluation result score of the voice interaction system's IQ level from the instruction cycle costs corresponding to the respective test questions.

In one embodiment, the result determination submodule includes:

A weight acquisition unit, configured to acquire the weight corresponding to each of the test questions; and a result calculation unit, configured to determine the evaluation result score of the voice interaction system's IQ level from the instruction cycle cost and weight corresponding to each of the test questions.

In one embodiment, the weight acquisition unit is configured to acquire the type of each of the test questions and to use the weight corresponding to each type as the weight of the corresponding test question.

In one embodiment, the cost accounting submodule includes:

A limit acquisition unit, configured to acquire a preset cycle-number limit; and

A cost calculation unit, configured to determine the instruction cycle cost corresponding to the test question from the number of instruction cycles and the cycle-number limit.

In one embodiment, the cost calculation unit includes:

A first cost calculation unit, configured to determine, if the number of instruction cycles is N1, that the instruction cycle cost corresponding to the test question is cost score S1; a second cost calculation unit, configured to determine, if the number of instruction cycles is N2, that the instruction cycle cost corresponding to the test question is cost score S2; and a third cost calculation unit, configured to determine, if the number of instruction cycles is N3, that the instruction cycle cost corresponding to the test question is cost score S3.

Here 0 < N1 < N2 < M ≤ N3, where N1 is a preset number of cycles and M is the cycle-number limit; S3 < S2 < S1.

In another embodiment, the first cost calculation unit is configured to determine, if the number of instruction cycles is N1, that the instruction cycle cost corresponding to the test question is cost level L1; the second cost calculation unit is configured to determine, if the number of instruction cycles is N2, that the instruction cycle cost corresponding to the test question is cost level L2; and the third cost calculation unit is configured to determine, if the number of instruction cycles is N3, that the instruction cycle cost corresponding to the test question is cost level L3.

Here 0 < N1 < N2 < M ≤ N3, where N1 is a preset number of cycles and M is the cycle-number limit; the levels corresponding to L1, L2, and L3 decrease in that order.
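The level-based variant can be sketched as a simple three-way classification. The function name `cycle_cost_level` and the string labels are hypothetical, and treating any count at or below N1 as level L1 is an assumption: the text only states the case where the count equals N1.

```python
from typing import Optional


def cycle_cost_level(cycles: Optional[int], n1: int, limit: int) -> str:
    """Classify a number of instruction cycles into cost levels L1 > L2 > L3.

    cycles: cycle in which the system succeeded, or None if it never did.
    n1:     the preset number of cycles N1 (0 < N1 < M).
    limit:  the cycle-number limit M.
    """
    if not 0 < n1 < limit:
        raise ValueError("expected 0 < n1 < limit")
    if cycles is not None and cycles <= n1:
        return "L1"  # succeeded within the preset number of cycles
    if cycles is not None and cycles < limit:
        return "L2"  # succeeded after N1 but before the limit M
    return "L3"      # reached or exceeded the limit, or never succeeded
```

Levels are coarser than scores, which may be preferable when only a ranking band is needed rather than a numeric total.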

For specific limitations of the device for evaluating the IQ level of a voice interaction system, refer to the limitations of the evaluation method above, which are not repeated here. Each module in the evaluation device may be implemented wholly or partly in software, hardware, or a combination of the two. The modules may be embedded in, or independent of, the processor of a computer device in hardware form, or stored in the memory of the computer device in software form, so that the processor can call them and perform the operations corresponding to each module.

In addition, in the implementation of the evaluation device described above, the logical division into program modules is only an example. In practice the functions may be assigned to different program modules as needed, for example because of the configuration requirements of the hardware or for convenience of software implementation. That is, the internal structure of the evaluation device may be divided into different program modules to complete all or part of the functions described above.

In one embodiment, a computer device is provided. The computer device may be a terminal, and its internal structure may be as shown in FIG. 5. The computer device includes a processor, a memory, a display screen, and an input device connected by a system bus. The processor provides computing and control capabilities. The memory includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system and a computer program. The internal memory provides an environment for running the operating system and the computer program stored in the non-volatile storage medium. When executed by the processor, the computer program implements a method for evaluating the IQ level of a voice interaction system. The display screen may be a liquid crystal display or an electronic ink display. The input device may be a touch layer covering the display screen, a button, trackball, or touchpad on the housing of the computer device, or an external keyboard, touchpad, or mouse.

Those skilled in the art will understand that the structure shown in FIG. 5 is only a block diagram of the part of the structure related to the solution of this application and does not limit the computer device to which the solution is applied. A specific computer device may include more or fewer components than shown in the figure, combine certain components, or have a different arrangement of components.

In one embodiment, a computer device is provided that includes a memory and a processor. The memory stores a computer program, and when the processor executes the computer program it implements the steps of the method for evaluating the IQ level of a voice interaction system in any of the above embodiments.

Those of ordinary skill in the art will understand that all or part of the processes in the methods of the above embodiments can be carried out by a computer program instructing the relevant hardware. The computer program may be stored in a non-volatile computer-readable storage medium, and when executed by a processor it implements the steps of the method for evaluating the IQ level of a voice interaction system in any of the above embodiments.

Accordingly, in one embodiment, a computer-readable storage medium is provided on which a computer program is stored. When executed by a processor, the program implements the steps of the method for evaluating the IQ level of a voice interaction system in any of the above embodiments.

Any reference to memory, storage, a database, or another medium used in the embodiments provided in this application may include non-volatile and/or volatile memory. Non-volatile memory may include read-only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), Rambus direct RAM (RDRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM).

The technical features of the above embodiments may be combined arbitrarily. For brevity, not every possible combination of these technical features is described. However, as long as a combination of these technical features involves no contradiction, it should be considered within the scope of this specification.

The descriptions of the above embodiments each have their own emphasis. For parts not described in detail in one embodiment, refer to the relevant descriptions of the other embodiments.

The terms "comprising" and "having" and any variations of them in the embodiments herein are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that comprises a series of steps or (module) units is not limited to the listed steps or units. It may optionally also include steps or units that are not listed, or other steps or units inherent to the process, method, product, or device.

"Multiple" as used herein means two or more. "And/or" describes an association between associated objects and indicates that three relationships are possible. For example, "A and/or B" can mean that A exists alone, that A and B both exist, or that B exists alone. The character "/" generally indicates an "or" relationship between the objects before and after it.

"First\second" as used herein only distinguishes similar objects and does not denote a particular ordering of them. Where permitted, the objects distinguished by "first\second" may be interchanged in order or sequence, so that the embodiments described herein can be implemented in an order other than those illustrated or described herein.

The above embodiments express only several implementations of this application. Their description is relatively specific and detailed, but it should not be construed as limiting the scope of the invention patent. It should be noted that those of ordinary skill in the art can make several modifications and improvements without departing from the concept of this application, and these all fall within the protection scope of this application. The protection scope of this patent application is therefore defined by the appended claims.

Claims (12)

1. A method for evaluating the IQ level of a voice interaction system, characterized in that it comprises:
acquiring a test question, the test question being a speech question;
obtaining the number of instruction cycles the voice interaction system takes to respond successfully to the test question, wherein one instruction cycle is one dialogue round between the voice interaction system and the outside world;
determining the evaluation result of the IQ level of the voice interaction system from the number of instruction cycles.

2. The method according to claim 1, characterized in that determining the evaluation result of the IQ level of the voice interaction system from the number of instruction cycles comprises:
determining the instruction cycle cost corresponding to the test question from the number of instruction cycles;
determining the evaluation result of the IQ level of the voice interaction system from the instruction cycle cost.

3. The method according to claim 2, characterized in that acquiring a test question comprises: acquiring multiple preset test questions;
and determining the evaluation result of the IQ level of the voice interaction system from the instruction cycle cost comprises:
determining the evaluation result score of the IQ level of the voice interaction system according to the instruction cycle cost corresponding to each of the multiple test questions.

4. The method according to claim 3, characterized in that determining the evaluation result score of the IQ level of the voice interaction system according to the instruction cycle cost corresponding to each of the multiple test questions comprises:
acquiring the weight corresponding to each of the multiple test questions;
determining the evaluation result of the IQ level of the voice interaction system according to the instruction cycle cost and weight corresponding to each of the multiple test questions.

5. The method according to claim 4, characterized in that acquiring the weight corresponding to each of the multiple test questions comprises:
acquiring the type corresponding to each of the multiple test questions;
acquiring the weight corresponding to each type as the weight of the corresponding test question.

6. The method according to any one of claims 2 to 5, characterized in that determining the instruction cycle cost corresponding to the test question from the number of instruction cycles comprises:
acquiring a preset cycle-number limit;
determining the instruction cycle cost corresponding to the test question from the number of instruction cycles and the cycle-number limit.

7. The method according to claim 6, characterized in that determining the instruction cycle cost corresponding to the test question from the number of instruction cycles and the cycle-number limit comprises:
if the number of instruction cycles is N1, determining that the instruction cycle cost corresponding to the test question is cost score S1;
if the number of instruction cycles is N2, determining that the instruction cycle cost corresponding to the test question is cost score S2;
if the number of instruction cycles is N3, determining that the instruction cycle cost corresponding to the test question is cost score S3;
or,
if the number of instruction cycles is N1, determining that the instruction cycle cost corresponding to the test question is cost level L1;
if the number of instruction cycles is N2, determining that the instruction cycle cost corresponding to the test question is cost level L2;
if the number of instruction cycles is N3, determining that the instruction cycle cost corresponding to the test question is cost level L3;
wherein 0 < N1 < N2 < M ≤ N3, N1 is a preset number of cycles, and M is the cycle-number limit; S3 < S2 < S1, and the levels corresponding to L1, L2, and L3 decrease in that order.

8. The method according to claim 7, characterized in that,
when the number of instruction cycles is between N1 and M, the value of the cost score S2 decreases as the number of instruction cycles increases.

9. The method according to claim 8, characterized in that the decrease in the value of the cost score S2 comprises at least one of linear decrease, exponential decay, and logarithmic decay.

10. A device for evaluating the IQ level of a voice interaction system, characterized in that it comprises:
a question acquisition module, configured to acquire a test question, the test question being a speech question;
a cycle counting module, configured to obtain the number of instruction cycles the voice interaction system takes to respond successfully to the test question, wherein one instruction cycle is one dialogue round between the voice interaction system and the outside world;
and,
a result determination module, configured to determine the evaluation result of the IQ level of the voice interaction system from the number of instruction cycles.

11. A computer device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, characterized in that the processor, when executing the program, implements the steps of the method according to any one of claims 1 to 9.

12. A computer-readable storage medium on which a computer program is stored, characterized in that the program, when executed by a processor, implements the steps of the method according to any one of claims 1 to 9.

CN201810433832.0A 2018-05-08 2018-05-08 Evaluating method, device and the computer equipment of voice interactive system IQ level Pending CN108763329A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810433832.0A CN108763329A (en) 2018-05-08 2018-05-08 Evaluating method, device and the computer equipment of voice interactive system IQ level

Publications (1)

Publication Number Publication Date
CN108763329A true CN108763329A (en) 2018-11-06

Family

ID=64009222

Country Status (1)

Country Link
CN (1) CN108763329A (en)

Similar Documents

Publication Publication Date Title
CN108763329A (en) Evaluating method, device and the computer equipment of voice interactive system IQ level
US10656909B2 (en) Learning intended user actions
KR102437944B1 (en) Voice wake-up method and device
WO2020098249A1 (en) Electronic device, response conversation technique recommendation method and computer readable storage medium
CN114238656B (en) Logic map completion method based on reinforcement learning and related equipment thereof
CN108899012A (en) Interactive voice equipment evaluating method, system, computer equipment and storage medium
CN107015964B (en) Intelligent robot development-oriented custom intention implementation method and device
CN111209380B (en) Control method and device for conversation robot, computer equipment and storage medium
CN114842849B (en) Voice dialogue detection method and device
KR20170126667A (en) Method for generating conference record automatically and apparatus thereof
CN112286485B (en) Method and device for controlling application through voice, electronic equipment and storage medium
WO2021012649A1 (en) Method and device for expanding question and answer sample
CN110544468B (en) Application awakening method and device, storage medium and electronic equipment
JP2023532395A (en) Suggesting alternative interfaces when environmental interference is predicted to interfere with specific interactions of automated assistants
CN113158690A (en) Testing method and device for conversation robot
CN117149967A (en) Response generation method, device, server and computer readable storage medium
CN112396444B (en) Intelligent robot response method and device
CN119096252A (en) Communication method, electronic device, storage medium and product
CN117076626A (en) Interaction method, device, computer equipment and storage medium
CN117933387A (en) Dialogue data generation method, dialogue data generation system, model training method and dialogue processing method
US7908143B2 (en) Dialog call-flow optimization
CN114783405B (en) Speech synthesis method, device, electronic equipment and storage medium
CN119003707A (en) Information processing method, system, equipment and medium
CN117494715A (en) Dialogue processing method and device, electronic equipment and storage medium
JP2025530312A (en) Restricting third-party application access to audio data content

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 511300 No.78, west of Zhucun Avenue, Zhucun street, Zengcheng District, Guangzhou City, Guangdong Province

Applicant after: CHINA ELECTRONIC PRODUCT RELIABILITY AND ENVIRONMENTAL TESTING RESEARCH INSTITUTE ((THE FIFTH ELECTRONIC RESEARCH INSTITUTE OF MIIT)(CEPREI LABORATORY))

Address before: 510610 No. 110 Dongguanzhuang Road, Tianhe District, Guangzhou City, Guangdong Province

Applicant before: CHINA ELECTRONIC PRODUCT RELIABILITY AND ENVIRONMENTAL TESTING RESEARCH INSTITUTE ((THE FIFTH ELECTRONIC RESEARCH INSTITUTE OF MIIT)(CEPREI LABORATORY))

RJ01 Rejection of invention patent application after publication

Application publication date: 20181106