[go: up one dir, main page]

CN115118820A - Call processing method and device, computer equipment and storage medium - Google Patents

Call processing method and device, computer equipment and storage medium Download PDF

Info

Publication number
CN115118820A
CN115118820A CN202210731432.4A CN202210731432A CN115118820A CN 115118820 A CN115118820 A CN 115118820A CN 202210731432 A CN202210731432 A CN 202210731432A CN 115118820 A CN115118820 A CN 115118820A
Authority
CN
China
Prior art keywords
target
template
target user
speech
templates
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210731432.4A
Other languages
Chinese (zh)
Inventor
陈杭
陈子意
朱益兴
于欣璐
李骁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Bank Co Ltd
Original Assignee
Ping An Bank Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Bank Co Ltd filed Critical Ping An Bank Co Ltd
Priority to CN202210731432.4A priority Critical patent/CN115118820A/en
Publication of CN115118820A publication Critical patent/CN115118820A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42221Conversation recording systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3343Query execution using phonetics
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/487Arrangements for providing information services, e.g. recorded voice services or time announcements
    • H04M3/493Interactive information services, e.g. directory enquiries ; Arrangements therefor, e.g. interactive voice response [IVR] systems or voice portals
    • H04M3/4936Speech interaction details
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
    • H04M3/5166Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing in combination with interactive voice response systems or voice portals, e.g. as front-ends

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Business, Economics & Management (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Marketing (AREA)
  • Software Systems (AREA)
  • Telephonic Communication Services (AREA)

Abstract

本申请实施例公开了一种通话处理方法、装置、计算机设备及存储介质,本方案获取目标用户的历史通话记录;基于预设的双模态神经网络模型,对历史通话记录对应的语音信息和文本信息分别进行特征提取,得到目标用户面对各种类型的话术模板时的语音特征和文本特征;根据目标用户面对各种类型的话术模板时的语音特征和文本特征确定目标用户面对各种类型的话术模板时的偏好程度;基于目标用户面对各种类型的话术模板时的偏好程度,从各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将第一目标话术模板进行提示操作,以使客服坐席根据第一目标话术模板与目标用户进行通话,从而提高了电话坐席服务的服务质量。

Figure 202210731432

The embodiments of the present application disclose a call processing method, device, computer equipment, and storage medium. This solution obtains historical call records of a target user; based on a preset dual-modal neural network model, the voice information and The text information is extracted separately to obtain the speech features and text features of the target users when they face various types of discourse templates; the preference degree of each type of discourse template; based on the preference degree of the target user when faced with various types of discourse templates, the first target discourse template with the highest preference is determined from the various types of discourse templates, and the first target discourse template with the highest preference is determined. The target speech template performs a prompt operation, so that the customer service agent can communicate with the target user according to the first target speech template, thereby improving the service quality of the telephone agent service.

Figure 202210731432

Description

一种通话处理方法、装置、计算机设备及存储介质A call processing method, device, computer equipment and storage medium

技术领域technical field

本申请涉及通话技术领域,具体涉及一种通话处理方法、装置、计算机设备及存储介质。The present application relates to the field of call technology, and in particular, to a call processing method, device, computer equipment, and storage medium.

背景技术Background technique

随着金融科技和社会经济的发展,银行专门设置电话坐席服务,以更好的服务客户。现有的电话坐席服务主要依赖于客服坐席本身的经验来进行服务,而且由于客服坐席流动性大,且专业水平参差不齐,而导致电话坐席服务的服务质量较差。With the development of financial technology and social economy, banks have specially set up telephone agent services to better serve customers. The existing telephone agent service mainly relies on the experience of the customer service agent to provide services, and the service quality of the telephone agent service is poor due to the large mobility of the customer service agent and the uneven professional level.

发明内容SUMMARY OF THE INVENTION

本申请实施例提供一种通话处理方法、装置、计算机设备及存储介质,可以提高电话坐席服务的服务质量。The embodiments of the present application provide a call processing method, apparatus, computer equipment and storage medium, which can improve the service quality of the telephone agent service.

本申请实施例提供了一种通话处理方法,包括:The embodiment of the present application provides a call processing method, including:

获取目标用户的历史通话记录,上述历史通话记录中包括基于各种类型的话术模板对上述目标用户进行通话时的通话记录;Obtain the historical call records of the target user, the above-mentioned historical call records include the call records when the above-mentioned target user is called based on various types of speech templates;

基于预设的双模态神经网络模型,对上述历史通话记录对应的语音信息和文本信息分别进行特征提取,得到上述目标用户面对各种类型的话术模板时的语音特征和文本特征;Based on a preset dual-modal neural network model, feature extraction is performed on the voice information and text information corresponding to the above-mentioned historical call records, respectively, to obtain the above-mentioned target user's voice features and text features when faced with various types of discourse templates;

根据上述目标用户面对各种类型的话术模板时的语音特征和文本特征确定上述目标用户面对各种类型的话术模板时的偏好程度;Determine the degree of preference of the above-mentioned target user when facing various types of discourse templates according to the speech features and text characteristics of the above-mentioned target users when they face various types of discourse templates;

基于上述目标用户面对各种类型的话术模板时的偏好程度,从上述各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将上述第一目标话术模板进行提示操作,以使客服坐席根据上述第一目标话术模板与上述目标用户进行通话。Based on the degree of preference of the target user when faced with various types of speech templates, the first target speech template with the highest preference is determined from the above various types of speech templates, and the above-mentioned first target speech template is prompted. , so that the customer service agent can communicate with the target user according to the first target speech template.

相应的,本申请实施例还提供了一种通话处理装置,包括:Correspondingly, an embodiment of the present application also provides a call processing device, including:

记录获取模块,用于获取目标用户的历史通话记录,上述历史通话记录中包括基于各种类型的话术模板对上述目标用户进行通话时的通话记录;A record acquisition module, used for acquiring the historical call records of the target user, the above-mentioned historical call records including the call records when the above-mentioned target user is called based on various types of speech templates;

特征提取模块,用于基于预设的双模态神经网络模型,对上述历史通话记录对应的语音信息和文本信息分别进行特征提取,得到上述目标用户面对各种类型的话术模板时的语音特征和文本特征;The feature extraction module is configured to perform feature extraction on the voice information and text information corresponding to the above-mentioned historical call records based on a preset dual-modal neural network model, respectively, to obtain the above-mentioned target user's voice characteristics when faced with various types of discourse templates and text features;

程度确定模块,用于根据上述目标用户面对各种类型的话术模板时的语音特征和文本特征确定上述目标用户面对各种类型的话术模板时的偏好程度;A degree determination module, configured to determine the degree of preference of the above-mentioned target user when facing various types of discourse templates according to the speech features and text characteristics of the above-mentioned target users when they face various types of discourse templates;

提示操作模块,用于基于上述目标用户面对各种类型的话术模板时的偏好程度,从上述各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将上述第一目标话术模板进行提示操作,以使客服坐席根据上述第一目标话术模板与上述目标用户进行通话。The prompting operation module is used to determine the first target speech template with the highest degree of preference from the above-mentioned various types of speech templates based on the preference degree of the target user when facing various types of speech templates, and assign the above-mentioned first target speech template. The speech template performs a prompt operation, so that the customer service agent can communicate with the target user according to the first target speech template.

在一些实施例中,上述通话处理装置还包括:In some embodiments, the above-mentioned call processing apparatus further includes:

第一信息获取模块,用于当上述目标用户为新用户时,获取上述目标用户的属性信息;a first information acquisition module, configured to acquire attribute information of the target user when the target user is a new user;

用户确定模块,用于根据上述目标用户的属性信息确定与上述目标用户属性相似的历史用户;a user determination module, configured to determine historical users whose attributes are similar to the above-mentioned target user according to the attribute information of the above-mentioned target user;

模板获取模块,用于获取上述历史用户对应的偏好程度最高的第二目标话术模板,并将上述第二目标话术模板进行提示操作,以使上述客服坐席根据上述第二目标话术模板与上述目标用户进行通话。The template obtaining module is used to obtain the second target speech template with the highest preference corresponding to the above-mentioned historical users, and perform a prompt operation on the above-mentioned second target speech template, so that the above-mentioned customer service agent can match the second target speech template with the above-mentioned second target speech template. The above target user makes a call.

在一些实施例中,上述通话处理装置还包括:In some embodiments, the above-mentioned call processing apparatus further includes:

第二信息获取模块,用于当上述目标用户为新用户时,获取上述目标用户的预设时间段内的通话信息;a second information acquisition module, configured to acquire call information within a preset time period of the target user when the target user is a new user;

场景确定模块,用于根据上述通话信息确定上述目标用户所需的业务场景;a scenario determination module, configured to determine the business scenario required by the target user according to the call information;

模板确定模块,用于根据上述业务场景确定上述业务场景对应的第三目标话术模板,并将上述第三目标话术模板进行提示操作,以使上述客服坐席根据上述第三目标话术模板与上述目标用户进行通话。The template determination module is configured to determine the third target speech template corresponding to the above-mentioned business scenario according to the above-mentioned business scenario, and perform a prompt operation on the above-mentioned third target speech template, so that the above-mentioned customer service agent can match the above-mentioned third target speech template with the above-mentioned third target speech template. The above target user makes a call.

在一些实施例中,上述通话处理装置还包括:In some embodiments, the above-mentioned call processing apparatus further includes:

预测模块,用于实时获取通话过程中的通话记录,将上述通话记录输入预设的神经网络模型进行偏好程度预测,得到上述目标用户对上述第一目标话术模板的偏好程度;a prediction module, configured to acquire call records during the call in real time, input the call records into a preset neural network model to predict the preference degree, and obtain the preference degree of the target user for the first target speech template;

选取模块,用于当上述偏好程度小于预设阈值时,根据上述通话记录从预设的模板集合中选取第四目标话术模板,对上述第四目标话术模板进行提示操作,以使上述客服坐席根据上述第四目标话术模板与上述目标用户进行通话。The selection module is configured to select a fourth target speech template from the preset template set according to the call record when the preference degree is less than a preset threshold, and perform a prompt operation on the fourth target speech template, so that the customer service The agent communicates with the target user according to the fourth target speech template.

在一些实施例中,上述选取模块包括:In some embodiments, the above-mentioned selection module includes:

场景确定单元,用于根据上述通话记录确定上述目标用户当前所需的业务场景;a scenario determination unit, configured to determine the business scenario currently required by the target user according to the call record;

第一选取单元,用于从上述模板集合中选取上述业务场景对应的第四目标话术模板。The first selection unit is configured to select the fourth target speech template corresponding to the above-mentioned business scenario from the above-mentioned template set.

在一些实施例中,上述第一目标话术模板包括至少一个业务阶段,上述业务阶段对应至少一个话术子模板,上述通话处理装置还包括:In some embodiments, the above-mentioned first target speech template includes at least one service phase, the above-mentioned service phase corresponds to at least one speech sub-template, and the above-mentioned call processing apparatus further includes:

阶段确定模块,用于获取目标用户的当前答复信息,根据上述当前答复信息确定当前业务阶段;a stage determination module, used for obtaining the current reply information of the target user, and determining the current business stage according to the above-mentioned current reply information;

标亮提示模块,用于从上述当前业务阶段对应的至少一个话术子模板中选取目标子模板,并对上述目标子模板进行标亮提示。A highlighting and prompting module is used to select a target sub-template from at least one speech sub-template corresponding to the above-mentioned current business stage, and to highlight and prompt the above-mentioned target sub-template.

在一些实施例中,上述标亮提示模块包括:In some embodiments, the above-mentioned highlighting and prompting module includes:

状态获取单元,用于获取上述目标用户的情绪状态;a state acquisition unit, used for acquiring the emotional state of the above-mentioned target user;

第二选取单元,用于根据上述情绪状态从上述当前业务阶段对应的至少一个话术子模板中选取目标子模板。The second selecting unit is configured to select a target sub-template from at least one speech sub-template corresponding to the current business stage according to the emotional state.

相应的,本申请实施例还提供了一种计算机设备,包括存储器,处理器及存储在储存器上并可在处理器上运行的计算机程序,其中,处理器执行本申请实施例任一提供的通话处理方法。Correspondingly, an embodiment of the present application also provides a computer device, including a memory, a processor, and a computer program stored on the memory and running on the processor, wherein the processor executes any of the provided embodiments of the present application. Call handling method.

相应的,本申请实施例还提供了一种存储介质,存储介质存储有多条指令,指令适于处理器进行加载,以执行如上的通话处理方法。Correspondingly, an embodiment of the present application further provides a storage medium, where the storage medium stores a plurality of instructions, and the instructions are suitable for the processor to load, so as to execute the above call processing method.

本申请实施例获取目标用户的历史通话记录,上述历史通话记录中包括基于各种类型的话术模板对上述目标用户进行通话时的通话记录;基于预设的双模态神经网络模型,对上述历史通话记录对应的语音信息和文本信息分别进行特征提取,得到上述目标用户面对各种类型的话术模板时的语音特征和文本特征;根据上述目标用户面对各种类型的话术模板时的语音特征和文本特征确定上述目标用户面对各种类型的话术模板时的偏好程度;基于上述目标用户面对各种类型的话术模板时的偏好程度,从上述各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将上述第一目标话术模板进行提示操作,以使客服坐席根据上述第一目标话术模板与上述目标用户进行通话,从而通过目标用户的历史通话记录来确定目标用户最偏好的话术模板,再基于该话术模板来促使电话坐席服务与目标用户进行通话,提高了电话坐席服务的专业水平以及电话坐席服务的服务质量,致使用户通话体验变好。This embodiment of the present application acquires the historical call records of the target user, and the above-mentioned historical call records include the call records when the above-mentioned target users are called based on various types of discourse templates; The voice information and text information corresponding to the call records are extracted respectively to obtain the voice features and text features of the above-mentioned target users when they are faced with various types of discourse templates; and text features to determine the degree of preference of the target user when faced with various types of discourse templates; based on the degree of preference of the target user when faced with various types of discourse templates, the highest preference degree is determined from the above various types of discourse templates the first target speech template, and perform a prompt operation on the above-mentioned first target speech template, so that the customer service agent can communicate with the above-mentioned target user according to the above-mentioned first target speech template, so as to determine the historical call records of the target user. The target user's most preferred speech template, and then based on the speech template to prompt the telephone agent service to communicate with the target user, the professional level of the telephone agent service and the service quality of the telephone agent service are improved, and the user's call experience is improved.

附图说明Description of drawings

为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those skilled in the art, other drawings can also be obtained from these drawings without creative effort.

图1为本申请实施例提供的一种通话处理方法的流程示意图。FIG. 1 is a schematic flowchart of a call processing method provided by an embodiment of the present application.

图2为本申请实施例提供的一种通话处理装置的结构框图。FIG. 2 is a structural block diagram of a call processing apparatus provided by an embodiment of the present application.

图3为本申请实施例提供的计算机设备的结构示意图。FIG. 3 is a schematic structural diagram of a computer device provided by an embodiment of the present application.

具体实施方式Detailed ways

下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请的一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, rather than all the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those skilled in the art without creative work fall within the protection scope of the present application.

本申请实施例提供一种通话处理方法、装置、存储介质及计算机设备。具体地,本申请实施例的通话处理方法可以由计算机设备执行,其中,该计算机设备可以为服务器,也可以为终端等设备。其中,服务器可以是独立的物理服务器,也可以是多个物理服务器构成的服务器集群或者分布式系统,还可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、CDN、以及大数据和人工智能平台等基础云计算服务的云服务器。终端可以是智能手机、台式计算机、笔记本电脑、平板电脑等,但并不局限于此。终端以及服务器可以通过有线或无线通信方式进行直接或间接地连接,本申请在此不做限制。Embodiments of the present application provide a call processing method, apparatus, storage medium, and computer equipment. Specifically, the call processing method in the embodiment of the present application may be executed by a computer device, where the computer device may be a server, or may be a terminal or other device. The server may be an independent physical server, or a server cluster or distributed system composed of multiple physical servers, or may provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, and cloud communications. , middleware services, domain name services, security services, CDN, and cloud servers for basic cloud computing services such as big data and artificial intelligence platforms. The terminal may be a smart phone, a desktop computer, a notebook computer, a tablet computer, etc., but is not limited thereto. The terminal and the server may be directly or indirectly connected through wired or wireless communication, which is not limited in this application.

例如,该计算机设备可以是终端,该终端可以获取目标用户的历史通话记录,上述历史通话记录中包括基于各种类型的话术模板对上述目标用户进行通话时的通话记录;基于预设的双模态神经网络模型,对上述历史通话记录对应的语音信息和文本信息分别进行特征提取,得到上述目标用户面对各种类型的话术模板时的语音特征和文本特征;根据上述目标用户面对各种类型的话术模板时的语音特征和文本特征确定上述目标用户面对各种类型的话术模板时的偏好程度;基于上述目标用户面对各种类型的话术模板时的偏好程度,从上述各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将上述第一目标话术模板进行提示操作,以使客服坐席根据上述第一目标话术模板与上述目标用户进行通话。For example, the computer device may be a terminal, and the terminal may acquire historical call records of the target user, and the historical call records include call records when the target user is called based on various types of speech templates; state neural network model, respectively perform feature extraction on the voice information and text information corresponding to the above-mentioned historical call records, and obtain the voice and text characteristics of the above-mentioned target users when they face various types of discourse templates; The speech features and text features of the types of discourse templates determine the degree of preference of the above-mentioned target users when facing various types of discourse templates; From the speech templates, determine the first target speech template with the highest degree of preference, and perform a prompt operation on the first target speech template, so that the customer service agent can communicate with the target user according to the first target speech template.

基于上述问题,本申请实施例提供第一种通话处理方法、装置、计算机设备及存储介质,可以提高电话坐席服务的服务质量。Based on the above problems, the embodiments of the present application provide a first call processing method, device, computer equipment, and storage medium, which can improve the service quality of the telephone agent service.

以下分别进行详细说明。需说明的是,以下实施例的描述顺序不作为对实施例优选顺序的限定。Each of them will be described in detail below. It should be noted that the description order of the following embodiments is not intended to limit the preferred order of the embodiments.

本申请实施例提供一种通话处理方法,该方法可以由终端或服务器执行,本申请实施例以通话处理方法由终端执行为例来进行说明。The embodiment of the present application provides a call processing method, and the method can be executed by a terminal or a server. The embodiment of the present application takes the call processing method executed by a terminal as an example for description.

请参阅图1,图1为本申请实施例提供的一种通话处理方法的流程示意图。该通话处理方法的具体流程可以如下:Please refer to FIG. 1. FIG. 1 is a schematic flowchart of a call processing method provided by an embodiment of the present application. The specific flow of the call processing method may be as follows:

101、获取目标用户的历史通话记录。101. Obtain the historical call records of the target user.

在本实施例中,上述历史通话记录为客服坐席在历史时间中基于各种类型的话术模板对目标用户进行通话时产生的通话记录,该历史通话记录中包括基于各种类型的话术模板对目标用户进行通话时的通话记录。In this embodiment, the above-mentioned historical call record is a call record generated when a customer service agent makes a call to a target user based on various types of discourse templates in historical time. Call records when the user makes a call.

其中,上述客服坐席为与用户进行通话的电话客服人员;上述话术模板中包括但不限于是打招呼的相关内容、回复用户问题的相关内容以及请用户等待的相关内容等;上述话术模板的类型包括但不限于是简洁类型、温柔类型、可爱类型等。Among them, the above-mentioned customer service agents are the telephone customer service personnel who talk with the user; the above-mentioned speech template includes but is not limited to the relevant content of greeting, the relevant content of replying to the user's question, and the relevant content of asking the user to wait, etc.; Types include, but are not limited to, concise type, gentle type, cute type, etc.

102、基于预设的双模态神经网络模型,对历史通话记录对应的语音信息和文本信息分别进行特征提取,得到目标用户面对各种类型的话术模板时的语音特征和文本特征。102. Based on a preset dual-modal neural network model, feature extraction is performed on the voice information and text information corresponding to the historical call records, respectively, to obtain the voice features and text features of the target user when faced with various types of discourse templates.

在本实施例中,预先通话语音数据和文本数据训练得到双模态神经网络模型,当存在业务需求时,即当前客服坐席需与目标用户进行通话时,终端可通过双模态神经网络模型对所获取的目标用户的历史通话记录进行两个模态的特征提取,该两个模态分贝为语音和文本两模态,从而通过双模态神经网络模型的特征提取来得到目标用户在面对每个特定类型的话术模板时的语音特征和文本特征。In this embodiment, the bimodal neural network model is obtained by training the voice data and text data in advance. When there is a business requirement, that is, when the current customer service agent needs to talk with the target user, the terminal can use the bimodal neural network model to The acquired historical call records of the target user are subjected to feature extraction of two modalities, and the two modal decibels are two modalities of speech and text. Phonetic and textual features for each particular type of utterance template.

可以理解的是,一般情况下历史通话记录中包括客服语音和用户语音,由于当前进行特征提取主要针对于用户语音,所以可从历史通话记录中提取出用户语音,从而对用户语音进行特征提取,并且在进行文本的特征提取时,可预先将用户语音转化成文本信息,再将用户语音对应的语音信息和用户语音转化后的文本信息输入至预设的双模态神经网络模型中进行特征提取。It is understandable that, in general, historical call records include customer service voice and user voice. Since the current feature extraction is mainly for user voice, user voice can be extracted from historical call records, so as to perform feature extraction on user voice. And when the feature extraction of the text is carried out, the user's voice can be converted into text information in advance, and then the voice information corresponding to the user's voice and the text information after the user's voice conversion are input into the preset bimodal neural network model for feature extraction. .

103、根据目标用户面对各种类型的话术模板时的语音特征和文本特征确定目标用户面对各种类型的话术模板时的偏好程度。103. Determine, according to the speech features and text features of the target user when the target user faces various types of utterance templates, the degree of preference of the target user when faced with various types of utterance templates.

在本实施例中,由于目标用户面对不同类型的话术模板,其偏好程度也有所不同,所以在得到目标用户在面对每个特定类型的话术模板时的语音特征和文本特征之后,可基于双模态神经网络模型对其所提取的语音特征和文本特征进行偏好程度识别,即识别出目标用户在面对每个特定类型的话术模板时的偏好程度。In this embodiment, since the target users face different types of discourse templates, their preference degrees are also different, so after obtaining the speech features and text characteristics of the target users when they face each specific type of discourse template, it can be based on The bimodal neural network model performs preference degree recognition on the extracted speech features and text features, that is, the preference degree of the target user when faced with each specific type of discourse template.

例如,若当前目标用户的历史通话记录中包括三种类型的话术模板,分别为简洁类型的话术模板、温柔类型的话术模板以及可爱类型的话术模板,基于双模态神经网络模型,可得到目标用户在面对简洁类型的话术模板时的偏好程度为A,目标用户在面对温柔类型的话术模板时的偏好程度为B,目标用户在面对可爱类型的话术模板时的偏好程度为C。For example, if the historical call records of the current target user include three types of speech templates, namely concise type speech templates, gentle type speech templates, and cute type speech templates, based on the dual-modal neural network model, the target The user's preference degree when facing the concise type of discourse template is A, the target user's preference degree when facing the gentle type discourse template is B, and the target user's preference degree when facing the cute type of discourse template is C.

104、基于目标用户面对各种类型的话术模板时的偏好程度,从各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将第一目标话术模板进行提示操作,以使客服坐席根据第一目标话术模板与目标用户进行通话。104. Based on the preference degree of the target user when facing various types of speech templates, determine the first target speech template with the highest degree of preference from the various types of speech templates, and perform a prompt operation on the first target speech template, So that the customer service agent can communicate with the target user according to the first target speech template.

在本实施例中,通过从目标用户的历史通话记录中的各种类型的话术模板,从中选取目标用户相对偏好程度更高的话术模板,即第一目标话术模板,从而可以将第一目标话术模板在终端的触控显示屏上进行显示,促使客服坐席根据显示屏上所显示的第一目标话术模板与目标用户进行通话,从而通过采用目标用户偏好程度最高的话术模板对目标用户进行通话,来提高客服坐席的服务质量。In this embodiment, by selecting from various types of speech templates in the historical call records of the target user, a speech template with a higher degree of relative preference of the target user, that is, the first target speech template, can be used as the first target speech template. The speech template is displayed on the touch screen of the terminal, prompting the customer service agent to communicate with the target user according to the first target speech template displayed on the display screen, so that the target user can be informed by using the speech template with the highest preference of the target user. Make calls to improve the service quality of customer service agents.

例如,基于上述示例,进一步限定,上述B大于C大于A,从而可得到目标用户相对偏好程度更高的第一目标话术模板为温柔类型的话术模板,进而终端对温柔类型的话术模板进行提示操作,以使客服坐席根据温柔类型的话术模板与目标用户进行通话。For example, based on the above example, it is further limited that the above B is greater than C and greater than A, so that the first target speech template with a higher relative preference of the target user can be obtained as a gentle type speech template, and then the terminal prompts the gentle type speech template Operation, so that the customer service agent can talk to the target user according to the gentle type speech template.

在一些实施例中,由于不同话术模板所对应的业务场景不同,就会存在至少两种相同类型的话术模板分别对应的业务场景不同的现象,而针对不同的业务场景的话术模板的内容也有所不同,为了更好的提高电话坐席服务的服务质量,在将第一目标话术模板进行提示操作之后还可以包括,进一步实时检测客服坐席与目标用户的通话记录,以实时判断当前目标用户所需的业务场景是否与沟通之前得到的第一话术模板所对应的业务场景一致,从而在两者的业务场景不一致时及时应对,以提高电话坐席服务的服务质量。其中,上述在业务场景不一致时的应对方式包括但不限于是将第一目标话术模板进行替换,对第一目标话术模板进行修改等;上述业务场景包括但不限于是售后场景、咨询场景和销售场景等。In some embodiments, due to different business scenarios corresponding to different discourse templates, there may be a phenomenon that at least two discourse templates of the same type correspond to different business scenarios, and the contents of discourse templates for different business scenarios also include The difference is that, in order to better improve the service quality of the telephone agent service, after performing the prompt operation on the first target speech template, it may further include further real-time detection of the call records between the customer service agent and the target user, so as to judge in real time what the current target user has. Whether the required business scenario is consistent with the business scenario corresponding to the first speech template obtained before the communication, so that when the two business scenarios are inconsistent, it can be dealt with in time to improve the service quality of the telephone agent service. Among them, the above-mentioned response methods when the business scenarios are inconsistent include but are not limited to replacing the first target speech template, modifying the first target speech template, etc. The above business scenarios include but are not limited to after-sales scenarios and consulting scenarios. and sales scenarios.

具体可以包括:终端实时获取客服坐席基于第一话术模板与目标用户的通话过程中的通话记录,并将该通话记录输入预设的神经网络模型实时进行偏好程度预测,以得到目标用户当前对第一目标话术模板的偏好程度,当目标用户当前对第一目标话术模板的偏好程度大于等于预设阈值时,说明目标用户当前对第一目标话术模板仍处于满意的状态,则不对第一目标话术模板进行任何处理;当目标用户当前对第一目标话术模板的偏好程度小于预设阈值时,说明目标用户当前对第一目标话术模板已经不满意,且既有可能会进行客户投诉等不友善行为,对此,可根据通话记录从预设的模板集合中选取第四目标话术模板,对第四目标话术模板进行提示操作,以使客服坐席根据第四目标话术模板与目标用户进行通话。Specifically, the terminal may acquire the call records of the customer service agent during the call process with the target user based on the first speech template in real time, and input the call records into a preset neural network model to predict the preference degree in real time, so as to obtain the current preference of the target user. The preference degree of the first target speech template. When the target user's current preference for the first target speech template is greater than or equal to the preset threshold, it means that the target user is still satisfied with the first target speech template. The first target speech template performs any processing; when the target user's current preference for the first target speech template is less than the preset threshold, it means that the target user is currently dissatisfied with the first target speech template, and both may To conduct unfriendly behaviors such as customer complaints, you can select the fourth target speech template from the preset template set according to the call record, and perform a prompt operation on the fourth target speech template, so that the customer service agent can use the fourth target speech template according to the fourth target speech. Use the technology template to talk to the target user.

其中,上述第四目标话术模板的类型可以与第一目标话术模板的类型一致,而两者的业务场景不同。Wherein, the type of the above-mentioned fourth target speech template may be consistent with the type of the first target speech template, but the business scenarios of the two are different.

上述根据通话记录从预设的模板集合中选取第四目标话术模板,具体可以包括:终端根据通话记录确定目标用户当前所需的业务场景;从模板集合中选取业务场景对应的第四目标话术模板。其中,该模板集合可以根据第一目标话术模板的类型来进行确定,即可将类型一致而业务场景的话术模板存储于同一模板集合中,待有需求时,根据第一目标话术模板的类型确定对应的模板集合,再从所确定的模板集合中选取业务场景对应的第四目标话术模板,该第四目标话术模板与第一目标话术模板的类型一致,且第四目标话术模板的业务场景更加符合目标用户的当前所需。The above-mentioned selecting the fourth target speech template from the preset template set according to the call record may specifically include: the terminal determines the business scenario currently required by the target user according to the call record; selecting the fourth target speech corresponding to the business scenario from the template set technique template. The template set can be determined according to the type of the first target vocabulary template, that is, the vocabulary templates of the same type and business scenarios can be stored in the same template set, and when there is a demand, according to the type of the first target vocabulary template Determine the template set corresponding to the type, and then select the fourth target language template corresponding to the business scenario from the determined template set, the fourth target language template is of the same type as the first target language template, and the fourth target language template is of the same type. The business scenarios of technology templates are more in line with the current needs of target users.

在一些实施例中,由于不同的业务场景存在不同的阶段,例如,打招呼阶段、回复用户问题阶段以及请用户等待阶段等,所以上述应用于特定业务场景的第一目标话术模板也同样包括至少一个业务阶段,该业务阶段对应至少一个话术子模板,即相当于同种类型但话术方式不同的子模板,为了保障服务质量,在将第一目标话术模板进行提示操作之后还可以包括:终端获取目标用户的当前答复信息,该答复信息即相当于用户回答客服上一个阶段的话语的答复内容,从而根据当前答复信息可以确定当前沟通处于业务场景中的哪一个业务阶段,即确定当前业务阶段,由于业务阶段均对应至少一个话术子模板,所以可从当前业务阶段对应的至少一个话术子模板中选取目标子模板,并对目标子模板在触控显示屏上进行标亮提示。In some embodiments, since there are different stages in different business scenarios, for example, the greeting stage, the replying user question stage, and the user waiting stage, etc., the above-mentioned first target speech template applied to a specific business scenario also includes at least A business stage, this business stage corresponds to at least one speech sub-template, that is, a sub-template of the same type but different speech modes. In order to ensure service quality, after prompting the first target speech template, you can also include : The terminal obtains the current reply information of the target user, which is equivalent to the reply content of the user's reply to the customer service's words in the previous stage, so that it can determine which business stage the current communication is in the business scenario according to the current reply information, that is, to determine the current In the business stage, since the business stage corresponds to at least one conversation sub-template, the target sub-template can be selected from the at least one conversation sub-template corresponding to the current business stage, and the target sub-template can be highlighted on the touch screen. .

上述从当前业务阶段对应的至少一个话术子模板中选取目标子模板具体可以包括:采用随机选取的方式从当前业务阶段对应的至少一个话术子模板中选取目标子模板,或者采用特定规则选取的方式从当前业务阶段对应的至少一个话术子模板中选取目标子模板。其中,上述特定规则可以是根据目标用户的当前状态或属性信息确定,上述当前状态可以是情绪状态;上述属性信息可以是目标用户的性别、年龄、职业等信息。The above-mentioned selecting the target sub-template from at least one speech sub-template corresponding to the current business stage may specifically include: selecting the target sub-template from at least one speech sub-template corresponding to the current business stage by random selection, or using a specific rule to select the target sub-template. The target sub-template is selected from at least one speech sub-template corresponding to the current business stage. The specific rule may be determined according to the current state or attribute information of the target user, and the current state may be an emotional state; the attribute information may be the target user's gender, age, occupation and other information.

具体地,根据当前状态从当前业务阶段对应的至少一个话术子模板中选取目标子模板可以包括:终端可获取目标用户的情绪状态,从而根据该情绪状态从当前业务阶段对应的至少一个话术子模板中选取目标子模板。其中,上述目标用户的情绪状态可通过实时获取客服坐席基于第一话术模板与目标用户的通话过程中的通话记录,以根据该通话记录来识别目标用户的情绪状态。Specifically, selecting the target sub-template from at least one vocabulary sub-template corresponding to the current business stage according to the current state may include: the terminal may acquire the emotional state of the target user, and then select the target sub-template from at least one vocabulary corresponding to the current business stage according to the emotional state. Select the target sub-template from the sub-template. Wherein, the emotional state of the target user can be obtained by real-time acquisition of the call record of the customer service agent during the call with the target user based on the first speech template, so as to identify the emotional state of the target user according to the call record.

在一些实施例中,由于可能存在一些目标用户是新用户,而该目标用户是没有历史通话记录的,致使面对目标用户为新用户时,是无法获取该目标用户的历史通话记录,并根据该历史通话记录确定目标用户偏好程度最高的第一目标话术模板的,所以当面对目标用户为新用户时,可以根据目标用户的属性信息或目标用户预设时间段内的通话信息确定目标用户偏好程度最高的话术模板。其中,上述属性信息可以是目标用户的性别、年龄、职业等信息。In some embodiments, since some target users may be new users, and the target users have no historical call records, when the target user is a new user, it is impossible to obtain the historical call records of the target user, and according to The historical call record determines the first target speech template with the highest preference of the target user, so when the target user is a new user, the target can be determined according to the attribute information of the target user or the call information within the preset time period of the target user The discourse template with the highest user preference. The above attribute information may be information such as gender, age, occupation, etc. of the target user.

可以理解的是,若根据目标用户存在历史记录,但根据其历史记录无法确定出第一目标话术模板,即说明目标用户虽然有历史通话记录,但是历史通话记录中的不同类型的话术模板对应的通话记录所分别对应的目标用户的偏好程度过低,从而并不能依据该偏好程度过低的话术模板得到第一目标话术模板,则可将该目标用户认为是新用户。It is understandable that, if the target user has historical records, but the first target vocabulary template cannot be determined according to the historical records, it means that although the target user has historical call records, different types of vocabulary templates in the historical call records correspond to different types of vocabulary templates. If the preference degree of the target users corresponding to the call records is too low, so that the first target speech template cannot be obtained according to the speech template with the too low preference degree, the target user can be regarded as a new user.

具体可以包括:当目标用户为新用户时,终端获取目标用户的属性信息,以根据目标用户的属性信息确定与目标用户属性相似的历史用户,从而获取历史用户对应的偏好程度最高的第二目标话术模板,并将第二目标话术模板进行提示操作,以使客服坐席根据第二目标话术模板与目标用户进行通话。Specifically, it may include: when the target user is a new user, the terminal obtains attribute information of the target user, so as to determine historical users with similar attributes to the target user according to the attribute information of the target user, so as to obtain the second target corresponding to the historical user with the highest degree of preference The language template, and prompting the second target language template, so that the customer service agent can communicate with the target user according to the second target language template.

其中,上述历史用户包括至少一个,历史用户对应的偏好程度最高的第二目标话术模板的获取方式可参照上述步骤101至步骤104。当历史用户存在至少两个时,根据至少两个历史用户分别对应的各个类型的话术模板的偏好程度,从至少两个历史用户分别对应的各个类型的话术模板中选取偏好程度最高的话术模板,即第二目标话术模板。The above-mentioned historical user includes at least one, and for the acquisition method of the second target speech template with the highest degree of preference corresponding to the historical user, reference may be made to the above-mentioned steps 101 to 104 . When there are at least two historical users, according to the preference degrees of the respective types of discourse templates corresponding to the at least two historical users, a discourse template with the highest preference degree is selected from the discourse templates of various types corresponding to the at least two historical users respectively, That is, the second target speech template.

具体可以包括:当目标用户为新用户时,终端获取目标用户的预设时间段内的通话信息,以根据通话信息确定目标用户所需的业务场景,从而根据业务场景确定业务场景对应的第三目标话术模板,并将第三目标话术模板进行提示操作,以使客服坐席根据第三目标话术模板与目标用户进行通话。Specifically, it may include: when the target user is a new user, the terminal obtains call information within a preset time period of the target user, so as to determine the service scenario required by the target user according to the call information, thereby determining the third corresponding to the service scenario according to the service scenario. target language template, and perform a prompt operation on the third target language template, so that the customer service agent can communicate with the target user according to the third target language template.

其中,上述业务场景对应至少一种类型的话术模板,可通过随机选取的方式从至少一种类型的话术模板中选取第三目标话术模板。也可根据各个类型的话术模板对应的偏好程度大于预设程度阈值的人数,从中将人数最多的目标类型的话术模板确定为第三目标话术模板。Wherein, the above business scenario corresponds to at least one type of speech template, and a third target speech template may be selected from at least one type of speech template by random selection. The speech template of the target type with the largest number of people can also be determined as the third target speech template according to the number of people whose preference degree corresponding to each type of speech template is greater than the preset degree threshold.

本发明实施例中,“第一”、“第二”、“第三”、“第四”仅仅是为区别话术模板,无其他特殊含义。In the embodiment of the present invention, "first", "second", "third", and "fourth" are only used to distinguish vocabulary templates, and have no other special meanings.

本申请实施例公开了一种通话处理方法,该方法包括:获取目标用户的历史通话记录,上述历史通话记录中包括基于各种类型的话术模板对上述目标用户进行通话时的通话记录;基于预设的双模态神经网络模型,对上述历史通话记录对应的语音信息和文本信息分别进行特征提取,得到上述目标用户面对各种类型的话术模板时的语音特征和文本特征;根据上述目标用户面对各种类型的话术模板时的语音特征和文本特征确定上述目标用户面对各种类型的话术模板时的偏好程度;基于上述目标用户面对各种类型的话术模板时的偏好程度,从上述各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将上述第一目标话术模板进行提示操作,以使客服坐席根据上述第一目标话术模板与上述目标用户进行通话,以此,可以提高电话坐席服务的服务质量。The embodiment of the present application discloses a call processing method. The method includes: acquiring historical call records of a target user, where the historical call records include call records when making calls to the target user based on various types of speech templates; The set dual-modal neural network model performs feature extraction on the voice information and text information corresponding to the above-mentioned historical call records, respectively, to obtain the above-mentioned target users when they face various types of discourse templates. The speech features and text features when faced with various types of discourse templates determine the degree of preference of the above target users when faced with various types of discourse templates; Among the above various types of speech templates, determine the first target speech template with the highest degree of preference, and perform a prompt operation on the above-mentioned first target speech template, so that the customer service agent can communicate with the above-mentioned target user according to the above-mentioned first target speech template. Therefore, the service quality of the telephone agent service can be improved.

为便于更好的实施本申请实施例提供的通话处理方法,本申请实施例还提供一种基于上述通话处理方法的通话处理装置。其中名词的含义与上述通话处理方法中相同,具体实现细节可以参考方法实施例中的说明。In order to facilitate better implementation of the call processing method provided by the embodiment of the present application, the embodiment of the present application further provides a call processing apparatus based on the above-mentioned call processing method. The meanings of the nouns are the same as those in the above-mentioned call processing method, and the specific implementation details may refer to the descriptions in the method embodiments.

请参阅图2,图2为本申请实施例提供的一种通话处理装置的结构框图,该装置包括:Please refer to FIG. 2. FIG. 2 is a structural block diagram of a call processing apparatus provided by an embodiment of the present application. The apparatus includes:

记录获取模块201,用于获取目标用户的历史通话记录,历史通话记录中包括基于各种类型的话术模板对目标用户进行通话时的通话记录。The record obtaining module 201 is configured to obtain the historical call records of the target user, and the historical call records include the call records when the target user is called based on various types of discourse templates.

特征提取模块202,用于基于预设的双模态神经网络模型,对历史通话记录对应的语音信息和文本信息分别进行特征提取,得到目标用户面对各种类型的话术模板时的语音特征和文本特征。The feature extraction module 202 is configured to perform feature extraction respectively on the voice information and text information corresponding to the historical call records based on the preset bimodal neural network model, and obtain the voice features and text features.

程度确定模块203,用于根据目标用户面对各种类型的话术模板时的语音特征和文本特征确定目标用户面对各种类型的话术模板时的偏好程度。The degree determination module 203 is configured to determine the degree of preference of the target user when facing various types of discourse templates according to the speech features and text characteristics of the target user when facing various types of discourse templates.

提示操作模块204,用于基于目标用户面对各种类型的话术模板时的偏好程度,从各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将第一目标话术模板进行提示操作,以使客服坐席根据第一目标话术模板与目标用户进行通话。The prompt operation module 204 is configured to, based on the preference degree of the target user when facing various types of speech templates, determine the first target speech template with the highest preference from the various types of speech templates, and assign the first target speech template to the first target speech template. The template performs a prompt operation, so that the customer service agent can communicate with the target user according to the first target speech template.

在一些实施例中,上述通话处理装置还包括:In some embodiments, the above-mentioned call processing apparatus further includes:

第一信息获取模块,用于当目标用户为新用户时,获取目标用户的属性信息。The first information acquisition module is configured to acquire attribute information of the target user when the target user is a new user.

用户确定模块,用于根据目标用户的属性信息确定与目标用户属性相似的历史用户。The user determination module is used for determining historical users with attributes similar to the target user according to the attribute information of the target user.

模板获取模块,用于获取历史用户对应的偏好程度最高的第二目标话术模板,并将第二目标话术模板进行提示操作,以使客服坐席根据第二目标话术模板与目标用户进行通话。The template acquisition module is used to obtain the second target speech template with the highest preference corresponding to the historical user, and perform a prompt operation on the second target speech template, so that the customer service agent can communicate with the target user according to the second target speech template .

在一些实施例中,上述通话处理装置还包括:In some embodiments, the above-mentioned call processing apparatus further includes:

第二信息获取模块,用于当目标用户为新用户时,获取目标用户的预设时间段内的通话信息。The second information obtaining module is configured to obtain call information of the target user within a preset time period when the target user is a new user.

场景确定模块,用于根据通话信息确定目标用户所需的业务场景。The scenario determination module is used to determine the service scenario required by the target user according to the call information.

模板确定模块,用于根据业务场景确定业务场景对应的第三目标话术模板,并将第三目标话术模板进行提示操作,以使客服坐席根据第三目标话术模板与目标用户进行通话。The template determination module is used to determine the third target speech template corresponding to the business scenario according to the business scenario, and perform a prompt operation on the third target speech template, so that the customer service agent can communicate with the target user according to the third target speech template.

在一些实施例中,上述通话处理装置还包括:In some embodiments, the above-mentioned call processing apparatus further includes:

预测模块,用于实时获取通话过程中的通话记录,将通话记录输入预设的神经网络模型进行偏好程度预测,得到目标用户对第一目标话术模板的偏好程度。The prediction module is used to obtain the call records during the call in real time, input the call records into the preset neural network model to predict the preference degree, and obtain the preference degree of the target user for the first target speech template.

选取模块,用于当偏好程度小于预设阈值时,根据通话记录从预设的模板集合中选取第四目标话术模板,对第四目标话术模板进行提示操作,以使客服坐席根据第四目标话术模板与目标用户进行通话。The selection module is configured to select the fourth target speech template from the preset template set according to the call record when the preference degree is less than the preset threshold, and perform a prompt operation on the fourth target speech template, so that the customer service agent can perform a prompt operation according to the fourth target speech template. The target speech template communicates with the target user.

在一些实施例中,上述选取模块包括:In some embodiments, the above-mentioned selection module includes:

场景确定单元,用于根据通话记录确定目标用户当前所需的业务场景。The scenario determination unit is used to determine the current business scenario required by the target user according to the call record.

第一选取单元,用于从模板集合中选取业务场景对应的第四目标话术模板。The first selection unit is configured to select the fourth target speech template corresponding to the business scenario from the template set.

在一些实施例中,第一目标话术模板包括至少一个业务阶段,业务阶段对应至少一个话术子模板,上述通话处理装置还包括:In some embodiments, the first target speech template includes at least one service phase, and the service phase corresponds to at least one speech sub-template, and the above-mentioned call processing apparatus further includes:

阶段确定模块,用于获取目标用户的当前答复信息,根据当前答复信息确定当前业务阶段。The stage determination module is used for acquiring the current reply information of the target user, and determining the current business stage according to the current reply information.

标亮提示模块,用于从当前业务阶段对应的至少一个话术子模板中选取目标子模板,并对目标子模板进行标亮提示。The highlighting and prompting module is used to select a target sub-template from at least one speech sub-template corresponding to the current business stage, and to highlight and prompt the target sub-template.

在一些实施例中,上述标亮提示模块包括:In some embodiments, the above-mentioned highlighting and prompting module includes:

状态获取单元,用于获取目标用户的情绪状态。The state obtaining unit is used to obtain the emotional state of the target user.

第二选取单元,用于根据情绪状态从当前业务阶段对应的至少一个话术子模板中选取目标子模板。The second selection unit is configured to select a target sub-template from at least one speech sub-template corresponding to the current business stage according to the emotional state.

本申请实施例公开了一种通话处理装置,通过记录获取模块201,用于获取目标用户的历史通话记录,历史通话记录中包括基于各种类型的话术模板对目标用户进行通话时的通话记录;特征提取模块202,用于基于预设的双模态神经网络模型,对历史通话记录对应的语音信息和文本信息分别进行特征提取,得到目标用户面对各种类型的话术模板时的语音特征和文本特征;程度确定模块203,用于根据目标用户面对各种类型的话术模板时的语音特征和文本特征确定目标用户面对各种类型的话术模板时的偏好程度;提示操作模块204,用于基于目标用户面对各种类型的话术模板时的偏好程度,从各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将第一目标话术模板进行提示操作,以使客服坐席根据第一目标话术模板与目标用户进行通话。以此,提高电话坐席服务的服务质量。The embodiment of the present application discloses a call processing apparatus, which is used to obtain historical call records of a target user through a record acquisition module 201, where the historical call records include call records when the target user is on a call based on various types of speech templates; The feature extraction module 202 is configured to perform feature extraction respectively on the voice information and text information corresponding to the historical call records based on the preset bimodal neural network model, and obtain the voice features and text feature; the degree determination module 203 is used to determine the preference degree of the target user when facing various types of discourse templates according to the speech features and text features of the target user when facing various types of discourse templates; the prompt operation module 204 is used to use Based on the preference degree of the target user when facing various types of speech templates, the first target speech template with the highest preference is determined from the various types of speech templates, and the first target speech template is prompted to operate. Make the customer service agent communicate with the target user according to the first target speech template. In this way, the service quality of the telephone agent service is improved.

相应的,本申请实施例还提供一种计算机设备,该计算机设备可以为终端。如图3所示,图3为本申请实施例提供的计算机设备的结构示意图。该计算机设备300包括有一个或者一个以上处理核心的处理器301、有一个或一个以上计算机可读存储介质的存储器302及存储在存储器302上并可在处理器上运行的计算机程序。其中,处理器301与存储器302电性连接。本领域技术人员可以理解,图中示出的计算机设备结构并不构成对计算机设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Correspondingly, an embodiment of the present application further provides a computer device, where the computer device may be a terminal. As shown in FIG. 3 , FIG. 3 is a schematic structural diagram of a computer device provided by an embodiment of the present application. The computer device 300 includes a processor 301 having one or more processing cores, a memory 302 having one or more computer-readable storage media, and a computer program stored on the memory 302 and executable on the processor. The processor 301 is electrically connected to the memory 302 . Those skilled in the art can understand that the computer device structure shown in the figures does not constitute a limitation on the computer device, and may include more or less components than the one shown, or combine some components, or arrange different components.

处理器301是计算机设备300的控制中心,利用各种接口和线路连接整个计算机设备300的各个部分,通过运行或加载存储在存储器302内的软件程序和/或模块,以及调用存储在存储器302内的数据,执行计算机设备300的各种功能和处理数据,从而对计算机设备300进行整体监控。The processor 301 is the control center of the computer device 300, and uses various interfaces and lines to connect various parts of the entire computer device 300, by running or loading the software programs and/or modules stored in the memory 302, and calling the software programs and/or modules stored in the memory 302. to perform various functions of the computer device 300 and process data, so as to monitor the computer device 300 as a whole.

在本申请实施例中,计算机设备300中的处理器301会按照如下的步骤,将一个或一个以上的应用程序的进程对应的指令加载到存储器302中,并由处理器301来运行存储在存储器302中的应用程序,从而实现各种功能:In this embodiment of the present application, the processor 301 in the computer device 300 loads the instructions corresponding to the processes of one or more application programs into the memory 302 according to the following steps, and the processor 301 executes the instructions stored in the memory. 302 application in order to achieve various functions:

获取目标用户的历史通话记录,上述历史通话记录中包括基于各种类型的话术模板对目标用户进行通话时的通话记录;Obtain the historical call records of the target user, the above-mentioned historical call records include the call records when the target user is called based on various types of speech templates;

基于预设的双模态神经网络模型,对历史通话记录对应的语音信息和文本信息分别进行特征提取,得到目标用户面对各种类型的话术模板时的语音特征和文本特征;Based on the preset dual-modal neural network model, feature extraction is performed on the voice information and text information corresponding to the historical call records, respectively, to obtain the voice features and text features of the target user when faced with various types of discourse templates;

根据目标用户面对各种类型的话术模板时的语音特征和文本特征确定目标用户面对各种类型的话术模板时的偏好程度;Determine the preference degree of the target user when faced with various types of discourse templates according to the speech features and text characteristics of the target user when faced with various types of discourse templates;

基于目标用户面对各种类型的话术模板时的偏好程度,从各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将第一目标话术模板进行提示操作,以使客服坐席根据第一目标话术模板与目标用户进行通话。Based on the preference degree of the target user when faced with various types of speech templates, the first target speech template with the highest preference is determined from the various types of speech templates, and the first target speech template is prompted to operate, so that the The customer service agent communicates with the target user according to the first target speech template.

以上各个操作的具体实施可参见前面的实施例,在此不再赘述。For the specific implementation of the above operations, reference may be made to the foregoing embodiments, and details are not described herein again.

可选的,如图3所示,计算机设备300还包括:触控显示屏303、射频电路304、音频电路305、输入单元306以及电源307。其中,处理器301分别与触控显示屏303、射频电路304、音频电路305、输入单元306以及电源307电性连接。本领域技术人员可以理解,图3中示出的计算机设备结构并不构成对计算机设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Optionally, as shown in FIG. 3 , the computer device 300 further includes: a touch display screen 303 , a radio frequency circuit 304 , an audio circuit 305 , an input unit 306 and a power supply 307 . The processor 301 is electrically connected to the touch display screen 303 , the radio frequency circuit 304 , the audio circuit 305 , the input unit 306 and the power supply 307 respectively. Those skilled in the art can understand that the computer device structure shown in FIG. 3 does not constitute a limitation to the computer device, and may include more or less components than the one shown, or combine some components, or arrange different components.

触控显示屏303可用于显示图形用户界面以及接收用户作用于图形用户界面产生的操作指令。触控显示屏303可以包括显示面板和触控面板。其中,显示面板可用于显示由用户输入的消息或提供给用户的消息以及计算机设备的各种图形用户接口,这些图形用户接口可以由图形、文本、图标、视频和其任意组合来构成。可选的,可以采用液晶显示器(LCD,Liquid Cryst客户端账户l Displ客户端账户y)、有机发光二极管(OLED,Org客户端账户nic Light-Emitting Diode)等形式来配置显示面板。触控面板可用于收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在触控面板上或在触控面板附近的操作),并生成相应的操作指令,且操作指令执行对应程序。可选的,触控面板可包括触摸检测装置和触摸控制器两个部分。其中,触摸检测装置检测用户的触摸方位,并检测触摸操作带来的信号,将信号传送给触摸控制器;触摸控制器从触摸检测装置上接收触摸消息,并将它转换成触点坐标,再送给处理器301,并能接收处理器301发来的命令并加以执行。触控面板可覆盖显示面板,当触控面板检测到在其上或附近的触摸操作后,传送给处理器301以确定触摸事件的类型,随后处理器301根据触摸事件的类型在显示面板上提供相应的视觉输出。在本申请实施例中,可以将触控面板与显示面板集成到触控显示屏303而实现输入和输出功能。但是在某些实施例中,触控面板与触控面板可以作为两个独立的部件来实现输入和输出功能。即触控显示屏303也可以作为输入单元306的一部分实现输入功能。在本申请实施例中,通过处理器301执行通话处理程序在触控显示屏303上显示话术模板。The touch screen 303 can be used to display a graphical user interface and receive operation instructions generated by the user acting on the graphical user interface. The touch display 303 may include a display panel and a touch panel. Among them, the display panel can be used to display messages input by the user or provided to the user and various graphical user interfaces of the computer equipment, which can be composed of graphics, text, icons, videos and any combination thereof. Optionally, the display panel may be configured in the form of a liquid crystal display (LCD, Liquid Crystal Displ y), an organic light emitting diode (OLED, Org nic Light-Emitting Diode). The touch panel can be used to collect the user's touch operations on or near it (such as the user's operations on or near the touch panel using a finger, stylus, etc., any suitable object or accessory), and generate corresponding operations instruction, and the operation instruction executes the corresponding program. Optionally, the touch panel may include two parts, a touch detection device and a touch controller. Among them, the touch detection device detects the user's touch orientation, detects the signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch message from the touch detection device, converts it into contact coordinates, and then sends it to the touch controller. To the processor 301, and can receive the command sent by the processor 301 and execute it. The touch panel can cover the display panel, and when the touch panel detects a touch operation on or near it, it is transmitted to the processor 301 to determine the type of the touch event, and then the processor 301 provides the display panel according to the type of the touch event. Corresponding visual output. In this embodiment of the present application, the touch panel and the display panel may be integrated into the touch display screen 303 to implement input and output functions. However, in some embodiments, the touch panel and the touch panel may be used as two independent components to implement input and output functions. That is, the touch display screen 303 can also be used as a part of the input unit 306 to realize the input function. In this embodiment of the present application, the processor 301 executes the call processing program to display the speech template on the touch screen 303 .

射频电路304可用于收发射频信号,以通过无线通信与网络设备或其他计算机设备建立无线通讯,与网络设备或其他计算机设备之间收发信号。The radio frequency circuit 304 can be used for transmitting and receiving radio frequency signals, so as to establish wireless communication with network equipment or other computer equipment through wireless communication, and send and receive signals with network equipment or other computer equipment.

音频电路305可以用于通过扬声器、传声器提供用户与计算机设备之间的音频接口。音频电路305可将接收到的音频数据转换后的电信号,传输到扬声器,由扬声器转换为声音信号输出;另一方面,传声器将收集的声音信号转换为电信号,由音频电路305接收后转换为音频数据,再将音频数据输出处理器301处理后,经射频电路304以发送给比如另一计算机设备,或者将音频数据输出至存储器302以便进一步处理。音频电路305还可能包括耳塞插孔,以提供外设耳机与计算机设备的通信。Audio circuitry 305 may be used to provide an audio interface between the user and computer equipment through speakers, microphones. The audio circuit 305 can transmit the electrical signal converted from the received audio data to the speaker, and the speaker converts it into a sound signal for output; on the other hand, the microphone converts the collected sound signal into an electrical signal, which is converted after being received by the audio circuit 305 In the form of audio data, the audio data is output to the processor 301 for processing, and then sent to, for example, another computer device via the radio frequency circuit 304, or the audio data is output to the memory 302 for further processing. Audio circuitry 305 may also include an ear jack to provide for communication of peripheral headphones with computer equipment.

输入单元306可用于接收输入的数字、字符消息或用户特征消息(例如指纹、虹膜、面部消息等),以及产生与用户设置以及功能控制有关的键盘、鼠标、操作杆、光学或者轨迹球信号输入。Input unit 306 may be used to receive input numeric, character messages or user characteristic messages (eg fingerprint, iris, facial message, etc.), as well as generate keyboard, mouse, joystick, optical or trackball signal input related to user settings and function control .

电源307用于给计算机设备300的各个部件供电。可选的,电源307可以通过电源管理系统与处理器301逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。电源307还可以包括一个或一个以上的直流或交流电源、再充电系统、电源故障检测电路、电源转换器或者逆变器、电源状态指示器等任意组件。Power supply 307 is used to power various components of computer device 300 . Optionally, the power supply 307 may be logically connected to the processor 301 through a power management system, so that functions such as charging, discharging, and power consumption management are implemented through the power management system. The power source 307 may also include one or more DC or AC power sources, recharging systems, power failure detection circuits, power converters or inverters, power status indicators, and any other components.

尽管图3中未示出,计算机设备300还可以包括摄像头、传感器、无线保真模块、蓝牙模块等,在此不再赘述。Although not shown in FIG. 3 , the computer device 300 may further include a camera, a sensor, a Wi-Fi module, a Bluetooth module, and the like, which will not be repeated here.

在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above-mentioned embodiments, the description of each embodiment has its own emphasis. For parts that are not described in detail in a certain embodiment, reference may be made to the relevant descriptions of other embodiments.

由上可知,本实施例提供的计算机设备,通过获取目标用户的历史通话记录,上述历史通话记录中包括基于各种类型的话术模板对目标用户进行通话时的通话记录;基于预设的双模态神经网络模型,对历史通话记录对应的语音信息和文本信息分别进行特征提取,得到目标用户面对各种类型的话术模板时的语音特征和文本特征;根据目标用户面对各种类型的话术模板时的语音特征和文本特征确定目标用户面对各种类型的话术模板时的偏好程度;基于目标用户面对各种类型的话术模板时的偏好程度,从各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将第一目标话术模板进行提示操作,以使客服坐席根据第一目标话术模板与目标用户进行通话。As can be seen from the above, the computer equipment provided in this embodiment obtains the historical call records of the target user, and the above-mentioned historical call records include the call records when the target user is called based on various types of speech templates; A neural network model is used to extract features from the voice information and text information corresponding to the historical call records, respectively, to obtain the voice features and text features of the target user when faced with various types of discourse templates; The speech features and text features of the template determine the preference degree of the target user when faced with various types of discourse templates; based on the target user's preference when faced with various types of discourse templates, the preferences are determined from various types of discourse templates The first target speech template with the highest degree is selected, and a prompt operation is performed on the first target speech template, so that the customer service agent can communicate with the target user according to the first target speech template.

本领域普通技术人员可以理解,上述实施例的各种方法中的全部或部分步骤可以通过指令来完成,或通过指令控制相关的硬件来完成,该指令可以存储于一计算机可读存储介质中,并由处理器进行加载和执行。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above-mentioned embodiments can be completed by instructions, or by instructions that control relevant hardware, and the instructions can be stored in a computer-readable storage medium, and loaded and executed by the processor.

为此,本申请实施例提供一种计算机可读存储介质,其中存储有多条计算机程序,该计算机程序能够被处理器进行加载,以执行本申请实施例所提供的任一种通话处理方法中的步骤。例如,该计算机程序可以执行如下步骤:To this end, the embodiments of the present application provide a computer-readable storage medium, in which a plurality of computer programs are stored, and the computer programs can be loaded by a processor to execute any of the call processing methods provided by the embodiments of the present application. A step of. For example, the computer program may perform the following steps:

获取目标用户的历史通话记录,上述历史通话记录中包括基于各种类型的话术模板对目标用户进行通话时的通话记录;Obtain the historical call records of the target user, the above-mentioned historical call records include the call records when the target user is called based on various types of speech templates;

基于预设的双模态神经网络模型,对历史通话记录对应的语音信息和文本信息分别进行特征提取,得到目标用户面对各种类型的话术模板时的语音特征和文本特征;Based on the preset dual-modal neural network model, feature extraction is performed on the voice information and text information corresponding to the historical call records, respectively, to obtain the voice features and text features of the target user when faced with various types of discourse templates;

根据目标用户面对各种类型的话术模板时的语音特征和文本特征确定目标用户面对各种类型的话术模板时的偏好程度;Determine the preference degree of the target user when faced with various types of discourse templates according to the speech features and text characteristics of the target user when faced with various types of discourse templates;

基于目标用户面对各种类型的话术模板时的偏好程度,从各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将第一目标话术模板进行提示操作,以使客服坐席根据第一目标话术模板与目标用户进行通话。Based on the preference degree of the target user when faced with various types of speech templates, the first target speech template with the highest preference is determined from the various types of speech templates, and the first target speech template is prompted to operate, so that the The customer service agent communicates with the target user according to the first target speech template.

以上各个操作的具体实施可参见前面的实施例,在此不再赘述。For the specific implementation of the above operations, reference may be made to the foregoing embodiments, and details are not described herein again.

其中,该存储介质可以包括:只读存储器(ROM,Re客户端账户d Only Memory)、随机存取记忆体(R客户端账户M,R客户端账户ndom客户端账户ccess Memory)、磁盘或光盘等。Wherein, the storage medium may include: read-only memory (ROM, Re client account d Only Memory), random access memory (R client account M, R client account ndom client account access Memory), magnetic disk or optical disk Wait.

由于该存储介质中所存储的计算机程序,可以执行本申请实施例所提供的任一种通话处理方法中的步骤,因此,可以实现本申请实施例所提供的任一种通话处理方法所能实现的有益效果,详见前面的实施例,在此不再赘述。Since the computer program stored in the storage medium can execute the steps in any of the call processing methods provided by the embodiments of the present application, it is possible to realize the implementation of any call processing methods provided by the embodiments of the present application. For the beneficial effects, please refer to the previous embodiments for details, which will not be repeated here.

以上对本申请实施例所提供的一种通话处理方法、装置、存储介质及计算机设备进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。A call processing method, device, storage medium, and computer equipment provided by the embodiments of the present application have been described in detail above, and the principles and implementations of the present application are described with specific examples. The descriptions of the above embodiments are only It is used to help understand the method and the core idea of the present application; meanwhile, for those skilled in the art, according to the idea of the present application, there will be changes in the specific embodiments and application scope. In summary, this specification The content should not be construed as a limitation on this application.

Claims (10)

1.一种通话处理方法,其特征在于,所述方法包括:1. A call processing method, characterized in that the method comprises: 获取目标用户的历史通话记录,所述历史通话记录中包括基于各种类型的话术模板对所述目标用户进行通话时的通话记录;Obtaining the historical call records of the target user, the historical call records including the call records when the target user is on a call based on various types of discourse templates; 基于预设的双模态神经网络模型,对所述历史通话记录对应的语音信息和文本信息分别进行特征提取,得到所述目标用户面对各种类型的话术模板时的语音特征和文本特征;Based on a preset dual-modal neural network model, feature extraction is performed on the voice information and text information corresponding to the historical call records, respectively, to obtain the voice features and text features of the target user when faced with various types of discourse templates; 根据所述目标用户面对各种类型的话术模板时的语音特征和文本特征确定所述目标用户面对各种类型的话术模板时的偏好程度;Determine the degree of preference of the target user when facing various types of discourse templates according to the speech features and text characteristics of the target user when facing various types of discourse templates; 基于所述目标用户面对各种类型的话术模板时的偏好程度,从所述各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将所述第一目标话术模板进行提示操作,以使客服坐席根据所述第一目标话术模板与所述目标用户进行通话。Based on the preference degree of the target user when facing various types of speech templates, a first target speech template with the highest preference degree is determined from the various types of speech templates, and the first target speech template A prompt operation is performed, so that the customer service agent communicates with the target user according to the first target speech template. 2.根据权利要求1所述的方法,其特征在于,还包括:2. The method of claim 1, further comprising: 当所述目标用户为新用户时,获取所述目标用户的属性信息;When the target user is a new user, obtain attribute information of the target user; 根据所述目标用户的属性信息确定与所述目标用户属性相似的历史用户;Determine historical users whose attributes are similar to the target user according to the attribute information of the target user; 获取所述历史用户对应的偏好程度最高的第二目标话术模板,并将所述第二目标话术模板进行提示操作,以使所述客服坐席根据所述第二目标话术模板与所述目标用户进行通话。Obtain the second target speech template with the highest degree of preference corresponding to the historical user, and perform a prompt operation on the second target speech template, so that the customer service agent can match the second target speech template with the The target user makes a call. 3.根据权利要求1所述的方法,其特征在于,还包括:3. The method of claim 1, further comprising: 当所述目标用户为新用户时,获取所述目标用户的预设时间段内的通话信息;When the target user is a new user, acquiring call information within a preset time period of the target user; 根据所述通话信息确定所述目标用户所需的业务场景;Determine the business scenario required by the target user according to the call information; 根据所述业务场景确定所述业务场景对应的第三目标话术模板,并将所述第三目标话术模板进行提示操作,以使所述客服坐席根据所述第三目标话术模板与所述目标用户进行通话。Determine a third target speech template corresponding to the business scenario according to the business scenario, and perform a prompt operation on the third target speech template, so that the customer service agent can match the third target speech template with the third target speech template according to the third target speech template. the target user to make a call. 4.根据权利要求1所述的方法,其特征在于,在将所述第一目标话术模板进行提示操作之后,还包括:4. The method according to claim 1, wherein after the prompt operation is performed on the first target speech template, the method further comprises: 实时获取通话过程中的通话记录,将所述通话记录输入预设的神经网络模型进行偏好程度预测,得到所述目标用户对所述第一目标话术模板的偏好程度;Obtain the call records during the call in real time, input the call records into a preset neural network model to predict the preference degree, and obtain the preference degree of the target user for the first target speech template; 当所述偏好程度小于预设阈值时,根据所述通话记录从预设的模板集合中选取第四目标话术模板,对所述第四目标话术模板进行提示操作,以使所述客服坐席根据所述第四目标话术模板与所述目标用户进行通话。When the preference degree is less than the preset threshold, select a fourth target speech template from the preset template set according to the call record, and perform a prompt operation on the fourth target speech template, so that the customer service agent Talk with the target user according to the fourth target speech template. 5.根据权利要求4所述的方法,其特征在于,所述根据所述通话记录从预设的模板集合中选取第四目标话术模板,包括:5. The method according to claim 4, wherein the selecting a fourth target speech template from a preset template set according to the call record comprises: 根据所述通话记录确定所述目标用户当前所需的业务场景;Determine the business scenario currently required by the target user according to the call record; 从所述模板集合中选取所述业务场景对应的第四目标话术模板。A fourth target speech template corresponding to the business scenario is selected from the template set. 6.根据权利要求1所述的方法,其特征在于,所述第一目标话术模板包括至少一个业务阶段,所述业务阶段对应至少一个话术子模板,在将所述第一目标话术模板进行提示操作之后,还包括:6 . The method according to claim 1 , wherein the first target discourse template comprises at least one business phase, the business phase corresponds to at least one discourse sub-template, and the first target discourse template After the template performs the prompt operation, it also includes: 获取目标用户的当前答复信息,根据所述当前答复信息确定当前业务阶段;Obtain the current reply information of the target user, and determine the current business stage according to the current reply information; 从所述当前业务阶段对应的至少一个话术子模板中选取目标子模板,并对所述目标子模板进行标亮提示。A target sub-template is selected from at least one speech sub-template corresponding to the current business stage, and a highlighting prompt is given to the target sub-template. 7.根据权利要求1所述的方法,其特征在于,所述从所述当前业务阶段对应的至少一个话术子模板中选取目标子模板,包括:7. The method according to claim 1, wherein the selecting a target sub-template from at least one speech sub-template corresponding to the current service stage comprises: 获取所述目标用户的情绪状态;obtaining the emotional state of the target user; 根据所述情绪状态从所述当前业务阶段对应的至少一个话术子模板中选取目标子模板。A target sub-template is selected from at least one speech sub-template corresponding to the current business stage according to the emotional state. 8.一种通话处理装置,其特征在于,所述装置包括:8. A call processing device, wherein the device comprises: 记录获取模块,用于获取目标用户的历史通话记录,所述历史通话记录中包括基于各种类型的话术模板对所述目标用户进行通话时的通话记录;a record acquisition module, used for acquiring the historical call records of the target user, the historical call records including the call records when the target user is on a call based on various types of speech templates; 特征提取模块,用于基于预设的双模态神经网络模型,对所述历史通话记录对应的语音信息和文本信息分别进行特征提取,得到所述目标用户面对各种类型的话术模板时的语音特征和文本特征;The feature extraction module is used to perform feature extraction on the voice information and text information corresponding to the historical call records based on a preset dual-modal neural network model, respectively, to obtain the target user when faced with various types of discourse templates. Voice features and text features; 程度确定模块,用于根据所述目标用户面对各种类型的话术模板时的语音特征和文本特征确定所述目标用户面对各种类型的话术模板时的偏好程度;a degree determination module, configured to determine the degree of preference of the target user when facing various types of discourse templates according to the speech features and text characteristics of the target user when facing various types of discourse templates; 提示操作模块,用于基于所述目标用户面对各种类型的话术模板时的偏好程度,从所述各种类型的话术模板中确定偏好程度最高的第一目标话术模板,并将所述第一目标话术模板进行提示操作,以使客服坐席根据所述第一目标话术模板与所述目标用户进行通话。The prompt operation module is configured to, based on the preference degree of the target user when facing various types of speech templates, determine the first target speech template with the highest preference from the various types of speech templates, and assign the The first target speech template performs a prompting operation, so that the customer service agent can communicate with the target user according to the first target speech template. 9.一种计算机设备,包括存储器,处理器及存储在存储器上并在处理器上运行的计算机程序,其中,所述处理器执行所述程序时实现如权利要求1至7任一项所述的通话处理方法。9. A computer device comprising a memory, a processor and a computer program stored on the memory and running on the processor, wherein the processor implements the program described in any one of claims 1 to 7 when the processor executes the program call handling method. 10.一种存储介质,其特征在于,所述存储介质存储有多条指令,所述指令适于处理器进行加载,以执行权利要求1至7任一项所述的通话处理方法。10 . A storage medium, characterized in that the storage medium stores a plurality of instructions, and the instructions are adapted to be loaded by a processor to execute the call processing method according to any one of claims 1 to 7 .
CN202210731432.4A 2022-06-24 2022-06-24 Call processing method and device, computer equipment and storage medium Pending CN115118820A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210731432.4A CN115118820A (en) 2022-06-24 2022-06-24 Call processing method and device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210731432.4A CN115118820A (en) 2022-06-24 2022-06-24 Call processing method and device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
CN115118820A true CN115118820A (en) 2022-09-27

Family

ID=83331616

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210731432.4A Pending CN115118820A (en) 2022-06-24 2022-06-24 Call processing method and device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN115118820A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116208712A (en) * 2023-05-04 2023-06-02 北京智齿众服技术咨询有限公司 Intelligent outbound method, system, equipment and medium for improving user intention

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8923489B1 (en) * 2013-04-24 2014-12-30 West Corporation Applying user preferences, behavioral patterns and environmental factors to an automated customer support application
CN106686267A (en) * 2015-11-10 2017-05-17 中国移动通信集团公司 A method and system for realizing personalized voice service
CN111683182A (en) * 2020-06-11 2020-09-18 中国银行股份有限公司 Service node processing method and system
CN113688221A (en) * 2021-09-08 2021-11-23 中国平安人寿保险股份有限公司 Model-based dialect recommendation method and device, computer equipment and storage medium
CN113779226A (en) * 2021-09-27 2021-12-10 泰康保险集团股份有限公司 Session requirement processing method and device, computer readable medium and electronic equipment
CN113821625A (en) * 2021-10-11 2021-12-21 中国平安人寿保险股份有限公司 Artificial intelligence based tactical recommendation method, device, equipment and medium
CN113961591A (en) * 2021-09-15 2022-01-21 北京三快在线科技有限公司 Information processing method, information processing device, electronic equipment and storage medium
CN114220461A (en) * 2021-12-15 2022-03-22 中国平安人寿保险股份有限公司 Customer service call guiding method, device, equipment and storage medium
CN114530142A (en) * 2022-03-23 2022-05-24 未鲲(上海)科技服务有限公司 Information recommendation method, device and equipment based on random forest and storage medium
CN114567702A (en) * 2022-02-25 2022-05-31 平安科技(深圳)有限公司 Intelligent seat auxiliary system, method, equipment and storage medium

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8923489B1 (en) * 2013-04-24 2014-12-30 West Corporation Applying user preferences, behavioral patterns and environmental factors to an automated customer support application
CN106686267A (en) * 2015-11-10 2017-05-17 中国移动通信集团公司 A method and system for realizing personalized voice service
CN111683182A (en) * 2020-06-11 2020-09-18 中国银行股份有限公司 Service node processing method and system
CN113688221A (en) * 2021-09-08 2021-11-23 中国平安人寿保险股份有限公司 Model-based dialect recommendation method and device, computer equipment and storage medium
CN113961591A (en) * 2021-09-15 2022-01-21 北京三快在线科技有限公司 Information processing method, information processing device, electronic equipment and storage medium
CN113779226A (en) * 2021-09-27 2021-12-10 泰康保险集团股份有限公司 Session requirement processing method and device, computer readable medium and electronic equipment
CN113821625A (en) * 2021-10-11 2021-12-21 中国平安人寿保险股份有限公司 Artificial intelligence based tactical recommendation method, device, equipment and medium
CN114220461A (en) * 2021-12-15 2022-03-22 中国平安人寿保险股份有限公司 Customer service call guiding method, device, equipment and storage medium
CN114567702A (en) * 2022-02-25 2022-05-31 平安科技(深圳)有限公司 Intelligent seat auxiliary system, method, equipment and storage medium
CN114530142A (en) * 2022-03-23 2022-05-24 未鲲(上海)科技服务有限公司 Information recommendation method, device and equipment based on random forest and storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116208712A (en) * 2023-05-04 2023-06-02 北京智齿众服技术咨询有限公司 Intelligent outbound method, system, equipment and medium for improving user intention

Similar Documents

Publication Publication Date Title
KR102624148B1 (en) Automatic navigation of interactive voice response (IVR) trees on behalf of human users
KR20240007888A (en) Electronic device and method for communicating with chatbot
US12424221B2 (en) Systems and methods for automating voice commands
CN112148850A (en) Dynamic interaction method, server, electronic device and storage medium
CN114500757B (en) Voice interaction method, device, computer equipment and storage medium
CN114631300A (en) Semi-delegated calls made by automated assistants on behalf of human participants
CN112148849A (en) Dynamic interaction method, server, electronic device and storage medium
CN107919138A (en) Mood processing method and mobile terminal in a kind of voice
CN114244793B (en) Information processing method, device, equipment and storage medium
CN115101053A (en) Emotion recognition-based conversation processing method and device, terminal and storage medium
US9369587B2 (en) System and method for software turret phone capabilities
CN112382290B (en) Voice interaction method, device, equipment and computer storage medium
CN116781652A (en) Message processing method, client, server, device and program product
CN115118820A (en) Call processing method and device, computer equipment and storage medium
WO2020221089A1 (en) Call interface display method, electronic device and computer readable medium
US10862841B1 (en) Systems and methods for automating voice commands
CN115103069A (en) Service recommendation method and device, computer equipment and storage medium
CN115514850B (en) Method, device, computer equipment and storage medium for value analysis of outbound call projects
WO2021109741A1 (en) Serving method, apparatus, system and device, and storage medium
CN107154996B (en) Incoming call interception method and device, storage medium and terminal
CN115065759A (en) External calling adjusting method, device, storage medium and computer equipment
CN115204631A (en) Product marketing management method and device, computer equipment and storage medium
CN115358776A (en) External calling adjusting method, device, storage medium and computer equipment
WO2021092415A1 (en) Systems and methods for automating voice commands
CN113472950A (en) Automatic response method, system and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination