CN116364090A

CN116364090A - Audio data analysis and processing method, server device and system

Info

Publication number: CN116364090A
Application number: CN202111612639.1A
Authority: CN
Inventors: 汤达祺
Original assignee: Guangzhou Shanchang Information Technology Co ltd
Current assignee: Guangzhou Shanchang Information Technology Co ltd
Priority date: 2021-12-27
Filing date: 2021-12-27
Publication date: 2023-06-30

Abstract

The present application relates to an audio data analysis and processing method, server device, system, equipment and storage medium. The audio data analysis and processing method described in the present application includes: obtaining the audio fragment of the target audio data at the current moment; if there is an incomplete audio frame in the obtained audio fragment, cutting the audio fragment, removing the A complete audio frame; analyzing and processing the clipped audio slices to obtain analysis and processing results of the audio slices; merging the analysis and processing results of continuous audio slices in the audio data. The audio data analysis and processing method described in this application has the advantage of improving the efficiency of audio data analysis and processing.

Description

Audio data analysis and processing method, server device and system

技术领域technical field

本申请涉及音频数据处理技术领域，特别是涉及一种音频数据分析处理方法、服务端装置、系统、设备以及存储介质。The present application relates to the technical field of audio data processing, in particular to an audio data analysis and processing method, server device, system, equipment and storage medium.

背景技术Background technique

随着计算机技术的普及，越来越多的用户可以通过应用程序进行录音上传来获得这段录音的分析结果，录音设备可以在录音的过程中同时进行上传工作，缩短上传的时间。With the popularization of computer technology, more and more users can upload the recording through the application program to obtain the analysis results of this recording. The recording device can upload the work at the same time during the recording process, shortening the upload time.

但在实践中发现，为了确保对数据的分析处理不出错误，服务端需要在接收到完整的录音后再进行分析处理工作，从而导致录音的音频数据分析处理效率不高。However, in practice, it is found that in order to ensure that the analysis and processing of the data do not make mistakes, the server needs to perform analysis and processing after receiving the complete recording, which leads to low efficiency of audio data analysis and processing of the recording.

发明人在研究过程中发现如何提高音频数据分析处理效率是一个需要解决的技术问题。During the research process, the inventor found that how to improve the efficiency of audio data analysis and processing is a technical problem that needs to be solved.

发明内容Contents of the invention

基于此，本申请的目的在于，提供一种音频数据分析处理方法、服务端装置、系统、设备以及存储介质，其具有提高音频数据分析处理效率的优点。Based on this, the purpose of the present application is to provide an audio data analysis and processing method, server device, system, equipment and storage medium, which have the advantage of improving the efficiency of audio data analysis and processing.

根据本申请实施例的第一方面，提供一种音频数据分析处理方法，所述音频数据分析处理方法包括：According to the first aspect of the embodiments of the present application, an audio data analysis and processing method is provided, the audio data analysis and processing method includes:

获取目标音频数据当前时刻的音频分片；Obtain the audio fragment of the target audio data at the current moment;

若获取的所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧；If there is an incomplete audio frame in the acquired audio fragment, cutting the audio fragment to remove the incomplete audio frame;

对裁剪后的所述音频分片进行分析处理，获得所述音频分片的分析处理结果；Analyzing and processing the clipped audio fragments to obtain analysis and processing results of the audio fragments;

合并在所述音频数据中连续的音频分片的分析处理结果。The analysis processing results of consecutive audio slices in the audio data are merged.

根据本申请实施例的第二方面，提供一种音频数据分析处理方法，所述音频数据分析处理方法包括：According to the second aspect of the embodiments of the present application, there is provided an audio data analysis and processing method, the audio data analysis and processing method comprising:

在客户端采集目标音频数据；当检测到当前采集到的目标音频数据满足预设条件时，将当前采集到的目标音频数据切割为音频分片，将所述音频分片上传至服务端；Collecting target audio data at the client; when detecting that the target audio data currently collected meets preset conditions, cutting the currently collected target audio data into audio fragments, and uploading the audio fragments to the server;

在服务端获取目标音频数据的音频分片；若获取的所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧；对裁剪后的所述音频分片进行分析处理，获得所述音频分片分析处理结果；合并在所述音频数据中连续的音频分片的分析处理结果，并发送至客户端。Acquire the audio fragments of the target audio data at the server; if there are incomplete audio frames in the obtained audio fragments, cut the audio fragments to remove the incomplete audio frames; The audio fragments are analyzed and processed to obtain the analysis and processing results of the audio fragments; the analysis and processing results of the continuous audio fragments in the audio data are combined and sent to the client.

根据本申请实施例的第三方面，提供一种音频数据分析处理系统，包括客户端和服务端，其特征在于，所述客户端用于采集目标音频数据；当检测到当前采集到的目标音频数据满足预设条件时，将当前采集到的目标音频数据截断为音频分片，将所述音频分片上传至服务端；According to a third aspect of the embodiment of the present application, an audio data analysis and processing system is provided, including a client and a server, wherein the client is used to collect target audio data; when the currently collected target audio is detected When the data meets the preset conditions, the currently collected target audio data is truncated into audio fragments, and the audio fragments are uploaded to the server;

所述服务端用于获取目标音频数据的音频分片；若获取的所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧；对裁剪后的所述音频分片进行分析处理，获得所述音频分片分析处理结果；合并在所述音频数据中连续的音频分片的分析处理结果，并发送至客户端。The server is used to obtain the audio fragments of the target audio data; if there are incomplete audio frames in the obtained audio fragments, the audio fragments are cut to remove the incomplete audio frames; Analyzing and processing the audio fragments to obtain the analysis and processing results of the audio fragments; merging the analysis and processing results of consecutive audio fragments in the audio data, and sending them to the client.

根据本申请实施例的第四方面，提供一种音频数据分析处理服务端装置，所述装置包括：According to a fourth aspect of the embodiments of the present application, there is provided an audio data analysis and processing server device, the device comprising:

获取模块，用于获取目标音频数据当前时刻的音频分片；The obtaining module is used to obtain the audio fragmentation of the current moment of the target audio data;

裁剪模块，用于若获取的所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧；A clipping module, configured to clip the audio clips to remove incomplete audio frames if there are incomplete audio frames in the acquired audio clips;

分析处理模块，用于对裁剪后的所述音频完整分片进行分析处理，获得所述音频分片的分析处理结果；An analysis and processing module, configured to analyze and process the clipped complete audio fragments, and obtain analysis and processing results of the audio fragments;

结果合并模块，用于合并在所述音频数据中连续的音频分片的分析处理结果。The result merging module is used for merging the analysis and processing results of continuous audio slices in the audio data.

根据本申请实施例的第五方面，提供一种电子设备，包括：处理器和存储器；其中，所述存储器存储有计算机程序，所述计算机程序适于由所述处理器加载并执行任意一项所述的音频数据分析处理方法。According to a fifth aspect of the embodiments of the present application, there is provided an electronic device, including: a processor and a memory; wherein, the memory stores a computer program, and the computer program is suitable for being loaded by the processor and executing any one The audio data analysis and processing method.

根据本申请实施例的第六方面，提供一种计算机可读存储介质，其上储存有计算机程序，其特征在于，该计算机程序被处理器执行时实现任意一项所述的音频数据分析处理方法。According to the sixth aspect of the embodiments of the present application, there is provided a computer-readable storage medium, on which a computer program is stored, and it is characterized in that, when the computer program is executed by a processor, any one of the audio data analysis and processing methods described above is implemented .

本申请中，可以在获取到一个单独的音频分片时，若所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧，可以避免因不完整的音频帧导致的解析失败问题，从而可以对单独的音频分片进行分析处理，不需要等待所有的音频分片上传完成后进行合并再进行分析处理，可以提前进行分析处理，在对多个音频分片进行分析处理后，可以将在内容上具有连续性的音频分片的分析结果进行合并，不需要等待所有的音频分片都分析处理完毕后再逐一进行合并，进一步缩短了音频数据分析处理的时间，从而提高了音频数据分析处理的效率。In this application, when a single audio fragment is obtained, if there is an incomplete audio frame in the audio fragment, the audio fragment can be trimmed to remove the incomplete audio frame, which can avoid The problem of parsing failure caused by the complete audio frame, so that individual audio fragments can be analyzed and processed. There is no need to wait for all audio fragments to be uploaded and merged before analysis and processing. Analysis and processing can be performed in advance. Multiple After the audio fragments are analyzed and processed, the analysis results of the audio fragments with continuity in content can be merged. There is no need to wait for all the audio fragments to be analyzed and processed before merging them one by one, which further shortens the audio data analysis. Processing time, thereby improving the efficiency of audio data analysis and processing.

应当理解的是，以上的一般描述和后文的细节描述仅是示例性和解释性的，并不能限制本申请。It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the application.

为了更好地理解和实施，下面结合附图详细说明本申请。For better understanding and implementation, the present application will be described in detail below in conjunction with the accompanying drawings.

附图说明Description of drawings

为了更清楚地说明本申请实施例或现有技术中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本申请的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments of the present application. Those skilled in the art can also obtain other drawings based on these drawings without creative work.

图1为本申请一个实施例提供的一种音频数据分析处理方法的应用环境的示意框图；FIG. 1 is a schematic block diagram of an application environment of an audio data analysis and processing method provided by an embodiment of the present application;

图2为本申请一个实施例提供的一种音频数据分析处理方法的流程图；FIG. 2 is a flowchart of an audio data analysis and processing method provided by an embodiment of the present application;

图3为本申请另一个实施例提供的一种音频数据分析处理方法的流程图；FIG. 3 is a flow chart of an audio data analysis and processing method provided in another embodiment of the present application;

图4为本申请另一个实施例提供的一种音频数据分析处理方法的流程图；FIG. 4 is a flow chart of an audio data analysis and processing method provided in another embodiment of the present application;

图5为本申请另一个实施例提供的一种音频数据分析处理方法的交互图；FIG. 5 is an interactive diagram of an audio data analysis and processing method provided by another embodiment of the present application;

图6为本申请一个实施例提供的音频数据分析处理服务端装置的结构示意图；FIG. 6 is a schematic structural diagram of an audio data analysis and processing server device provided by an embodiment of the present application;

图7为本申请一个实施例提供的一种电子设备的结构示意框图。Fig. 7 is a schematic structural block diagram of an electronic device provided by an embodiment of the present application.

具体实施方式Detailed ways

为使本申请的目的、技术方案和优点更加清楚，下面将结合附图对本申请实施例方式作进一步地详细描述。In order to make the purpose, technical solution and advantages of the present application clearer, the embodiments of the present application will be further described in detail below in conjunction with the accompanying drawings.

应当明确，所描述的实施例仅仅是本申请一部分实施例，而不是全部的实施例。基于本申请中的实施例，本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其它实施例，都属于本申请保护的范围。It should be clear that the described embodiments are only some of the embodiments of the present application, not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.

下面的描述涉及附图时，除非另有表示，不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本申请相一致的所有实施方式。相反，它们仅是如所附权利要求书中所详述的、本申请的一些方面相一致的装置和方法的例子。When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with this application. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present application as recited in the appended claims.

在本申请的描述中，需要理解的是，术语“第一”、“第二”、“第三”等仅用于区别类似的对象，而不必用于描述特定的顺序或先后次序，也不能理解为指示或暗示相对重要性。对于本领域的普通技术人员而言，可以根据具体情况理解上述术语在本申请中的具体含义。在本申请和所附权利要求书中所使用的单数形式的“一种”、“所述”和“该”也旨在包括多数形式，除非上下文清楚地表示其他含义。在此所使用的词语“如果”/“若”可以被解释成为“在……时”或“当……时”或“响应于确定”。此外，在本申请的描述中，除非另有说明，“多个”是指两个或两个以上。“和/或”，描述关联对象的关联关系，表示可以存在三种关系，例如，A和/或B，可以表示：单独存在A，同时存在A和B，单独存在B这三种情况。字符“/”一般表示前后关联对象是一种“或”的关系。In the description of the present application, it should be understood that the terms "first", "second", "third", etc. are only used to distinguish similar objects, and are not necessarily used to describe a specific order or sequence, nor can they be Read as indicating or implying relative importance. Those of ordinary skill in the art can understand the specific meanings of the above terms in this application according to specific situations. As used in this application and the appended claims, the singular forms "a", "the", and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. As used herein, the words "if"/"if" may be construed as "at" or "when" or "in response to a determination". In addition, in the description of the present application, unless otherwise specified, "plurality" means two or more. "And/or" describes the association relationship of associated objects, indicating that there may be three types of relationships, for example, A and/or B may indicate: A exists alone, A and B exist simultaneously, and B exists independently. The character "/" generally indicates that the contextual objects are an "or" relationship.

请参阅图1，其是本申请一个实施例提供的一种音频数据分析处理方法的应用环境的示意框图。如图1所示，本申请实施例的音频数据分析处理方法的应用环境包括客户端1000以及服务端2000。其中，客户端1000用于采集目标音频数据；当检测到当前采集到的目标音频数据满足预设条件时，将当前采集到的目标音频数据截断为音频分片，将所述音频分片上传至服务端2000；服务端2000用于获取目标音频数据的音频分片；若获取的所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧；对裁剪后的所述音频分片进行分析处理，获得所述音频分片分析处理结果；合并在所述音频数据中连续的音频分片的分析处理结果，并发送至客户端1000。Please refer to FIG. 1 , which is a schematic block diagram of an application environment of an audio data analysis and processing method provided by an embodiment of the present application. As shown in FIG. 1 , the application environment of the audio data analysis and processing method of the embodiment of the present application includes a client 1000 and a server 2000 . Wherein, the client 1000 is used to collect target audio data; when it is detected that the currently collected target audio data meets the preset conditions, the currently collected target audio data is truncated into audio fragments, and the audio fragments are uploaded to The server 2000; the server 2000 is used to obtain the audio fragment of the target audio data; if there is an incomplete audio frame in the obtained audio fragment, the audio fragment is cut to remove the incomplete audio frame; Analyzing and processing the clipped audio fragments to obtain the analysis and processing results of the audio fragments; merging the analysis and processing results of the continuous audio fragments in the audio data, and sending them to the client 1000 .

实施例一：Embodiment one:

本申请实施例公开了一种音频数据分析处理方法，该方法应用于服务端。The embodiment of the present application discloses a method for analyzing and processing audio data, and the method is applied to a server.

下面将结合附图2，对本申请实施例提供的一种音频数据分析处理方法进行详细介绍。A method for analyzing and processing audio data provided by an embodiment of the present application will be described in detail below with reference to FIG. 2 .

请参阅图2，本申请实施例提供的一种音频数据分析处理方法，包括如下步骤：Please refer to Figure 2, a method for analyzing and processing audio data provided in the embodiment of the present application, including the following steps:

步骤S101：获取目标音频数据当前时刻的音频分片。Step S101: Obtain the audio segment of the target audio data at the current moment.

其中，所述目标音频数据可以是指用户在客户端设备进行录音的一段完整的录音内容数据。Wherein, the target audio data may refer to a piece of complete recording content data recorded by the user on the client device.

其中，所述音频分片可以是指所述目标音频数据的一部分数据。Wherein, the audio fragment may refer to a part of the target audio data.

在本实施例中，当检测到录音内容满足一定的条件后，例如录音时长达到了预设的时间长度阈值，客户端可以先将这部分录音内容打包成一个音频分片上传至服务端，服务端可以获取到目标音频数据当前时刻的音频分片。假设所述目标音频数据为X＝{x₁，x₂，...，x_m}，当录音内容满足一定条件后，切割为一个单独的音频分片x_n(0<n≤m)，通过网络将音频分片x_n提前传输给服务器端，在这个过程中，用户同步进行下一段音频分片x_n+1的录制。可选的，获取到音频分片x_n后，可以将音频分片传递一份到共享存储中，用于音频合并等其他操作。In this embodiment, when it is detected that the recording content satisfies certain conditions, for example, the recording duration reaches the preset time length threshold, the client can first package this part of the recording content into an audio segment and upload it to the server, and the service The terminal can obtain the audio fragment of the target audio data at the current moment. Assuming that the target audio data is X={x ₁ , x ₂ ,..., x _m }, when the recording content satisfies certain conditions, it is cut into a separate audio slice x _n (0<n≤m), The audio fragment x _n is transmitted to the server in advance through the network. During this process, the user simultaneously records the next audio fragment x _n+1 . Optionally, after the audio fragment x _n is obtained, a copy of the audio fragment can be transferred to the shared storage for other operations such as audio merging.

随着客户端的不断录音，切割以及打包音频分片上传，服务端可依次获取到多个由客户端上传的连续的音频分片，当然，由于网络传输延迟等因素影响，也可能服务端获取到多个音频分片并不连续，或者只有一部分连续。As the client continues to record, cut and package audio fragments to upload, the server can obtain multiple consecutive audio fragments uploaded by the client in turn. Of course, due to network transmission delays and other factors, the server may also obtain Multiple audio fragments are not contiguous, or only some of them are contiguous.

作为一种可选的实施方式，所述获取目标音频数据当前时刻的音频分片之后，所述音频数据分析处理方法还包括：As an optional implementation manner, after the acquisition of the audio slice at the current moment of the target audio data, the audio data analysis and processing method further includes:

获取位于所述音频分片的起始位置的起始音频帧，以及获取位于所述音频分片的末尾位置的末尾音频帧；obtaining a start audio frame located at a start position of the audio slice, and obtaining an end audio frame located at an end position of the audio slice;

若所述起始音频帧为不完整的音频帧和/或所述末尾音频帧为不完整的音频帧，确定所述音频分片存在不完整的音频帧；If the start audio frame is an incomplete audio frame and/or the end audio frame is an incomplete audio frame, determine that there is an incomplete audio frame in the audio slice;

或，or,

若所述起始音频帧为不完整的音频帧和/或所述末尾音频帧为不完整的音频帧，确定所述音频分片存在不完整的音频帧。If the start audio frame is an incomplete audio frame and/or the end audio frame is an incomplete audio frame, it is determined that there is an incomplete audio frame in the audio slice.

本申请实施例中，音频分片是切割数据获取到的，这可能会导致位于所述音频分片的起始位置的起始音频帧被切割出一部分，或者可能会导致位于所述音频分片的末尾位置的末尾音频帧被切割出一部分。位于所述音频分片其他位置的音频帧不会被切割，可以确定位于所述音频分片其他位置的音频帧是完整的音频帧。因此，若所述起始音频帧为不完整的音频帧和/或所述末尾音频帧为不完整的音频帧，可以确定所述音频分片存在不完整的音频帧。不需要对每一个音频帧进行判断，节约了系统资源，提高了系统相关操作的效率。In the embodiment of the present application, the audio fragment is obtained by cutting data, which may result in cutting out a part of the start audio frame located at the start position of the audio fragment, or may result in A portion of the end audio frame is cut off at the end position. The audio frames located in other positions of the audio slice will not be cut, and it can be determined that the audio frames located in other positions of the audio slice are complete audio frames. Therefore, if the start audio frame is an incomplete audio frame and/or the end audio frame is an incomplete audio frame, it may be determined that there is an incomplete audio frame in the audio slice. There is no need to judge each audio frame, which saves system resources and improves the efficiency of system-related operations.

步骤S102：若获取的所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧。Step S102: If there is an incomplete audio frame in the acquired audio segment, trim the audio segment to remove the incomplete audio frame.

其中，所述音频帧可以是指一个数据长度单位的音频采样数据，不同的编码格式可以对应不同的音频帧。Wherein, the audio frame may refer to audio sample data of one data length unit, and different encoding formats may correspond to different audio frames.

其中，音频分片是切割过的数据，可能会存在不完整音频帧，不完整的音频帧会导致音频分片的解析失败，即无法单独对音频分片进行分析处理，为了确定可以单独对音频分片进行分析处理，需要移除音频分片中不完整的音频帧。Among them, audio fragments are cut data, and there may be incomplete audio frames. Incomplete audio frames will cause the analysis of audio fragments to fail, that is, audio fragments cannot be analyzed and processed separately. In order to determine that audio fragments can be analyzed separately Fragments are analyzed and processed, and incomplete audio frames in the audio fragments need to be removed.

本发明实施例中，可以将不完整的音频帧确定出来，然后删除掉这些不完整的音频帧以完成对所述音频分片的裁剪。In the embodiment of the present invention, incomplete audio frames may be determined, and then these incomplete audio frames may be deleted to complete the clipping of the audio slices.

步骤S103：对裁剪后的所述音频分片进行分析处理，获得所述音频分片的分析处理结果。Step S103: Analyzing and processing the clipped audio fragments to obtain an analysis and processing result of the audio fragments.

其中，所述分析处理可以是根据预设的规则对音频分片的录音内容进行评价，得到的分析处理结果可以是一个分数或者等级，比如对一段英语录音的进行打分，得到分数，比如80分，或者得到等级，比如A。Wherein, the analysis processing can be to evaluate the recording content of the audio fragment according to preset rules, and the obtained analysis processing result can be a score or a grade, such as scoring a section of English recording, and obtaining a score, such as 80 points , or get a grade, say A.

步骤S104：合并在所述音频数据中连续的音频分片的分析处理结果。Step S104: Merge analysis and processing results of consecutive audio slices in the audio data.

其中，所述连续的音频分片可以是指已被接收到的具有相邻分片序号的音频分片。Wherein, the continuous audio fragments may refer to received audio fragments with adjacent fragment numbers.

其中，每个音频分片在网络进行传输之前可以按照音频分片本身在所述目标音频数据中的组成位置分配有一个序号，即分配有所述预设分片序号，全部的音频分片可以按照所述预设分片序号组合为所述音频数据的全部数据内容，具有相邻预设分片序号的音频分片在整体上可以构成一段连续的数据内容。Wherein, each audio fragment can be assigned a sequence number according to the composition position of the audio fragment itself in the target audio data before the network is transmitted, that is, the preset fragment sequence number is assigned, and all audio fragments can be According to the combination of the preset segment numbers into the entire data content of the audio data, audio segments with adjacent preset segment numbers may constitute a piece of continuous data content as a whole.

可选的，所述合并在所述音频数据中连续的音频分片的分析处理结果包括：Optionally, the analysis and processing results of merging the continuous audio slices in the audio data include:

若所述目标音频数据的历史分片中存在连续分片，对所述连续分片的分析处理结果进行合并，其中，所述连续分片是具有相邻预设分片序号的历史分片。If there are continuous fragments in the historical fragments of the target audio data, the analysis and processing results of the continuous fragments are merged, wherein the continuous fragments are historical fragments with adjacent preset fragment numbers.

其中，所述历史分片可以是指在当前时刻之前已获取到的并属于所述目标音频数据的音频分片。Wherein, the historical segment may refer to an audio segment that has been obtained before the current moment and belongs to the target audio data.

其中，每个音频分片在网络进行传输之前可以按照音频分片本身在所述目标音频数据中的组成位置分配有一个序号，即分配有所述预设分片序号，具有相邻预设分片序号的音频分片在整体上可以构成一段连续的数据内容。Wherein, each audio fragment can be assigned a sequence number according to the composition position of the audio fragment itself in the target audio data before the network transmits, that is, the preset fragment sequence number is assigned, and there are adjacent preset fragments. The audio slices of the slice number can constitute a piece of continuous data content as a whole.

可选的，若获取到的所述目标音频数据的历史分片中不存在连续分片，则无法进行分析处理结果的合并，可以将当前得到的分析处理结果暂时保存起来，待后续的音频分片到达后继续分析处理。Optionally, if there are no continuous fragments in the acquired historical fragments of the target audio data, the analysis and processing results cannot be merged, and the currently obtained analysis and processing results can be temporarily saved for subsequent audio analysis. Analysis and processing continued after slices arrived.

其中，合并后的分析处理结果可以保存在共享存储中，可以用于和后续的分析处理结果进行合并，提前进行合并操作，提高了处理效率。对所述连续分片的分析结果进行合并，可以根据分析结果的不同类型进行不同的合并方式，比如：若分析结果是一个评价分数，则合并方式可以为求多个分析结果的平均值；若分析结果是检测出来的语句错误，则合并方式可以将多个分析结果组成一个错误集合，可以用数组或者其他数据结构表示。Wherein, the combined analysis and processing results can be stored in the shared storage, and can be used for merging with subsequent analysis and processing results, and the merging operation can be performed in advance to improve the processing efficiency. Merging the analysis results of the continuous slices can be combined in different ways according to different types of analysis results, for example: if the analysis result is an evaluation score, the combination method can be to find the average value of multiple analysis results; if If the analysis result is a detected statement error, the merge method can combine multiple analysis results into an error set, which can be represented by an array or other data structures.

作为一种可选的实施方式，所述合并在所述音频数据中连续的音频分片的分析处理结果之后，所述音频数据分析处理方法还包括：As an optional implementation manner, after the merging of the analysis and processing results of continuous audio slices in the audio data, the audio data analysis and processing method further includes:

当接收到所述目标音频数据所有的音频分片后，合并所有的音频分片的分析处理结果；After receiving all the audio fragments of the target audio data, merging the analysis and processing results of all the audio fragments;

发送合并后的所述分析处理结果。Sending the combined analysis and processing results.

本申请实施例中，当接收到所述目标音频数据所有的音频分片后，即表示已经完成了所有的音频分片的分析处理工作，可以合并所有的音频分片的分析处理结果，得到最终的分析处理结果。可以将合并后的所述分析处理结果发送至客户端，由客户端将合并后的所述分析处理结果显示给用户。In the embodiment of the present application, when all audio fragments of the target audio data are received, it means that the analysis and processing work of all audio fragments has been completed, and the analysis and processing results of all audio fragments can be combined to obtain the final analysis and processing results. The combined analysis and processing results may be sent to the client, and the client will display the combined analysis and processing results to the user.

可选的，可以统计分析处理结果的数量，获得结果数量，若所述结果数量等于预设数量阈值，可以确定接收到所述目标音频数据所有的音频分片。Optionally, the number of analysis and processing results may be counted to obtain the number of results, and if the number of results is equal to a preset number threshold, it may be determined that all audio fragments receiving the target audio data are received.

可选的，当接收到携带有最后一个预设分片序号的音频分片后，则可以确定接收到所述目标音频数据所有的音频分片，可以阻塞等待和返回所有音频分片的合并、分析处理结果的合并等操作。Optionally, after receiving the audio fragment carrying the last preset fragment sequence number, it can be determined that all audio fragments of the target audio data have been received, and the merging of all audio fragments can be blocked, waited and returned, Operations such as merging of analysis and processing results.

本申请实施例中，可以在获取到一个单独的音频分片时，若所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧，可以避免因不完整的音频帧导致的解析失败问题，从而可以对单独的音频分片进行分析处理，不需要等待所有的音频分片上传完成后进行合并再进行分析处理，可以提前进行分析处理，在对多个音频分片进行分析处理后，可以将在内容上具有连续性的音频分片的分析结果进行合并，不需要等待所有的音频分片都分析处理完毕后再逐一进行合并，进一步缩短了音频数据分析处理的时间，从而提高了音频数据分析处理的效率。In the embodiment of the present application, when a single audio fragment is obtained, if there is an incomplete audio frame in the audio fragment, the audio fragment can be trimmed to remove the incomplete audio frame, which can avoid Due to the problem of parsing failure caused by incomplete audio frames, individual audio fragments can be analyzed and processed. There is no need to wait for all audio fragments to be uploaded and merged before analysis and processing. Analysis and processing can be performed in advance. After multiple audio fragments are analyzed and processed, the analysis results of the audio fragments with continuity in content can be merged. There is no need to wait for all the audio fragments to be analyzed and processed before merging them one by one, further shortening the audio frequency. The time for data analysis and processing is reduced, thereby improving the efficiency of audio data analysis and processing.

实施例二：Embodiment two:

请参阅图3，本实施例与实施例一的主要区别在于，在获取目标音频数据当前时刻的音频分片之后，还包括如下步骤：Please refer to Fig. 3, the main difference between this embodiment and Embodiment 1 is that, after obtaining the audio slice of the current moment of the target audio data, the following steps are also included:

步骤S201：若所述目标音频数据的历史分片中存在连续分片，对所述连续分片进行合并，其中，所述连续分片是具有相邻预设分片序号的历史分片；Step S201: If there are continuous fragments in the historical fragments of the target audio data, merge the continuous fragments, wherein the continuous fragments are historical fragments with adjacent preset fragment numbers;

其中，合并可以是指将所述连续分片按照其预设分片序号的前后顺序进行组合。Wherein, merging may refer to combining the consecutive segments according to the sequence of their preset segment numbers.

作为一种可选的实施方式，所述音频数据分析处理方法还包括：As an optional implementation manner, the audio data analysis and processing method further includes:

若获取到的所述目标音频数据的历史分片中不存在连续分片，将所述音频分片添加至所述历史分片中。If there is no continuous segment in the acquired historical segment of the target audio data, add the audio segment to the historical segment.

可选的，还可在所述历史分片中将所述连续分片替换为合并后的连续分片。Optionally, in the historical shards, the continuous shards may also be replaced with merged continuous shards.

其中，服务端接收获取到音频分片会在一个缓存区域作为历史分片进行存储，当服务端进行连续分片的合并后，可以在所述缓存区域内将用于合并的连续分片删除掉，然后将合并后的连续分片存储进去，完成替换，合并后的连续分片拥有用于合并的连续分片的全部预设分片序号，合并后的连续分片可以用于和之后接收到的音频分片进行合并，避免了重复合并造成的资源浪费。Among them, the audio fragments received by the server will be stored in a cache area as historical fragments. After the server merges the continuous fragments, the continuous fragments used for merging can be deleted in the cache area. , and then store the merged continuous fragments to complete the replacement. The merged continuous fragments have all the preset fragment serial numbers of the merged continuous fragments, and the merged continuous fragments can be used for and received later The audio fragments are merged to avoid resource waste caused by repeated merging.

步骤S202：当接收到所述目标音频数据所有的音频分片后，合并所有的音频分片。Step S202: After receiving all the audio fragments of the target audio data, merge all the audio fragments.

本申请实施例中，如图4所示，在获取目标音频数据当前时刻的音频分片之后，所述音频分片存储至共享存储中，可以对音频分片进行合并操作，是和对应的分析处理操作同时进行的，可以通过多线程或者多个服务端同时进行不同的工作，通过预设分片序号确定音频分片的合并顺序，避免了音频分片乱序的问题，服务端接收到乱序的音频分片也可以立刻进行合并操作，不需要等待所有的音频分片到达后才开始进行合并操作，当接收到最后一个音频分片时，只需要对该最后一个音频分片进行合并就可以完成所有音频分片的合并操作，提高了音频分片的合并效率。In the embodiment of the present application, as shown in Figure 4, after acquiring the audio fragments of the target audio data at the current moment, the audio fragments are stored in the shared storage, and the audio fragments can be merged, which is the corresponding analysis If the processing operation is carried out at the same time, different tasks can be performed at the same time through multiple threads or multiple servers. The merging sequence of the audio fragments is determined by the preset fragment number, which avoids the problem of out-of-order audio fragments. The sequenced audio fragments can also be merged immediately. It is not necessary to wait for all the audio fragments to arrive before starting the merge operation. When the last audio fragment is received, only the last audio fragment needs to be merged. The merging operation of all audio fragments can be completed, and the merging efficiency of the audio fragments is improved.

步骤S203：存储合并后的音频分片。Step S203: storing the combined audio fragments.

所述合并后的音频分片可储存于永久存储区域，可以让用户随时进行访问。The combined audio fragments can be stored in a permanent storage area and can be accessed by users at any time.

本申请实施例中，可以将合并后的音频分片进行持久化存储，便于让用户后续进行访问。In the embodiment of the present application, the merged audio fragments can be persistently stored for subsequent access by users.

可选的，若未接收到所述目标音频数据所有的音频分片，可以生成并发送接收成功信息。Optionally, if all audio fragments of the target audio data have not been received, reception success information may be generated and sent.

在该可选的实施例中，可以生成提示信息用于提示用户该音频分片已接收成功。In this optional embodiment, prompt information may be generated to prompt the user that the audio fragment has been successfully received.

可选的，若已接收到所述目标音频数据所有的音频分片，生成并发送等待分析处理结果信息。Optionally, if all audio fragments of the target audio data have been received, generate and send waiting analysis and processing result information.

在该可选的实施例中，若已接收到所述目标音频数据所有的音频分片，可以生成提示信息来提示用户等待分析处理结果。In this optional embodiment, if all the audio fragments of the target audio data have been received, prompt information may be generated to prompt the user to wait for the analysis and processing result.

实施例三：Embodiment three:

请参阅图5，本申请实施例提供的一种音频数据分析处理方法，包括如下步骤：Please refer to Fig. 5, a method for analyzing and processing audio data provided by the embodiment of the present application, including the following steps:

步骤S301：在客户端采集目标音频数据，当检测到当前采集到的目标音频数据满足预设条件时，将当前采集到的目标音频数据切割为音频分片，将所述音频分片上传至服务端。Step S301: Collect target audio data on the client side, and when it is detected that the currently collected target audio data meets the preset conditions, cut the currently collected target audio data into audio fragments, and upload the audio fragments to the service end.

其中，客户端可以采集用户的音频信息，将音频信息转换为音频数据，并可以将音频数据打包成音频分片进行数据上传等操作。客户端可以在录音的录制过程中同步进行对录制内容的切割，并打包成音频分片进行上传。Among them, the client can collect the user's audio information, convert the audio information into audio data, and can package the audio data into audio fragments for data upload and other operations. The client can cut the recorded content synchronously during the recording process, and package it into audio fragments for uploading.

其中，所述预设条件可以是预设的录音时长阈值，假设录音时长阈值为10秒，录音过程中每隔10秒就将当前采集到的音频数据打包成音频分片，并添加一个自增的分片序号到音频分片里去，比如当前打包的音频分片是第3个音频分片，则添加的分片序号为3，完成对当前获得的音频数据的切割。Wherein, the preset condition may be a preset recording duration threshold, assuming that the recording duration threshold is 10 seconds, the currently collected audio data is packaged into audio fragments every 10 seconds during the recording process, and an auto-incrementing The segment number of the audio segment is added to the audio segment. For example, if the currently packaged audio segment is the third audio segment, the added segment number is 3 to complete the cutting of the currently obtained audio data.

步骤S302：在服务端获取目标音频数据的音频分片；若获取的所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧；对裁剪后的所述音频分片进行分析处理，获得所述音频分片分析处理结果；合并在所述音频数据中连续的音频分片的分析处理结果，并发送至客户端。Step S302: Obtain the audio fragment of the target audio data at the server; if there is an incomplete audio frame in the obtained audio fragment, trim the audio fragment to remove the incomplete audio frame; Analyzing and processing the audio fragments to obtain the analysis and processing results of the audio fragments; merging the analysis and processing results of consecutive audio fragments in the audio data, and sending them to the client.

本申请实施例中，客户端接收到服务端返回的分析处理结果后，可以将该分析处理结果输出至显示界面中，可选的，可以根据该分析处理结果生成语音进行播报。In the embodiment of the present application, after receiving the analysis and processing result returned by the server, the client can output the analysis and processing result to the display interface. Optionally, it can generate voice according to the analysis and processing result for broadcasting.

本实施例中的服务端操作与实施例一、实施例二的相同，在此不再赘述。The operation of the server in this embodiment is the same as that in Embodiment 1 and Embodiment 2, and will not be repeated here.

实施例四：Embodiment four:

本实施例提供一种音频数据分析处理服务端装置，可以用于执行本申请实施例一、二的音频数据分析处理方法。对于本实施例中未披露的细节，请参照本申请的实施例一、二。This embodiment provides an audio data analysis and processing server device, which can be used to execute the audio data analysis and processing methods in Embodiments 1 and 2 of the present application. For details not disclosed in this embodiment, please refer to Embodiments 1 and 2 of this application.

请参阅图6，图6是本申请实施例公开的一种音频数据分析处理服务端装置的结构示意图。所述的音频数据分析处理服务端装置可以运行于服务器中。如图6所示，该服务端装置包括：Please refer to FIG. 6 . FIG. 6 is a schematic structural diagram of an audio data analysis and processing server device disclosed in an embodiment of the present application. The audio data analysis and processing server device can run in a server. As shown in Figure 6, the server device includes:

获取模块701，用于获取目标音频数据当前时刻的音频分片。The acquiring module 701 is configured to acquire audio fragments of the target audio data at the current moment.

裁剪模块702，用于若获取的所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧。The cropping module 702 is configured to, if there are incomplete audio frames in the acquired audio fragments, crop the audio fragments to remove the incomplete audio frames.

分析处理模块703，用于对裁剪后的所述音频完整分片进行分析处理，获得所述音频分片的分析处理结果。The analysis and processing module 703 is configured to analyze and process the clipped complete audio fragment, and obtain an analysis and processing result of the audio fragment.

结果合并模块704，用于合并在所述音频数据中连续的音频分片的分析处理结果。The result combining module 704 is configured to combine the analysis and processing results of continuous audio slices in the audio data.

在一个可选的实施例中，所述获取模块701，还用于获取位于所述音频分片的起始位置的起始音频帧，以及获取位于所述音频分片的末尾位置的末尾音频帧。In an optional embodiment, the acquiring module 701 is further configured to acquire a start audio frame located at the start position of the audio slice, and acquire an end audio frame located at the end position of the audio slice .

如图6所示，所述音频数据分析处理服务端装置还可包括：As shown in Figure 6, the audio data analysis and processing server device may also include:

确定模块705，用于若所述起始音频帧为不完整的音频帧和/或所述末尾音频帧为不完整的音频帧，确定所述音频分片存在不完整的音频帧。A determining module 705, configured to determine that there is an incomplete audio frame in the audio slice if the start audio frame is an incomplete audio frame and/or the end audio frame is an incomplete audio frame.

所述确定模块705，还用于若所述起始音频帧为完整的音频帧，且所述末尾音频帧为完整的音频帧，确定所述音频分片不存在不完整的音频帧。The determination module 705 is further configured to determine that there is no incomplete audio frame in the audio slice if the start audio frame is a complete audio frame and the end audio frame is a complete audio frame.

在一个可选的实施例中，所述结果合并模块704还用于若获取到的所述目标音频数据的历史分片中存在连续分片，对所述连续分片的分析处理结果进行合并。In an optional embodiment, the result merging module 704 is further configured to merge the analysis and processing results of the continuous segments if there are continuous segments in the acquired historical segments of the target audio data.

在一个可选的实施例中，如图6所示，所述音频数据分析处理服务端装置还包括：In an optional embodiment, as shown in FIG. 6, the audio data analysis and processing server device further includes:

音频合并模块706，用于若获取到的所述目标音频数据的历史分片中存在连续分片，对所述连续分片进行合并。The audio merging module 706 is configured to merge the continuous segments if there are continuous segments in the acquired historical segments of the target audio data.

所述音频合并模块706，还用于当接收到所述目标音频数据所有的音频分片后，合并所有的音频分片。The audio merging module 706 is further configured to merge all the audio fragments after receiving all the audio fragments of the target audio data.

存储模块707，用于存储合并后的音频分片。The storage module 707 is configured to store the merged audio fragments.

作为一种可选的实施方式，所述音频数据分析处理服务端装置还可以包括：As an optional implementation manner, the audio data analysis and processing server device may also include:

添加模块，用于若获取到的所述目标音频数据的历史分片中不存在连续分片，将所述音频分片添加至所述历史分片中。An adding module, configured to add the audio segment to the historical segment if there is no continuous segment in the acquired historical segment of the target audio data.

替换模块，用于在所述历史分片中将所述连续分片替换为合并后的连续分片。A replacement module, configured to replace the continuous shards in the historical shards with merged continuous shards.

作为一种可选的实施方式，如图6所示，所述音频数据分析处理服务端装置还可包括：As an optional implementation manner, as shown in FIG. 6, the audio data analysis and processing server device may also include:

信息收发模块708，用于若未接收到所述目标音频数据所有的音频分片，生成并发送接收成功信息。The information transceiving module 708 is configured to generate and send reception success information if all audio fragments of the target audio data have not been received.

作为一种可选的实施方式，所述信息收发模块708还用于若已接收到所述目标音频数据所有的音频分片，生成并发送等待分析处理结果信息。As an optional implementation manner, the information transceiving module 708 is further configured to generate and send waiting analysis and processing result information if all audio fragments of the target audio data have been received.

作为一种可选的实施方式，所述结果合并模块704还用于所述合并在所述音频数据中连续的音频分片的分析处理结果之后，当接收到所述目标音频数据所有的音频分片后，合并所有的音频分片的分析处理结果。As an optional implementation manner, the result merging module 704 is also used for merging the analysis and processing results of the continuous audio segments in the audio data, when all the audio segments of the target audio data are received After fragmentation, the analysis and processing results of all audio fragments are merged.

所述音频数据分析处理服务端装置还可以包括：The audio data analysis and processing server device may also include:

发送模块，用于发送合并后的所述分析处理结果。A sending module, configured to send the combined analysis and processing results.

实施例五：Embodiment five:

如图1所示，本实施例还提供一种音频数据分析处理系统，包括客户端1000以及服务端2000；As shown in Figure 1, this embodiment also provides an audio data analysis and processing system, including a client 1000 and a server 2000;

客户端1000用于采集目标音频数据；当检测到当前采集到的目标音频数据满足预设条件时，将当前采集到的目标音频数据截断为音频分片，将所述音频分片上传至服务端2000；The client 1000 is used to collect target audio data; when it is detected that the currently collected target audio data meets the preset conditions, the currently collected target audio data is truncated into audio fragments, and the audio fragments are uploaded to the server 2000;

其中，客户端1000可以采集用户的音频信息，将音频信息转换为音频数据，并可以将音频数据打包成音频分片进行数据上传等操作。Wherein, the client 1000 can collect audio information of the user, convert the audio information into audio data, and package the audio data into audio fragments for data uploading and other operations.

服务端2000用于获取目标音频数据的音频分片；若获取的所述音频分片存在不完整的音频帧，对所述音频分片进行裁剪，移除不完整的音频帧；对裁剪后的所述音频分片进行分析处理，获得所述音频分片分析处理结果；合并在所述音频数据中连续的音频分片的分析处理结果，并发送至客户端1000。The server 2000 is used to obtain the audio fragments of the target audio data; if there are incomplete audio frames in the obtained audio fragments, the audio fragments are trimmed to remove the incomplete audio frames; The audio fragments are analyzed and processed to obtain the analysis and processing results of the audio fragments; the analysis and processing results of consecutive audio fragments in the audio data are combined and sent to the client 1000 .

其中，客户端1000可以将从服务端2000中接收到的信息输出至显示界面显示给用户。Wherein, the client 1000 may output the information received from the server 2000 to a display interface for display to the user.

所述服务端2000与本申请实施例四所述的音频数据分析处理服务端装置相同，在此不再赘述。The server 2000 is the same as the audio data analysis and processing server device described in Embodiment 4 of the present application, and will not be repeated here.

实施例六：Embodiment six:

本实施例提供一种电子设备，可以用于执行本申请实施例一、二的音频数据分析处理方法的全部或部分步骤。对于本实施例中未披露的细节，请参照本申请的实施例一、二。This embodiment provides an electronic device that can be used to execute all or part of the steps of the audio data analysis and processing method in Embodiments 1 and 2 of the present application. For details not disclosed in this embodiment, please refer to Embodiments 1 and 2 of this application.

请参阅图7，图7为本申请实施例提供的电子设备的结构示意图。所述电子设备900可以但不限于是各种服务器、个人计算机、笔记本电脑、智能手机、平板电脑等设备的一个或多个的组合。Please refer to FIG. 7 . FIG. 7 is a schematic structural diagram of an electronic device provided by an embodiment of the present application. The electronic device 900 may be, but not limited to, a combination of one or more of various servers, personal computers, notebook computers, smart phones, tablet computers and other devices.

本申请在本申请较佳实施例中，所述电子设备900包括存储器901、至少一个处理器902、至少一条通信总线903及收发器904。In a preferred embodiment of the present application, the electronic device 900 includes a memory 901 , at least one processor 902 , at least one communication bus 903 and a transceiver 904 .

本领域技术人员应该了解，图7示出的电子设备的结构并不构成本申请实施例的限定，既可以是总线型结构，也可以是星形结构，所述电子设备900还可以包括比图示更多或更少的其他硬件或者软件，或者不同的部件布置。Those skilled in the art should understand that the structure of the electronic device shown in Figure 7 does not constitute a limitation of the embodiment of the present application, it can be either a bus structure or a star structure, and the electronic device 900 can also include a ratio diagram more or less other hardware or software, or a different arrangement of components.

在一些实施例中，所述电子设备900是一种能够按照事先设定或存储的指令，自动进行数值计算和/或信息处理的设备，其硬件包括但不限于微处理器、专用集成电路、可编程门阵列、数字处理器及嵌入式设备等。所述电子设备900还可包括客户设备，所述客户设备包括但不限于任何一种可与客户通过键盘、鼠标、遥控器、触摸板或声控设备等方式进行人机交互的电子产品，例如，个人计算机、平板电脑、智能手机、数码相机等。In some embodiments, the electronic device 900 is a device capable of automatically performing numerical calculations and/or information processing according to preset or stored instructions, and its hardware includes but not limited to microprocessors, application-specific integrated circuits, Programmable gate arrays, digital processors and embedded devices, etc. The electronic device 900 may also include a client device, which includes but is not limited to any electronic product that can interact with the client through a keyboard, mouse, remote control, touch pad, or voice-activated device, for example, Personal computers, tablets, smartphones, digital cameras, etc.

需要说明的是，所述电子设备900仅为举例，其他现有的或今后可能出现的电子产品如可适应于本申请，也应包含在本申请的保护范围以内，并以引用方式包含于此。It should be noted that the electronic device 900 is only an example, and other existing or future electronic products that can be adapted to this application should also be included in the scope of protection of this application, and are included here by reference .

在一些实施例中，所述存储器901中存储有计算机程序，所述计算机程序被所述至少一个处理器902执行时实现如所述实施例一、二的音频数据分析处理方法中的全部或者部分步骤。所述存储器901包括只读存储器(Read-Only Memory，ROM)、可编程只读存储器(Programmable Read-Only Memory，PROM)、可擦除可编程只读存储器(ErasableProgrammable Read-Only Memory，EPROM)、一次可编程只读存储器(One-timeProgrammable Read-Only Memory，OTPROM)、电子擦除式可复写只读存储器(Electrically-Erasable Programmable Read-Only Memory，EEPROM)、只读光盘(CompactDisc Read-Only Memory，CD-ROM)或其他光盘存储器、磁盘存储器、磁带存储器、或者能够用于携带或存储数据的计算机可读的任何其他介质。In some embodiments, a computer program is stored in the memory 901, and when the computer program is executed by the at least one processor 902, all or part of the audio data analysis and processing methods in the first and second embodiments are implemented. step. The memory 901 includes a read-only memory (Read-Only Memory, ROM), a programmable read-only memory (Programmable Read-Only Memory, PROM), an erasable programmable read-only memory (Erasable Programmable Read-Only Memory, EPROM), One-time Programmable Read-Only Memory (OTPROM), Electronically Erasable Programmable Read-Only Memory (EEPROM), Compact Disc Read-Only Memory, CD-ROM) or other optical disk storage, magnetic disk storage, magnetic tape storage, or any other computer-readable medium that can be used to carry or store data.

在一些实施例中，所述至少一个处理器902是所述电子设备900的控制核心(Control Unit)，利用各种接口和线路连接整个电子设备900的各个部件，通过运行或执行存储在所述存储器901内的程序或者模块，以及调用存储在所述存储器901内的数据，以执行电子设备900的各种功能和处理数据。例如，所述至少一个处理器902执行所述存储器中存储的计算机程序时实现本申请实施例中所述的音频数据分析处理方法的全部或者部分步骤；或者实现音频数据分析处理服务端装置的全部或者部分功能。所述至少一个处理器902可以由集成电路组成，例如可以由单个封装的集成电路所组成，也可以是由多个相同功能或不同功能封装的集成电路所组成，包括一个或者多个中央处理器(CentralProcessingunit，CPU)、微处理器、数字处理芯片、图形处理器及各种控制芯片的组合等。In some embodiments, the at least one processor 902 is the control core (Control Unit) of the electronic device 900, using various interfaces and lines to connect the various components of the entire electronic device 900, by running or executing the Programs or modules in the memory 901 and call data stored in the memory 901 to execute various functions of the electronic device 900 and process data. For example, when the at least one processor 902 executes the computer program stored in the memory, it realizes all or part of the steps of the audio data analysis and processing method described in the embodiment of the present application; or realizes all the steps of the audio data analysis and processing server device. or partial functions. The at least one processor 902 may be composed of an integrated circuit, for example, may be composed of a single packaged integrated circuit, or may be composed of multiple integrated circuits with the same function or different functions, including one or more central processing units (Central Processing unit, CPU), microprocessor, digital processing chip, graphics processor and a combination of various control chips, etc.

在一些实施例中，所述至少一条通信总线903被设置为实现所述存储器901以及所述至少一个处理器902等之间的连接通信。In some embodiments, the at least one communication bus 903 is configured to implement communication between the memory 901 and the at least one processor 902 and so on.

所述电子设备900还可以包括多种传感器、蓝牙模块、Wi-Fi模块等，在此不再赘述。The electronic device 900 may also include various sensors, a Bluetooth module, a Wi-Fi module, etc., which will not be repeated here.

实施例七：Embodiment seven:

本实施例提供一种计算机可读存储介质，其上储存有计算机程序，所述指令适于由处理器加载并执行本申请实施例一、二的音频数据分析处理方法，具体执行过程可以参见实施例一、实施例二所示实施例的具体说明，在此不进行赘述。This embodiment provides a computer-readable storage medium, on which a computer program is stored, and the instructions are suitable for being loaded by a processor and executing the audio data analysis and processing methods of Embodiments 1 and 2 of the present application. For the specific execution process, please refer to the implementation The specific descriptions of the embodiments shown in Example 1 and Embodiment 2 will not be repeated here.

对于设备实施例而言，由于其基本对应于方法实施例，所以相关之处参见方法实施例的部分说明即可。以上所描述的设备实施例仅仅是示意性的，其中所述作为分离部件说明的组件可以是或者也可以不是物理上分开的，作为单元显示的部件可以是或者也可以不是物理单元，即可以位于一个地方，或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本申请方案的目的。本领域普通技术人员在不付出创造性劳动的情况下，即可以理解并实施。As for the device embodiment, since it basically corresponds to the method embodiment, for related parts, please refer to part of the description of the method embodiment. The device embodiments described above are only illustrative, and the components described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in One place, or it can be distributed to multiple network elements. Part or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this application. It can be understood and implemented by those skilled in the art without creative effort.

本领域内的技术人员应明白，本申请的实施例可提供为方法、系统、或计算机程序产品。因此，本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且，本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the present application may be provided as methods, systems, or computer program products. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.

本申请是参照根据本申请实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器，使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present application is described with reference to flowcharts and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the present application. It should be understood that each procedure and/or block in the flowchart and/or block diagram, and a combination of procedures and/or blocks in the flowchart and/or block diagram can be realized by computer program instructions. These computer program instructions may be provided to a general purpose computer, special purpose computer, embedded processor, or processor of other programmable data processing equipment to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing equipment produce a An apparatus for realizing the functions specified in one or more procedures of the flowchart and/or one or more blocks of the block diagram.

以上仅为本申请的实施例而已，并不用于限制本申请。对于本领域技术人员来说，本申请可以有各种更改和变化。凡在本申请的精神和原理之内所作的任何修改、等同替换、改进等，均应包含在本申请的权利要求范围之内。The above are only examples of the present application, and are not intended to limit the present application. For those skilled in the art, various modifications and changes may occur in this application. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application shall be included within the scope of the claims of the present application.

Claims

1. An audio data analysis processing method, characterized in that the audio data analysis processing method comprises:

acquiring an audio fragment of the current moment of the target audio data;

if the obtained audio fragments have incomplete audio frames, cutting the audio fragments, and removing the incomplete audio frames;

analyzing and processing the cut audio fragments to obtain analysis and processing results of the audio fragments;

and merging analysis processing results of the continuous audio fragments in the audio data.

2. The audio data analysis processing method according to claim 1, wherein after the audio fragment of the current time of the target audio data is acquired, the audio data analysis processing method further comprises:

acquiring a starting audio frame positioned at the starting position of the audio fragment, and acquiring an end audio frame positioned at the end position of the audio fragment;

if the initial audio frame is a complete audio frame and the end audio frame is a complete audio frame, determining that the audio fragment does not have an incomplete audio frame;

or alternatively, the first and second heat exchangers may be,

if the initial audio frame is an incomplete audio frame and/or the end audio frame is an incomplete audio frame, determining that the audio fragment has an incomplete audio frame.

3. The audio data analysis processing method according to claim 1, wherein the merging of analysis processing results of consecutive audio slices in the audio data includes:

if continuous fragments exist in the historical fragments of the target audio data, merging analysis processing results of the continuous fragments, wherein the continuous fragments are the historical fragments with adjacent preset fragment serial numbers.

4. The audio data analysis processing method according to any one of claims 1 to 3, wherein after the acquisition of the audio slice of the current time of the target audio data, the audio data analysis processing method further comprises:

if continuous fragments exist in the historical fragments of the target audio data, merging the continuous fragments, wherein the continuous fragments are the historical fragments with adjacent preset fragment serial numbers;

when all the audio fragments of the target audio data are received, combining all the audio fragments,

and storing the combined audio fragments.

5. The audio data analysis processing method according to claim 4, characterized in that the audio data analysis processing method further comprises:

and if no continuous fragments exist in the obtained historical fragments of the target audio data, adding the audio fragments into the historical fragments.

6. The audio data analysis processing method according to claim 4, wherein after the merging of the continuous slices, the audio data analysis processing method further comprises:

and replacing the continuous fragments in the history fragments with the combined continuous fragments.

7. The audio data analysis processing method according to any one of claims 1 to 3, wherein after the acquisition of the audio slice of the current time of the target audio data, the audio data analysis processing method further comprises:

if all the audio fragments of the target audio data are not received, generating and sending successful receiving information; or alternatively, the first and second heat exchangers may be,

and if all the audio fragments of the target audio data are received, generating and transmitting information of the analysis waiting processing result.

8. The audio data analysis processing method according to any one of claims 1 to 3, characterized in that the audio data analysis processing method further comprises, after the merging of the analysis processing results of the audio fragments that are continuous in the audio data:

after receiving all the audio fragments of the target audio data, merging analysis processing results of all the audio fragments;

and sending the combined analysis and processing result.

9. An audio data analysis processing method, characterized in that the audio data analysis processing method comprises:

collecting target audio data at a client; when the current collected target audio data is detected to meet the preset condition, cutting the current collected target audio data into audio fragments, and uploading the audio fragments to a server;

acquiring an audio fragment of target audio data at a server; if the obtained audio fragments have incomplete audio frames, cutting the audio fragments, and removing the incomplete audio frames; analyzing and processing the cut audio fragments to obtain an analysis and processing result of the audio fragments; and merging analysis processing results of the continuous audio fragments in the audio data, and sending the analysis processing results to the client.

10. An audio data analysis processing server device, comprising:

the acquisition module is used for acquiring the audio fragments of the target audio data at the current moment;

the clipping module is used for clipping the audio fragments and removing the incomplete audio frames if the obtained audio fragments have the incomplete audio frames;

the analysis processing module is used for analyzing and processing the cut complete audio fragments to obtain analysis processing results of the audio fragments;

and the result merging module is used for merging analysis processing results of the continuous audio fragments in the audio data.

11. The audio data analysis processing system comprises a client and a server, and is characterized in that the client is used for collecting target audio data; when the current collected target audio data is detected to meet the preset condition, cutting off the current collected target audio data into audio fragments, and uploading the audio fragments to a server;

the server side is used for acquiring the audio fragments of the target audio data; if the obtained audio fragments have incomplete audio frames, cutting the audio fragments, and removing the incomplete audio frames; analyzing and processing the cut audio fragments to obtain an analysis and processing result of the audio fragments; and merging analysis processing results of the continuous audio fragments in the audio data, and sending the analysis processing results to the client.

12. An electronic device, comprising: a processor and a memory; wherein the memory stores a computer program adapted to be loaded by the processor and to perform the audio data analysis processing method according to any one of claims 1 to 9.

13. A computer-readable storage medium, on which a computer program is stored, characterized in that the computer program, when executed by a processor, implements the audio data analysis processing method according to any one of claims 1 to 9.