CN116521925A

CN116521925A - Video recording, playing, retrieving and playback method and device, electronic equipment and medium

Info

Publication number: CN116521925A
Application number: CN202310489598.4A
Authority: CN
Inventors: 方斌; 段克; 马起礼
Original assignee: Beijing Shitong Science And Technology Co ltd
Current assignee: Beijing Shitong Science And Technology Co ltd
Priority date: 2023-05-04
Filing date: 2023-05-04
Publication date: 2023-08-01

Abstract

The present invention relates to the field of video playback control technologies, and in particular, to a video recording and playing retrieval playback method, device, electronic equipment, and medium. The method comprises the following steps: processing the original video based on the acquired first playback instruction of the user and the called preset processing rule, and determining a video clip based on the processing result, wherein the first playback instruction of the user is an instruction which needs to be played back by the user; generating prompt information based on the video clips, wherein the prompt information is information for informing a user whether the video clips need to be screened again; and if a second playback instruction fed back by the user based on the prompt information is acquired, screening the video clips based on the second playback instruction, determining a target video clip and feeding back and displaying the target video clip. The method and the device can ensure that the target video clip wanted by the user is positioned quickly and accurately, and improve the watching experience of the user.

Description

A method, device, electronic equipment and medium for video recording and playback retrieval and playback

技术领域technical field

本申请涉及数据查询技术的领域，尤其是涉及一种视频录播检索回放方法、装置、电子设备及介质。The present application relates to the field of data query technology, in particular to a method, device, electronic device and medium for video recording and broadcasting retrieval and playback.

背景技术Background technique

随着网络技术的发展，视频已经成为了人们学习或娱乐的一个重要方式，人们足不出户就可以通过视频观看自己感兴趣的内容，且对于一些现场录制的视频，人们也可以通过视频回放来了解现场的状况。With the development of network technology, video has become an important way for people to learn or entertain. People can watch the content they are interested in through video without leaving home, and for some live recorded videos, people can also play them back through video. Come to know what's going on.

传统的视频回放技术有两种方式，当用户想要对视频中部分片段重复观看时，方式一为从头开始进行倍速观看，播放到目标视频片段再调成正常倍速，方式二为通过凭借自身对于视频的记忆拖动进度条，直到找到目标视频片段。There are two methods of traditional video playback technology. When the user wants to watch part of the video repeatedly, the first method is to watch at double speed from the beginning, and then adjust to the normal double speed when the target video clip is played. Video Memory Drag the progress bar until the target video segment is found.

传统的视频回放技术，不停拖动进度条确认目标视频片段的操作太过繁琐，且在短时间内无法精准定位到目标视频片段，导致用户的观看体验下降。With traditional video playback technology, the operation of continuously dragging the progress bar to confirm the target video segment is too cumbersome, and the target video segment cannot be accurately located in a short period of time, resulting in a decline in the user's viewing experience.

发明内容Contents of the invention

为了提高用户的观看体验，本申请提供一种视频录播检索回放方法、装置、电子设备及介质。In order to improve the user's viewing experience, the present application provides a method, device, electronic device and medium for retrieving and playing back video recording and broadcasting.

第一方面，本申请提供一种视频录播检索回放方法，采用如下的技术方案：In the first aspect, the present application provides a method for retrieving and playing back video recording and broadcasting, which adopts the following technical solution:

一种视频录播检索回放方法，包括：A method for retrieving and playing back video recording and playback, comprising:

基于获取的用户第一回放指令以及调取的预设处理规则，对原视频进行处理，基于处理结果，确定视频片段；Processing the original video based on the acquired user's first playback instruction and the recalled preset processing rules, and determining the video segment based on the processing result;

所述用户第一回放指令为用户需要进行回放的指令；The user's first playback instruction is an instruction that the user needs to play back;

基于所述视频片段生成提示信息；generating prompt information based on the video segment;

所述提示信息为告知用户是否需要对所述视频片段进行再一步筛选的信息；The prompt information is information to inform the user whether to further screen the video segment;

若获取到用户基于所述提示信息反馈的第二回放指令，则基于所述第二回放指令对所述视频片段进行筛选处理，确定目标视频片段并反馈显示。If the second playback instruction fed back by the user based on the prompt information is acquired, the video segment is screened based on the second playback instruction, and a target video segment is determined and displayed as feedback.

通过采用上述技术方案，首先获取用户需要进行回放的第一回放指令，并对原视频按照预设处理规则进行视频处理，确定视频片段，之后生成告知用户是否需要对视频片段再一步检索的提示信息，如用户选择需要，则获取用户的第二回放指令，并基于第二回放指令对视频片段进行对应的筛选处理，最终确定用户所需要的目标视频片段；从而能够快速准确的定位到用户想要的目标视频片段，提高用户的观看体验。By adopting the above technical solution, first obtain the first playback command that the user needs to play back, and perform video processing on the original video according to the preset processing rules, determine the video segment, and then generate prompt information informing the user whether to further retrieve the video segment , if the user chooses to, obtain the user's second playback instruction, and perform corresponding screening processing on the video segment based on the second playback instruction, and finally determine the target video segment required by the user; thus, it is possible to quickly and accurately locate the user's desired video clip. target video clips to improve the user's viewing experience.

在一种可能的实现方式中，所述基于获取的用户第一回放指令以及调取的预设处理规则，对原视频进行处理，包括：In a possible implementation manner, the processing of the original video based on the acquired user's first playback instruction and the recalled preset processing rules includes:

基于所述用户第一回放指令以及所述预设处理规则，确定所述原视频中的人物特征；Based on the user's first playback instruction and the preset processing rules, determine the characteristics of the characters in the original video;

确定所述人物特征对应的出现时间点和消失时间点；Determining the appearance time point and disappearance time point corresponding to the character features;

基于所述出现时间点以及所述消失时间点，确定所述人物特征对应的人物视频片段，并将所述人物视频片段设定为第一视频片段。Based on the appearance time point and the disappearance time point, determine a character video segment corresponding to the character feature, and set the character video segment as a first video segment.

通过采用上述技术方案，以用户第一回放指令以及预设处理规则作为前提条件，确定原视频中的所有人物特征，之后以出现的人物特征为基准，确定所有人物特征对应的出现时间点和消失时间点，即，每一个出现时间点和对应的消失时间点形成一个视频片段，最终确定所有的人物特征对应的视频片段，并设定为第一视频片段；为用户提供以人物为主要检索目标的检索方式。By adopting the above technical solution, with the user's first playback instruction and the preset processing rules as the preconditions, all character features in the original video are determined, and then the appearing time point and disappearance of all character features are determined based on the character features that appear Time point, that is, each appearance time point and the corresponding disappearance time point form a video clip, and finally determine the video clips corresponding to all character features, and set them as the first video clip; provide users with characters as the main retrieval target search method.

基于所述用户第一回放指令以及所述预设处理规则，确定所述原视频中的语音特征；Based on the user's first playback instruction and the preset processing rules, determine the voice features in the original video;

确定所述语音特征对应的出现时间点和消失时间点；Determining the appearance time point and disappearance time point corresponding to the voice feature;

基于所述出现时间点以及所述消失时间点，确定所述语音特征对应的语音视频片段，并将所述语音视频片段设定为第二视频片段。Based on the appearance time point and the disappearance time point, determine a voice video segment corresponding to the voice feature, and set the voice video segment as a second video segment.

通过采用上述技术方案，以用户第一回放指令以及预设处理规则作为前提条件，确定原视频中的所有语音特征，之后以预设语音停顿时间为基准，来确定所有语音特征对应的出现时间点和消失时间点，即，每一个出现时间点和对应的消失时间点形成一个视频片段，最终确定所有语音特征对应的视频片段，并设定为第二视频片段；为用户提供以语音为主要检索目标的检索方式。By adopting the above technical solution, taking the user's first playback instruction and preset processing rules as prerequisites, determine all voice features in the original video, and then use the preset voice pause time as a benchmark to determine the corresponding occurrence time points of all voice features and disappearing time points, that is, each appearing time point and corresponding disappearing time point form a video clip, finally determine the video clip corresponding to all voice features, and set it as the second video clip; provide users with voice-based retrieval How to retrieve the target.

基于所述用户第一回放指令以及所述预设处理规则，确定所述原视频中的人物特征和语音特征；Determining character features and voice features in the original video based on the user's first playback instruction and the preset processing rules;

确定所述人物特征和所述语音特征同时存在的时间节点；Determining the time node at which the character features and the voice features exist at the same time;

基于所述时间节点，确定人物语音视频片段，并将所述人物语音视频片段设定为第三视频片段。Based on the time node, determine the voice video segment of the character, and set the voice video segment of the character as the third video segment.

通过采用上述技术方案，以用户第一回放指令以及预设处理规则作为前提条件，确定原视频中的所有人物特征和语音特征，之后以出现的人物特征为基准，确定所有人物特征和语音特征同时存在的时间节点，即，每一个时间节点对应一个视频片段，最终确定所有人物特征和语音特征同时对应的视频片段，并设定为第三视频片段；为用户提供以人物和语音为主要检索目标的检索方式。By adopting the above-mentioned technical solution, taking the user's first playback instruction and preset processing rules as prerequisites, determine all character features and voice features in the original video, and then determine all character features and voice features based on the character features that appear at the same time Existing time nodes, that is, each time node corresponds to a video segment, and finally determine the video segments corresponding to all character features and voice features at the same time, and set it as the third video segment; provide users with characters and voice as the main retrieval target search method.

在一种可能的实现方式中，所述目标视频片段包含至少一个子目标视频片段，所述若获取到用户基于提示信息反馈的第二回放指令，则基于所述用户第二回放指令对所述视频片段进行筛选处理，确定目标视频片段并反馈显示，包括：In a possible implementation manner, the target video segment includes at least one sub-target video segment, and if a second playback instruction fed back by the user based on the prompt information is obtained, the user's second playback instruction is based on the user's second playback instruction. The video clips are screened, and the target video clips are determined and displayed as feedback, including:

基于所述第二回放指令，生成搜索框并反馈显示；Based on the second playback instruction, generate a search box and display it in feedback;

获取用户在所述搜索框输入的关键词特征；Obtaining the keyword features input by the user in the search box;

基于所述关键词特征，对所述视频片段进行筛选处理，确定目标视频片段。Based on the keyword feature, the video segment is screened to determine a target video segment.

通过采用上述技术方案，获取到用户需要对视频片段进一步筛选的第二回放指令，自动生成搜索框并反馈显示，在接收到用户在搜索框中输入与视频片段对应的关键词特征后，将关键词特征在视频片段中进行筛选处理，最终确定目标视频片段；从而更精确的定位用户所需的目标视频片段。By adopting the above technical solution, the second playback instruction that the user needs to further screen the video clips is obtained, and a search box is automatically generated and displayed as a feedback. After receiving the keyword feature corresponding to the video clip input by the user in the search box, the key The word features are screened in the video clips to finally determine the target video clip; thereby more accurately locating the target video clip required by the user.

在一种可能的实现方式中，所述所述关键词特征为人物文本特征和/或语音文本特征，所述获取用户在所述搜索框输入的关键词特征，之后还包括：In a possible implementation manner, the keyword features are character text features and/or speech text features, and the acquisition of the keyword features input by the user in the search box further includes:

若用户输入人物文本特征，提取所述视频片段中的所述人物特征，并以文字的形式表示，确定所有的所述人物文本特征；If the user inputs character text features, extract the character features in the video clip, and express in the form of text, and determine all the character text features;

若用户输入语音文本特征，将所述视频片段中的语音特征进行文本转换，确定所有的所述语音文本特征；If the user inputs the voice-text feature, the voice feature in the video segment is converted into text to determine all the voice-text features;

若用户输入人物文本特征和语音文本特征，提取所述视频片段中以文字形式表示的所有的所述人物文本特征，提取所述视频片段中所有的所述语音文本特征。If the user inputs character text features and speech text features, extract all the character text features expressed in text form in the video clip, and extract all the speech text features in the video clip.

通过采用上述技术方案，若用户输入人物文本特征，则提取视频片段中的所有人物特征，以文字的形式表示人物特征，若用户输入语音文本特征，则提取视频片段中的所有语音特征并转化成语音文本，若用户输入人物文本特征和语音文本特征，则同时提取视频片段中的人物文本特征和语音文本特征；从而提供不同组合的方式让用户检索到自己所需的目标视频片段。By adopting the above-mentioned technical scheme, if the user inputs character text features, then extract all the character features in the video clip, and represent the character features in the form of text; if the user inputs voice text features, then extract all the voice features in the video clip and convert them into Speech text, if the user inputs character text features and voice text features, the character text features and voice text features in the video clip are extracted at the same time; thus different combinations are provided to allow users to retrieve the target video clips they need.

在一种可能的实现方式中，所述目标视频片段包含至少一个子目标视频片段；所述若获取到用户基于提示信息反馈的第二回放指令，则基于所述第二回放指令对所述视频片段进行筛选处理，确定目标视频片段并反馈显示，包括：In a possible implementation manner, the target video segment includes at least one sub-target video segment; and if the second playback instruction fed back by the user based on the prompt information is obtained, the video is played based on the second playback instruction. The fragments are screened, the target video fragments are determined and displayed as feedback, including:

生成所述子目标视频片段对应的时间起始点；Generate the time starting point corresponding to the sub-target video segment;

若所述子目标视频片段的数量为预设数量，则直接跳转至所述子目标视频片段对应的时间起始点进行播放；If the quantity of the sub-target video segment is a preset number, then directly jump to the corresponding time starting point of the sub-target video segment to play;

若所述子目标视频片段的数量大于预设数量，则优先跳转至时间起始点最早的子目标视频片段对应的时间起始点进行播放，并将所述目标视频片段包含的其余子目标视频片段对应的时间起始点进行高亮处理。If the quantity of the sub-target video clips is greater than the preset number, then preferentially jump to the corresponding time start point of the earliest sub-target video clip of the time start point to play, and the remaining sub-target video clips included in the target video clip The corresponding time starting point is highlighted.

通过采用上述技术方案，电子设备筛选处理完成后，基于确定的所有子目标视频片段的出现时间点，生成时间起始点，之后判断子目标视频片段的数量，若子目标视频片段数量为预设数量，则表示筛选后的目标视频片段是唯一的，则电子设备直接定位到该目标视频片段的时间起始点开始播放；若子目标视频片段数量超过预设数量，则表示该目标视频片段存在多个符合条件的子目标视频片段，则根据时间起始点的前后排序，电子设备定位到最先出现的时间起始点开始播放，其余的时间起始点作高亮处理，从而帮助用户可以清楚的看到目标视频片段在原视频的进度条上的位置。By adopting the above-mentioned technical solution, after the screening process of the electronic device is completed, the time starting point is generated based on the determined occurrence time points of all sub-target video segments, and then the number of sub-target video segments is judged. If the number of sub-target video segments is a preset number, It means that the filtered target video clip is unique, and the electronic device directly locates the time starting point of the target video clip to start playing; if the number of sub-target video clips exceeds the preset number, it means that there are multiple eligible video clips in the target video clip The sub-target video clips are sorted according to the sequence of time start points, the electronic device locates the first time start point to start playing, and the rest of the time start points are highlighted, so that users can clearly see the target video clips The position on the progress bar of the original video.

第二方面，本申请提供一种视频录播检索回放装置，采用如下的技术方案：In the second aspect, the present application provides a video recording and broadcasting retrieval and playback device, which adopts the following technical solution:

一种视频录播检索回放装置，包括：视频片段确定模块、提示模块以及目标视频片段确定模块，其中，A device for retrieving and replaying video recording and broadcasting, comprising: a video segment determination module, a prompt module and a target video segment determination module, wherein,

视频片段确定模块，用于基于获取的用户第一回放指令以及调取的预设处理规则，对原视频进行处理，基于处理结果，确定视频片段；A video segment determination module, configured to process the original video based on the acquired user's first playback instruction and the recalled preset processing rules, and determine the video segment based on the processing result;

提示模块，用于基于所述视频片段生成提示信息；A prompt module, configured to generate prompt information based on the video clip;

目标视频片段确定模块，用于若获取到用户基于所述提示信息反馈的第二回放指令，则基于所述第二回放指令对所述视频片段进行筛选处理，确定目标视频片段并反馈显示。The target video segment determination module is configured to, if the second playback instruction fed back by the user based on the prompt information is obtained, perform screening processing on the video segment based on the second playback instruction, determine the target video segment and display it in feedback.

通过采用上述技术方案，首先视频片段确定模块获取用户需要进行回放的第一回放指令，并将原视频按照预设处理规则进行视频处理，确定视频片段，之后提示模块生成告知用户是否需要对视频片段再一步检索的提示信息，如用户选择需要，则目标视频片段确定模块获取用户的第二回放指令，并基于第二回放指令对视频片段进行对应的筛选处理，最终确定用户所需要的目标视频片段；从而能够快速准确的定位到用户想要的目标视频片段，提高用户的观看体验。By adopting the above-mentioned technical solution, firstly, the video clip determination module acquires the first playback instruction that the user needs to play back, and performs video processing on the original video according to the preset processing rules to determine the video clip, and then the prompt module generates and informs the user whether the video clip needs to be edited. The prompt information retrieved in another step, if the user selects the need, then the target video segment determination module obtains the user's second playback instruction, and based on the second playback instruction, the video segment is subjected to corresponding screening processing, and finally determines the target video segment required by the user ; so that the target video segment desired by the user can be quickly and accurately located, and the viewing experience of the user can be improved.

在一种可能的实现方式中，所述视频片段确定模块包括：人物确定单元、人物时间确定单元以及第一视频片段单元，其中，In a possible implementation manner, the video segment determination module includes: a character determination unit, a character time determination unit, and a first video segment unit, wherein,

人物确定单元，用于基于所述用户第一回放指令以及所述预设处理规则，确定所述原视频中的人物特征；A character determination unit, configured to determine the character characteristics in the original video based on the user's first playback instruction and the preset processing rules;

人物时间确定单元，用于确定所述人物特征对应的出现时间点和消失时间点；A character time determination unit, configured to determine the appearance time point and disappearance time point corresponding to the character features;

第一视频片段单元，用于基于所述出现时间点以及所述消失时间点，确定所述人物特征对应的人物视频片段，并将所述人物视频片段设定为第一视频片段。The first video segment unit is configured to determine a character video segment corresponding to the character feature based on the appearance time point and the disappearance time point, and set the character video segment as the first video segment.

在一种可能的实现方式中，所述视频片段确定模块包括：语音确定单元、语音时间确定单元以及第二视频片段单元，其中，In a possible implementation manner, the video segment determination module includes: a voice determination unit, a voice time determination unit, and a second video segment unit, wherein,

语音确定单元，用于基于所述用户第一回放指令以及所述预设处理规则，确定所述原视频中的语音特征；A voice determination unit, configured to determine voice features in the original video based on the user's first playback instruction and the preset processing rules;

语音时间确定单元，用于确定所述语音特征对应的出现时间点和消失时间点；A voice time determination unit, configured to determine the appearance time point and disappearance time point corresponding to the voice feature;

第二视频片段单元，用于基于所述出现时间点以及所述消失时间点，确定所述语音特征对应的语音视频片段，并将所述语音视频片段设定为第二视频片段。The second video segment unit is configured to determine a voice video segment corresponding to the voice feature based on the appearance time point and the disappearance time point, and set the voice video segment as a second video segment.

在一种可能的实现方式中，所述视频片段确定模块包括：人物语音确定单元、时间节点确定单元以及第三视频片段单元，其中，In a possible implementation manner, the video segment determination module includes: a character voice determination unit, a time node determination unit, and a third video segment unit, wherein,

人物语音确定单元，用于基于所述用户第一回放指令以及所述预设处理规则，确定所述原视频中的人物特征和语音特征；A character voice determining unit, configured to determine character features and voice features in the original video based on the user's first playback instruction and the preset processing rules;

时间节点确定单元，用于确定所述人物特征和所述语音特征同时存在的时间节点；A time node determination unit, configured to determine a time node at which the character features and the voice features coexist;

第三视频片段单元，用于基于所述时间节点，确定人物语音视频片段，并将所述人物语音视频片段设定为第三视频片段。The third video segment unit is configured to determine a character voice video segment based on the time node, and set the character voice video segment as a third video segment.

在一种可能的实现方式中，所述目标视频片段确定模块包括：搜索框单元、输入单元以及筛选单元，其中，In a possible implementation manner, the target video segment determination module includes: a search box unit, an input unit, and a screening unit, wherein,

搜索框单元，用于基于所述第二回放指令，生成搜索框并反馈显示；A search box unit, configured to generate a search box based on the second playback instruction and display it in feedback;

输入单元，用于获取用户在所述搜索框输入的关键词特征；an input unit, configured to acquire keyword features input by the user in the search box;

筛选单元，用于基于所述关键词特征，对所述视频片段进行筛选处理，确定目标视频片段。The screening unit is configured to perform screening processing on the video clips based on the keyword feature to determine a target video clip.

在一种可能的实现方式中，所述视频录播检索回放装置，还包括：人物文本模块、语音文本模块以及人物和语音文本模块，其中，In a possible implementation manner, the device for retrieving and playing back video recording and broadcasting further includes: a character text module, a voice text module, and a character and voice text module, wherein,

人物文本模块，用于若用户输入人物文本特征，提取所述视频片段中的所述人物特征，并以文字的形式表示，确定所有的所述人物文本特征；The character text module is used to extract the character features in the video clip if the user inputs the character text features, and express in the form of text, and determine all the character text features;

语音文本模块，用于若用户输入语音文本特征，将所述视频片段中的语音特征进行文本转换，确定所有的所述语音文本特征；The voice-to-text module is used to convert the voice features in the video clip to text if the user inputs the voice-to-text features to determine all the voice-to-text features;

人物和语音文本模块，用于若用户输入人物文本特征和语音文本特征，提取所述视频片段中以文字形式表示的所有的所述人物文本特征，提取所述视频片段中所有的所述语音文本特征。Character and voice text module, for if the user inputs character text features and voice text features, extract all the character text features expressed in text form in the video clip, extract all the voice text in the video clip feature.

在一种可能的实现方式中，所述目标视频片段确定模块包括：时间起始点单元、第一跳转单元以及第二跳转单元，其中，In a possible implementation manner, the target video clip determination module includes: a time starting point unit, a first jump unit, and a second jump unit, wherein,

时间起始点单元，用于生成所述子目标视频片段对应的时间起始点；A time starting point unit, configured to generate a time starting point corresponding to the sub-target video segment;

第一跳转单元，用于若所述子目标视频片段的数量为预设数量，则直接跳转至所述子目标视频片段对应的时间起始点进行播放；The first jump unit is used to directly jump to the corresponding time starting point of the sub-target video segment to play if the number of the sub-target video segments is a preset number;

第二跳转单元，用于若所述子目标视频片段的数量大于预设数量，则优先跳转至时间起始点最早的子目标视频片段对应的时间起始点进行播放，并将所述目标视频片段包含的其余子目标视频片段对应的时间起始点进行高亮处理。The second jump unit is used for if the quantity of the sub-target video clip is greater than the preset number, then preferentially jump to the time start point corresponding to the earliest sub-target video segment of the time start point to play, and the target video The time start points corresponding to the remaining sub-target video clips included in the clip are highlighted.

第三方面，本申请提供一种电子设备，采用如下的技术方案：In a third aspect, the present application provides an electronic device, which adopts the following technical solution:

一种电子设备，该电子设备包括：An electronic device comprising:

至少一个处理器；at least one processor;

存储器；memory;

至少一个应用程序，其中至少一个应用程序被存储在存储器中并被配置为由至少一个处理器执行，所述至少一个应用程序配置用于：执行上述视频录播检索回放方法。At least one application program, wherein at least one application program is stored in a memory and is configured to be executed by at least one processor, and the at least one application program is configured to: execute the above video recording and playback retrieval and playback method.

第四方面，本申请提供一种计算机可读存储介质，采用如下的技术方案：In the fourth aspect, the present application provides a computer-readable storage medium, adopting the following technical solution:

一种计算机可读存储介质，包括：存储有能够被处理器加载并执行上述视频录播检索回放方法的计算机程序。A computer-readable storage medium, comprising: storing a computer program capable of being loaded by a processor and executing the above-mentioned method for retrieving and playing back video recording and broadcasting.

综上所述，本申请包括以下有益技术效果：In summary, the application includes the following beneficial technical effects:

首先获取用户需要进行回放的第一回放指令，并将原视频按照预设处理规则进行视频处理，确定视频片段，之后生成告知用户是否需要对视频片段再一步检索的提示信息，如用户选择需要，则获取用户的第二回放指令，并基于第二回放指令对视频片段进行对应的筛选处理，最终确定用户所需要的目标视频片段；从而能够快速准确的定位到用户想要的目标视频片段，提高用户的观看体验。First obtain the first playback command that the user needs to play back, and process the original video according to the preset processing rules to determine the video segment, and then generate a prompt message informing the user whether to further retrieve the video segment. If the user chooses to, Then obtain the user's second playback instruction, and based on the second playback instruction, perform corresponding screening processing on the video segment, and finally determine the target video segment required by the user; thereby being able to quickly and accurately locate the target video segment desired by the user, and improve User viewing experience.

附图说明Description of drawings

图1是本申请实施例视频录播检索回放方法的流程示意图；Fig. 1 is a schematic flow chart of the method for retrieving and replaying video recording and broadcasting according to the embodiment of the present application;

图2是本申请实施例基于视频录播检索回放装置的方框示意图；Fig. 2 is a schematic block diagram of an embodiment of the present application based on video recording and playback retrieval and playback device;

图3是本申请实施例电子设备的示意图。Fig. 3 is a schematic diagram of an electronic device according to an embodiment of the present application.

具体实施方式Detailed ways

以下结合附图1-3对本申请作进一步详细说明。The present application will be described in further detail below in conjunction with accompanying drawings 1-3.

为使本申请实施例的目的、技术方案和优点更加清楚，下面将结合本申请实施例中的附图，对本申请实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本申请一部分实施例，而不是全部的实施例。基于本申请中的实施例，本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例，都属于本申请保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.

本申请实施例提供了一种视频录播检索回放方法，由电子设备执行，该电子设备可以为服务器，也可以为终端设备，其中，该服务器可以是独立的物理服务器，也可以是多个物理服务器构成的服务器集群或者分布式设备，还可以是提供云计算服务的云服务器。终端设备可以是智能手机、平板电脑、笔记本电脑、台式电脑等，但并不局限于此，该终端设备以及服务器可以通过有线或无线通信方式进行直接或间接地连接，本申请实施例在此不做限制。The embodiment of the present application provides a method for retrieving and playing back video recording and broadcasting, which is executed by an electronic device. The electronic device can be a server or a terminal device, where the server can be an independent physical server or multiple physical servers. A server cluster or a distributed device composed of servers may also be a cloud server that provides cloud computing services. The terminal device may be a smart phone, a tablet computer, a notebook computer, a desktop computer, etc., but is not limited thereto. The terminal device and the server may be connected directly or indirectly through wired or wireless communication. Do limit.

参照图1，该方法包括：步骤S101、步骤S102以及步骤S103，其中：Referring to Fig. 1, the method includes: step S101, step S102 and step S103, wherein:

S101、基于获取的用户第一回放指令以及调取的预设处理规则，对原视频进行处理，基于处理结果，确定视频片段。S101. Process the original video based on the acquired user's first playback instruction and the recalled preset processing rule, and determine a video segment based on the processing result.

对于本申请实施例，用户第一回放指令表示用户需要进行回放的指令，预设处理规则为对原视频进行拆分处理的规则，包括人物规则、语音规则以及人物和语音规则。For this embodiment of the application, the user's first playback instruction indicates that the user needs to play back the instruction, and the preset processing rules are the rules for splitting and processing the original video, including character rules, voice rules, and character and voice rules.

具体地，电子设备获取到用户需要进行回放的指令后，调取预设处理规则，将原视频拆分处理成分别以人物规则、语音规则以及人物和语音规则为基准的三种视频片段，之后电子设备获取用户的选择方式，根据用户的选择，确定其中的一种视频片段展示给用户，例如用户选择语音规则，电子设备则确定经过语音规则拆分处理完成后的视频片段展示给用户。Specifically, after the electronic device obtains the user's instruction to play back, it invokes the preset processing rules, splits and processes the original video into three types of video clips based on character rules, voice rules, and character and voice rules, and then The electronic device obtains the user's selection method, and according to the user's selection, determines one of the video clips to display to the user. For example, the user selects a voice rule, and the electronic device determines that the video clip that has been split and processed by the voice rule is displayed to the user.

进一步地，预设处理规则的存储方式可以包含以下至少一种，方式一，可将预设处理规则存储于电子设备本身的存储空间；方式二，可将预设处理规则存储在与电子设备相连的硬件存储设备中；方式三，可将预设处理规则存储在云端存储空间，方便随时调取。Further, the storage method of the preset processing rule may include at least one of the following methods. In the first method, the preset processing rule may be stored in the storage space of the electronic device itself; in the second method, the preset processing rule may be stored in a computer connected to the electronic device. In the hardware storage device; method 3, the preset processing rules can be stored in the cloud storage space for easy recall at any time.

S102、基于视频片段，生成提示信息。S102. Generate prompt information based on the video segment.

对于本申请实施例，提示信息为告知用户是否需要对视频片段进行再一步筛选的信息。For the embodiment of the present application, the prompt information is information notifying the user whether to further screen the video segment.

具体地，电子设备根据展示给用户的视频片段，并在不影响用户观看的位置生成提示信息，询问是否需要对视频片段进行再一步筛选，提示信息可以了解用户对于预设处理规则处理后的视频片段的满意度，方便之后根据用户基于提示信息的反馈对预设处理规则进行不断改进。Specifically, based on the video clips displayed to the user, the electronic device generates prompt information at a position that does not affect the user's viewing, asking whether it is necessary to further screen the video clips. Fragment satisfaction, so that the preset processing rules can be continuously improved according to the user's feedback based on the prompt information.

需要说明的，对于提示信息的提示方式和提示位置，本申请实施例不做具体限定。It should be noted that the embodiment of the present application does not specifically limit the prompting manner and prompting position of the prompting information.

S103、若获取到用户基于提示信息反馈的第二回放指令，则基于第二回放指令对视频片段进行筛选处理，确定目标视频片段并反馈显示。S103. If the second playback instruction fed back by the user based on the prompt information is acquired, perform screening processing on the video clips based on the second playback instruction, determine a target video clip, and display it in feedback.

对于本申请实施例，筛选处理包含人物特征和/或语音特征筛选。For the embodiment of the present application, the screening process includes character feature and/or voice feature screening.

具体地，当用户对生成的提示信息做出反馈后，如果用户选择不需要进行再一步筛选处理，则关闭提示信息，方便用户观看视频片段，如果用户需要进行再一步筛选处理，则电子设备生成第二回放指令，并基于该第二回放指令，对视频片段进行进一步筛选，通过人物特征和/或语音特征对视频片段进行对比筛选，最终确定目标视频片段，并展示该目标视频片段，其中，目标视频片段可以包含多个目标视频片段，通过进一步的筛选处理，提供更为精确的搜索方式让用户的获得自己所需的目标视频片段。Specifically, after the user gives feedback on the generated prompt information, if the user chooses not to perform further screening processing, the prompt information is closed to facilitate the user to watch the video clip; if the user needs to perform further screening processing, the electronic device generates The second playback instruction, and based on the second playback instruction, the video clips are further screened, the video clips are compared and screened by character features and/or voice features, and finally the target video clip is determined, and the target video clip is displayed, wherein, The target video clip may contain multiple target video clips, and through further screening, a more precise search method is provided to allow the user to obtain the target video clip he needs.

本申请实施例提供了一种视频录播检索回放方法，当电子设备接收用户想要回放的第一回放指令时，电子设备首先通过人物规则、语音规则以及人物和语音规则同时对原视频进行拆分处理，之后基于用户对拆分方式的选择，显示对应的视频片段作为处理结果，同时生成提示信息，询问用户当前视频片段是否需要进行进一步的筛选处理，如不需要，则关闭提示信息，以方便用户对第一处理结果的视频片段进行观看，如需要，则电子设备接收用户的第二回放指令，将用户输入的特征与当前视频片段特征提取后得到的人物特征和/或语音特征进行比对筛选，确定目标视频片段作为最终处理结果，并展示给用户；从而实现快速准确的定位到用户想要回放的目标视频片段，提高用户的观看体验。The embodiment of the present application provides a method for retrieving and playing back video recording and broadcasting. When the electronic device receives the first playback instruction that the user wants to play back, the electronic device first simultaneously disassembles the original video through character rules, voice rules, and character and voice rules. Then, based on the user's selection of the splitting method, the corresponding video segment is displayed as the processing result, and a prompt message is generated to ask the user whether the current video segment needs to be further screened. If not, the prompt message is closed to It is convenient for the user to watch the video segment of the first processing result. If necessary, the electronic device receives the user's second playback instruction, and compares the features input by the user with the character features and/or voice features obtained after feature extraction of the current video segment. For screening, determine the target video segment as the final processing result, and display it to the user; so as to quickly and accurately locate the target video segment that the user wants to playback, and improve the user's viewing experience.

步骤S101中，基于获取的用户第一回放指令以及调取的预设处理规则，对原视频进行处理，具体包括：基于用户第一回放指令以及预设处理规则，确定原视频中的人物特征；确定人物特征对应的出现时间点和消失时间点；基于出现时间点以及消失时间点，确定人物特征对应的人物视频片段，并将人物视频片段设定为第一视频片段。In step S101, the original video is processed based on the acquired user's first playback instruction and the recalled preset processing rules, specifically including: determining the characteristics of characters in the original video based on the user's first playback instruction and preset processing rules; Determine the appearance time point and disappearance time point corresponding to the character feature; based on the appearance time point and disappearance time point, determine the character video segment corresponding to the character feature, and set the character video segment as the first video segment.

对于本申请实施例，出现时间点和消失时间点可以为多个，同理可得，第一视频片段也可以包含多个第一子视频片段，需要说明的是，此处阐述是为了说明视频片段中可以包含多个子视频片段，下文也将沿用，不再过多赘述。For the embodiment of this application, there can be multiple appearance time points and disappearance time points. Similarly, the first video clip can also contain multiple first sub-video clips. It should be noted that the description here is to illustrate the A clip can contain multiple sub-video clips, which will be used below and will not be repeated here.

具体地，电子设备接收到用户的回放指令时，采集原视频中的人物图像信息，同时电子设备记录人物头像信息的出现时间点和对应的消失时间点，并在原视频进度条中进行标记，即，确定一个第一子视频片段，重复以上操作，直到对原视频全部完成采集，并确保每一个第一子视频片段中均包含有人物图像信息，将所有第一子视频片段分别保存并汇总成第一视频片段，电子设备显示包含有多个时间节点的第一视频片段。Specifically, when the electronic device receives the user's playback instruction, it collects the character image information in the original video, and at the same time, the electronic device records the appearance time point and the corresponding disappearance time point of the character head information, and marks them in the progress bar of the original video, that is , determine a first sub-video segment, repeat the above operations until the original video is completely collected, and ensure that each first sub-video segment contains character image information, save and summarize all the first sub-video segments into For the first video segment, the electronic device displays the first video segment including multiple time nodes.

步骤S101中，基于获取的用户第一回放指令以及调取的预设处理规则，对原视频进行处理，具体还包括：基于用户第一回放指令以及预设处理规则，确定原视频中的语音特征；确定语音特征对应的出现时间点和消失时间点；基于出现时间点以及消失时间点，确定语音特征对应的语音视频片段，并将语音视频片段设定为第二视频片段。In step S101, the original video is processed based on the obtained user's first playback instruction and the recalled preset processing rules, which specifically includes: determining the voice features in the original video based on the user's first playback instruction and preset processing rules ; Determine the appearance time point and disappearance time point corresponding to the voice feature; based on the appearance time point and the disappearance time point, determine the voice video segment corresponding to the voice feature, and set the voice video segment as the second video segment.

对于本申请实施例，电子设备接收到用户的回放指令时，确定原视频中的语音音频信息，同时电子设备依次记录语音音频信息的出现时间点和对应的消失时间点，并在原视频进度条中进行标记，即，确定一个第二子视频片段，重复以上操作，直到对原视频全部完成采集，并确保每一个第二子视频片段中均包含有语音音频信息，将所有第二子视频片段分别保存并汇总成第二视频片段，电子设备显示包含有多个时间节点的第二视频片段。For the embodiment of the present application, when the electronic device receives the user's playback instruction, it determines the voice and audio information in the original video, and at the same time, the electronic device sequentially records the appearance time point and the corresponding disappearance time point of the voice and audio information, and displays them in the original video progress bar. Mark, that is, determine a second sub-video segment, repeat the above operations until the original video is completely collected, and ensure that each second sub-video segment contains voice and audio information, and separate all the second sub-video segments The second video clip is saved and summarized, and the electronic device displays the second video clip including multiple time nodes.

步骤S101中，基于获取的用户第一回放指令以及调取的预设处理规则，对原视频进行处理，具体还包括：基于用户第一回放指令以及预设处理规则，确定原视频中的人物特征和语音特征；确定人物特征和语音特征同时存在的时间节点；基于时间节点，确定人物语音视频片段，并将人物语音视频片段设定为第三视频片段。In step S101, the original video is processed based on the acquired user's first playback instruction and the recalled preset processing rules, which specifically includes: determining the characteristics of the characters in the original video based on the user's first playback instruction and the preset processing rules and voice features; determine the time node at which the character features and voice features coexist; based on the time node, determine the voice video segment of the character, and set the voice video segment of the character as the third video segment.

对于本申请实施例，电子设备接收到用户的回放指令时，采集原视频中的人物图像信息和语音音频信息，同时电子设备依次记录人物图像信息和语音音频信息同时出现和消失的时间节点，并在原视频进度条中进行标记，即，确定一个第三子视频片段，重复以上操作，直到对原视频全部完成采集，并确保每一个第三子视频片段中均包含有人物图像信息和语音音频信息，将所有第三子视频片段分别保存并汇总成第三视频片段，电子设备显示包含有多个时间节点的第三视频片段。For this embodiment of the application, when the electronic device receives the playback instruction from the user, it collects the character image information and voice and audio information in the original video, and at the same time, the electronic device sequentially records the time nodes when the character image information and the voice and audio information appear and disappear at the same time, and Mark in the original video progress bar, that is, determine a third sub-video segment, repeat the above operations until the original video is completely collected, and ensure that each third sub-video segment contains character image information and voice and audio information , saving and summarizing all the third sub-video segments into a third video segment, and the electronic device displays the third video segment including multiple time nodes.

步骤S103中，若获取到用户基于提示信息反馈的第二回放指令，则基于用户第二回放指令对视频片段进行筛选处理，确定目标视频片段并反馈显示，具体还包括：基于第二回放指令，生成搜索框并反馈显示；获取用户在搜索框输入的关键词特征；基于关键词特征，对视频片段进行筛选处理，确定目标视频片段。In step S103, if the second playback instruction fed back by the user based on the prompt information is obtained, the video segment is screened based on the user's second playback instruction, and the target video segment is determined and displayed as feedback. Specifically, it also includes: based on the second playback instruction, Generate a search box and display it in feedback; obtain the keyword features entered by the user in the search box; based on the keyword features, screen the video clips to determine the target video clip.

在本申请实施例中，关键词特征表示用户想要查询到的目标视频片段中的人物文本特征和/或语音文本特征。In the embodiment of the present application, the keyword features represent character text features and/or speech text features in the target video segment that the user wants to query.

具体地，以电子设备接收到的用户需要进一步筛选的第二回放指令为条件，生成搜索框并反馈显示，当用户在搜索框中输入关键词特征后，电子设备基于关键词特征，在之前选择的对应视频片段中筛选对应的人物文本特征和/或语音文本特征，并将符合特征的视频片段作为目标视频片段提取出来，例如，用户输入的关键词特征为语音文本特征“好好学习”，将对应第二视频片段中包含有“好好学习”语音的视频片段全部提取，作为最终的目标视频片段。Specifically, based on the second playback instruction received by the electronic device that the user needs to further filter as a condition, a search box is generated and displayed in feedback. When the user enters a keyword feature in the search box, the electronic device based on the keyword feature, previously selected Screen the corresponding character text features and/or voice-text features from the corresponding video clips, and extract the video clips that meet the features as target video clips. All video segments corresponding to the second video segment containing the voice of "study well" are extracted as the final target video segment.

进一步地，获取用户在搜索框输入的关键词特征，之前具体包括：若用户输入人物文本特征，提取视频片段中的人物特征，并以文字的形式表示，确定所有的人物文本特征；若用户输入语音文本特征，将所述视频片段中的语音特征进行文本转换，确定所有的所述语音文本特征；若用户输入人物文本特征和语音文本特征，提取视频片段中以文字形式表示的所有的人物文本特征，提取所述视频片段中所有的所述语音文本特征。Further, the acquisition of the keyword features input by the user in the search box specifically includes: if the user enters the character text features, extract the character features in the video clip and express them in the form of text, and determine all the character text features; if the user inputs Voice-to-text features, converting the voice features in the video clip to text to determine all the voice-to-text features; if the user inputs character text features and voice-to-text features, extract all character texts expressed in text form in the video clip feature, extracting all the speech-text features in the video clip.

在本申请实施例中，人物特征可以包括服装特征以及人物名称，语音特征可以包括音频信息。In the embodiment of the present application, the character features may include clothing features and character names, and the voice features may include audio information.

具体地，若用户输入人物文本特征，将视频片段中的服装特征和人物名称以文字形式表示，例如，视频中有一个片段西装男在演讲的视频，用户输入的关键词特征可以为“西装演讲”，便可以检索到对应的视频片段；若用户输入语音文本特征，将视频中的所有语音片段转化成语音文本，方便后期用户通过文本来进行检索，例如，视频中有一段音频是“好好学习天天向上”，用户输入的关键词特征可以为“学习”，也可以为“好好学习”，便可以检索到对应的视频片段，若用户输入人物文本特征和语音文本特征，同时提取人物文本特征和语音文本特征，例如，视频中有一段片段是西服男子在演讲关于如何学习的片段，用户输入的关键词特征可以为“西服演讲学习”，便可以检索到对应的视频片段。Specifically, if the user inputs character text features, the clothing features and character names in the video clip are represented in text form. For example, there is a clip of a man in a suit giving a speech in the video, and the keyword feature input by the user can be "suit speech ", the corresponding video segment can be retrieved; if the user enters the voice text feature, all the voice segments in the video will be converted into voice text, which is convenient for the user to search through the text later. Upward every day", the keyword feature input by the user can be "learning" or "study hard", and the corresponding video clip can be retrieved. If the user enters character text features and voice text features, the character text features and Speech text features, for example, there is a clip in the video where a man in a suit is giving a speech about how to learn, and the keyword feature input by the user can be "learn in a suit speech", and the corresponding video clip can be retrieved.

需要说明的是，关键词特征可以为各个语种的关键词特征，本申请不作具体限定。It should be noted that the keyword feature may be the keyword feature of each language, which is not specifically limited in this application.

步骤S103中，目标视频片段包含至少一个子目标视频片段，若获取到用户基于提示信息反馈的第二回放指令，则基于用户第二回放指令对视频片段进行筛选处理，确定目标视频片段并反馈显示，具体还包括：生成子目标视频片段对应的时间起始点；若子目标视频片段的数量为预设数量，则直接跳转至子目标视频片段对应的时间起始点进行播放；若子目标视频片段的数量大于预设数量，则优先跳转至时间起始点最早的子目标视频片段对应的时间起始点进行播放，并将目标视频片段包含的其余子目标视频片段对应的时间起始点进行高亮处理。In step S103, the target video segment includes at least one sub-target video segment. If the second playback instruction fed back by the user based on the prompt information is obtained, the video segment is screened based on the user's second playback instruction, and the target video segment is determined and displayed in feedback. , specifically includes: generating the time start point corresponding to the sub-target video segment; if the number of sub-target video segments is the preset number, directly jump to the time start point corresponding to the sub-target video segment to play; if the number of sub-target video segments If the number is greater than the preset number, jump to the time start point corresponding to the earliest sub-target video clip at the time start point first, and then highlight the time start points corresponding to the remaining sub-target video clips included in the target video clip.

在本申请实施例中，预设数量为1。In the embodiment of the present application, the preset number is 1.

具体地，电子设备筛选处理完成后，基于确定的所有子目标视频片段的出现时间点，生成时间起始点，之后判断子目标视频片段的数量，若子目标视频片段数量为1，则表示目标视频片段只有一个对应的时间起始点，则电子设备直接定位到该目标视频片段的时间起始点开始播放；若子目标视频片段数量超过1，则表示该目标视频片段存在多个符合条件的子目标视频片段，且包含多个时间起始点，则根据时间起始点的前后排序，电子设备优先定位到最先出现的时间起始点开始播放，其余的时间起始点作高亮处理，从而帮助用户可以清楚的看到目标视频片段在原视频的进度条上的位置。Specifically, after the electronic device screening process is completed, based on the determined occurrence time points of all sub-target video segments, a time starting point is generated, and then the number of sub-target video segments is judged. If the number of sub-target video segments is 1, it means that the target video segment If there is only one corresponding time starting point, the electronic device will directly locate the time starting point of the target video segment and start playing; if the number of sub-target video segments exceeds 1, it means that the target video segment has multiple sub-target video segments that meet the conditions. and contains multiple time starting points, according to the sequence of the time starting points, the electronic device will firstly locate the time starting point that appears first to start playing, and the rest of the time starting points will be highlighted, so as to help users see clearly The position of the target video clip on the progress bar of the original video.

视频录播检索回放装置20具体可以包括：视频片段确定模块201、提示模块202以及目标视频片段确定模块203，其中，The video recording and playback retrieval and playback device 20 may specifically include: a video segment determination module 201, a prompt module 202, and a target video segment determination module 203, wherein,

视频片段确定模块201，用于基于获取的用户第一回放指令以及调取的预设处理规则，对原视频进行处理，基于处理结果，确定视频片段；The video segment determination module 201 is used to process the original video based on the obtained user's first playback instruction and the preset processing rule called, and determine the video segment based on the processing result;

用户第一回放指令为用户需要进行回放的指令；The user's first playback instruction is an instruction that the user needs to play back;

提示模块202，用于基于视频片段，生成提示信息；Prompt module 202, for generating prompt information based on the video segment;

提示信息为告知用户是否需要对视频片段进行再一步筛选的信息；The prompt information is information to inform the user whether to further screen the video clip;

目标视频片段确定模块203，用于若获取到用户基于提示信息反馈的第二回放指令，则基于第二回放指令对视频片段进行筛选处理，确定目标视频片段并反馈显示。The target video segment determining module 203 is configured to, if the second playback instruction fed back by the user based on the prompt information is obtained, perform screening processing on the video segment based on the second playback instruction, determine the target video segment and display it as feedback.

本申请实施例的一种可能的实现方式，视频片段确定模块201，包括人物确定单元、人物时间确定单元以及第一视频片段单元，其中，In a possible implementation of the embodiment of the present application, the video segment determination module 201 includes a character determination unit, a character time determination unit, and a first video segment unit, wherein,

人物确定单元，用于基于用户第一回放指令以及预设处理规则，确定原视频中的人物特征；A character determination unit, configured to determine the character features in the original video based on the user's first playback instruction and preset processing rules;

人物时间确定单元，用于确定人物特征对应的出现时间点和消失时间点；A character time determination unit, configured to determine the appearance time point and disappearance time point corresponding to the character features;

第一视频片段单元，用于基于出现时间点以及消失时间点，确定人物特征对应的人物视频片段，并将人物视频片段设定为第一视频片段。The first video segment unit is configured to determine a character video segment corresponding to the character feature based on the appearance time point and the disappearance time point, and set the character video segment as the first video segment.

本申请实施例的一种可能的实现方式，视频片段确定模块201，还包括：语音确定单元、语音时间确定单元以及第二视频片段单元，其中，In a possible implementation of the embodiment of the present application, the video segment determination module 201 further includes: a voice determination unit, a voice time determination unit, and a second video segment unit, wherein,

语音确定单元，用于基于用户第一回放指令以及预设处理规则，确定原视频中的语音特征；A voice determination unit, configured to determine voice features in the original video based on the user's first playback instruction and preset processing rules;

语音时间确定单元，用于确定语音特征对应的出现时间点和消失时间点；A voice time determination unit, configured to determine the appearance time point and disappearance time point corresponding to the voice feature;

第二视频片段单元，用于基于出现时间点以及消失时间点，确定语音特征对应的语音视频片段，并将语音视频片段设定为第二视频片段。The second video segment unit is configured to determine the audio video segment corresponding to the voice feature based on the appearance time point and the disappearance time point, and set the audio video segment as the second video segment.

本申请实施例的一种可能的实现方式，视频片段确定模块201，还包括：人物语音确定单元、时间节点确定单元以及第三视频片段单元，其中，In a possible implementation of the embodiment of the present application, the video segment determination module 201 further includes: a character voice determination unit, a time node determination unit, and a third video segment unit, wherein,

人物语音确定单元，用于基于用户第一回放指令以及预设处理规则，确定原视频中的人物特征和语音特征；A character voice determining unit, configured to determine character features and voice features in the original video based on the user's first playback instruction and preset processing rules;

时间节点确定单元，用于确定人物特征和语音特征同时存在的时间节点；A time node determining unit, configured to determine a time node at which character features and voice features coexist;

第三视频片段单元，用于基于时间节点，确定人物语音视频片段，并将人物语音视频片段设定为第三视频片段。The third video segment unit is configured to determine the character voice video segment based on the time node, and set the character voice video segment as the third video segment.

本申请实施例的一种可能的实现方式，目标视频片段确定模块203，还包括：搜索框单元、输入单元以及筛选单元，其中，In a possible implementation of the embodiment of the present application, the target video segment determination module 203 further includes: a search box unit, an input unit, and a screening unit, wherein,

搜索框单元，用于基于第二回放指令，生成搜索框并反馈显示；A search box unit, configured to generate a search box based on the second playback instruction and display it in feedback;

输入单元，用于获取用户在搜索框输入的关键词特征；The input unit is used to obtain the keyword features entered by the user in the search box;

筛选单元，用于基于关键词特征，对视频片段进行筛选处理，确定目标视频片段。The screening unit is configured to perform screening processing on video clips based on keyword features to determine target video clips.

本申请实施例的一种可能的实现方式，视频录播检索回放的装置20，还包括：人物文本模块、语音文本模块以及人物和语音文本模块，其中，In a possible implementation of the embodiment of the present application, the device 20 for retrieving and playing back video recording and broadcasting further includes: a character text module, a voice text module, and a character and voice text module, wherein,

人物文本模块，用于若用户输入人物文本特征，提取视频片段中的人物特征，并以文字的形式表示，确定所有的人物文本特征；The character text module is used to extract the character features in the video clip if the user inputs the character text features, and express them in the form of text to determine all the character text features;

人物和语音文本模块，用于若用户输入人物文本特征和语音文本特征，提取视频片段中以文字形式表示的所有的人物文本特征，提取所述视频片段中所有的所述语音文本特征。The character and speech text module is used to extract all character text features expressed in text form in the video clip and extract all the speech text features in the video clip if the user inputs character text features and speech text features.

本申请实施例的一种可能的实现方式，目标视频片段确定模块203，还包括：时间起始点单元、第一跳转单元以及第二跳转单元，其中，In a possible implementation of the embodiment of the present application, the target video segment determination module 203 further includes: a time starting point unit, a first jump unit, and a second jump unit, wherein,

时间起始点单元，用于生成子目标视频片段对应的时间起始点；The time start point unit is used to generate the time start point corresponding to the sub-target video segment;

第一跳转单元，用于若子目标视频片段的数量为预设数量，则直接跳转至子目标视频片段对应的时间起始点进行播放；The first jump unit is used to directly jump to the corresponding time starting point of the sub-target video segment to play if the quantity of the sub-target video segment is a preset number;

第二跳转单元，用于若子目标视频片段的数量大于预设数量，则优先跳转至时间起始点最早的子目标视频片段对应的时间起始点进行播放，并将目标视频片段包含的其余子目标视频片段对应的时间起始点进行高亮处理。The second jump unit is used to jump to the corresponding time starting point of the earliest sub-target video segment of the time starting point if the number of sub-target video segments is greater than the preset number, and play the remaining sub-target video segments included in the target video segment. The time starting point corresponding to the target video segment is highlighted.

所属领域的技术人员可以清楚地了解到，为描述的方便和简洁，上述描述的系统、装置和单元的具体工作过程，可以参考前述方法实施例中的对应过程，在此不再赘述。Those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the above-described system, device and unit can refer to the corresponding process in the foregoing method embodiment, which will not be repeated here.

本申请实施例还从实体装置的角度介绍了一种电子设备，如图3所示，图3所示的电子设备30包括：处理器301和存储器303。其中，处理器301和存储器303相连，如通过总线302相连。可选地，电子设备30还可以包括收发器304。需要说明的是，实际应用中收发器304不限于一个，该电子设备30的结构并不构成对本申请实施例的限定。The embodiment of the present application also introduces an electronic device from the perspective of a physical device. As shown in FIG. 3 , the electronic device 30 shown in FIG. 3 includes: a processor 301 and a memory 303 . Wherein, the processor 301 is connected to the memory 303 , such as through a bus 302 . Optionally, the electronic device 30 may further include a transceiver 304 . It should be noted that, in practical applications, the transceiver 304 is not limited to one, and the structure of the electronic device 30 does not limit the embodiment of the present application.

处理器301可以是CPU（Central Processing Unit，中央处理器），通用处理器，DSP（Digital Signal Processor，数据信号处理器），ASIC（Application SpecificIntegrated Circuit，专用集成电路），FPGA（Field Programmable Gate Array，现场可编程门阵列）或者其他可编程逻辑器件、晶体管逻辑器件、硬件部件或者其任意组合。其可以实现或执行结合本申请公开内容所描述的各种示例性的逻辑方框，模块和电路。处理器301也可以是实现计算功能的组合，例如包含一个或多个微处理器组合，DSP和微处理器的组合等。The processor 301 may be a CPU (Central Processing Unit, central processing unit), a general-purpose processor, a DSP (Digital Signal Processor, a data signal processor), an ASIC (Application Specific Integrated Circuit, an application specific integrated circuit), an FPGA (Field Programmable Gate Array, field programmable gate array) or other programmable logic devices, transistor logic devices, hardware components, or any combination thereof. It can implement or execute the various illustrative logical blocks, modules and circuits described in connection with the present disclosure. The processor 301 may also be a combination that implements computing functions, for example, a combination of one or more microprocessors, a combination of a DSP and a microprocessor, and the like.

总线302可包括一通路，在上述组件之间传送信息。总线302可以是PCI（Peripheral Component Interconnect，外设部件互连标准）总线或EISA（ExtendedIndustry Standard Architecture，扩展工业标准结构）总线等。总线302可以分为地址总线、数据总线、控制总线等。为便于表示，图3中仅用一条粗线表示，但并不表示仅有一根总线或一种类型的总线。Bus 302 may include a path for communicating information between the components described above. The bus 302 may be a PCI (Peripheral Component Interconnect, Peripheral Component Interconnect Standard) bus or an EISA (Extended Industry Standard Architecture, Extended Industry Standard Architecture) bus or the like. The bus 302 can be divided into address bus, data bus, control bus and so on. For ease of representation, only one thick line is used in FIG. 3 , but it does not mean that there is only one bus or one type of bus.

存储器303可以是ROM（Read Only Memory，只读存储器）或可存储静态信息和指令的其他类型的静态存储设备，RAM（Random Access Memory，随机存取存储器）或者可存储信息和指令的其他类型的动态存储设备，也可以是EEPROM（Electrically ErasableProgrammable Read Only Memory，电可擦可编程只读存储器）、CD-ROM（Compact DiscRead Only Memory，只读光盘）或其他光盘存储、光碟存储（包括压缩光碟、激光碟、光碟、数字通用光碟、蓝光光碟等）、磁盘存储介质或者其他磁存储设备、或者能够用于携带或存储具有指令或数据结构形式的期望的程序代码并能够由计算机存取的任何其他介质，但不限于此。The memory 303 can be ROM (Read Only Memory, read-only memory) or other types of static storage devices that can store static information and instructions, RAM (Random Access Memory, random access memory) or other types of static storage devices that can store information and instructions Dynamic storage devices can also be EEPROM (Electrically Erasable Programmable Read Only Memory, Electrically Erasable Programmable Read-Only Memory), CD-ROM (Compact DiscRead Only Memory, CD-ROM) or other CD storage, CD storage (including compact CD, laser discs, compact discs, digital versatile discs, blu-ray discs, etc.), magnetic disk storage media or other magnetic storage devices, or any other medium capable of carrying or storing desired program code in the form of instructions or data structures that can be accessed by a computer media, but not limited to this.

存储器303用于存储执行本申请方案的应用程序代码，并由处理器301来控制执行。处理器301用于执行存储器303中存储的应用程序代码，以实现前述方法实施例所示的内容。The memory 303 is used to store application program codes for executing the solutions of the present application, and the execution is controlled by the processor 301 . The processor 301 is configured to execute the application program code stored in the memory 303, so as to realize the contents shown in the foregoing method embodiments.

其中，电子设备包括但不限于：移动电话、笔记本电脑、数字广播接收器、PDA（个人数字助理）、PAD（平板电脑）、PMP（便携式多媒体播放器）、车载终端（例如车载导航终端）等等的移动终端以及诸如数字TV、台式计算机等等的固定终端。还可以为服务器等。图3示出的电子设备仅仅是一个示例，不应对本申请实施例的功能和使用范围带来任何限制。Among them, electronic devices include but are not limited to: mobile phones, notebook computers, digital broadcast receivers, PDA (personal digital assistant), PAD (tablet computer), PMP (portable multimedia player), vehicle-mounted terminals (such as vehicle-mounted navigation terminals), etc. Mobile terminals such as digital TVs, desktop computers, etc. and fixed terminals. Also for servers etc. The electronic device shown in FIG. 3 is only an example, and should not limit the functions and scope of use of this embodiment of the present application.

应该理解的是，虽然附图的流程图中的各个步骤按照箭头的指示依次显示，但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明，这些步骤的执行并没有严格的顺序限制，其可以以其他的顺序执行。而且，附图的流程图中的至少一部分步骤可以包括多个子步骤或者多个阶段，这些子步骤或者阶段并不必然是在同一时刻执行完成，而是可以在不同的时刻执行，其执行顺序也不必然是依次进行，而是可以与其他步骤或者其他步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the various steps in the flow chart of the accompanying drawings are displayed sequentially according to the arrows, these steps are not necessarily executed sequentially in the order indicated by the arrows. Unless otherwise specified herein, there is no strict order restriction on the execution of these steps, and they can be executed in other orders. Moreover, at least some of the steps in the flowcharts of the accompanying drawings may include multiple sub-steps or multiple stages, and these sub-steps or stages may not necessarily be executed at the same time, but may be executed at different times, and the order of execution is also It is not necessarily performed sequentially, but may be performed alternately or alternately with at least a part of other steps or sub-steps or stages of other steps.

以上仅是本申请的部分实施方式，应当指出，对于本技术领域的普通技术人员来说，在不脱离本申请原理的前提下，还可以做出若干改进和润饰，这些改进和润饰也应视为本申请的保护范围。The above are only some implementations of the present application. It should be pointed out that for those of ordinary skill in the art, some improvements and modifications can be made without departing from the principle of the application, and these improvements and modifications should also be considered as For the scope of protection of this application.

Claims

1. A video recording, playback and retrieval method, comprising:

processing the original video based on the acquired first playback instruction of the user and the called preset processing rule, and determining a video clip based on a processing result;

the first playback instruction of the user is an instruction that the user needs to play back;

generating prompt information based on the video clips;

the prompt information is information for informing a user whether the video clips need to be screened again;

and if a second playback instruction fed back by the user based on the prompt information is acquired, screening the video clips based on the second playback instruction, determining a target video clip and feeding back and displaying the target video clip.

2. The method for video recording, playback and retrieval according to claim 1, wherein the processing the original video based on the acquired user first playback instruction and the invoked preset processing rule comprises:

Determining character features in the original video based on the first playback instruction of the user and the preset processing rule;

determining appearance time points and disappearance time points corresponding to the character features;

and determining a character video clip corresponding to the character feature based on the appearance time point and the disappearance time point, and setting the character video clip as a first video clip.

3. The method for video recording, playback and retrieval according to claim 1, wherein the processing the original video based on the acquired user first playback instruction and the invoked preset processing rule comprises:

determining voice characteristics in the original video based on the first playback instruction of the user and the preset processing rule;

determining an appearance time point and a disappearance time point corresponding to the voice feature;

and determining a voice video segment corresponding to the voice feature based on the occurrence time point and the disappearance time point, and setting the voice video segment as a second video segment.

4. The method for video recording, playback and retrieval according to claim 1, wherein the processing the original video based on the acquired user first playback instruction and the invoked preset processing rule comprises:

Determining character features and voice features in the original video based on the first playback instruction of the user and the preset processing rule;

determining a time node at which the character feature and the voice feature coexist;

and determining a character voice video segment based on the time node, and setting the character voice video segment as a third video segment.

5. The method of claim 1, wherein if a second playback instruction fed back by a user based on prompt information is obtained, screening the video segments based on the second playback instruction of the user, determining a target video segment, and feeding back and displaying the target video segment, comprising:

generating a search box and feeding back and displaying the search box based on the second playback instruction;

acquiring keyword characteristics input by a user in the search box;

and screening the video clips based on the keyword features to determine target video clips.

6. The method for video recording, searching and replaying according to claim 5, wherein the keyword features are human text features and/or voice text features, and the step of obtaining the keyword features input by the user in the search box further comprises the steps of:

If the user inputs character text features, extracting the character features in the video clips, representing the character features in a text form, and determining all the character text features;

if the user inputs the voice text features, performing text conversion on the voice features in the video clips, and determining all the voice text features;

if the user inputs character text features and voice text features, extracting all character text features expressed in a text form in the video segment, and extracting all voice text features in the video segment.

7. The video recording, retrieving and playback method as set forth in claim 1, wherein the target video clip comprises at least one sub-target video clip; if a second playback instruction fed back by the user based on the prompt information is acquired, screening the video clips based on the second playback instruction, determining a target video clip and feeding back and displaying the target video clip, including:

generating a time starting point corresponding to the sub-target video segment;

if the number of the sub-target video clips is the preset number, directly jumping to a time starting point corresponding to the sub-target video clips for playing;

If the number of the sub-target video clips is greater than the preset number, preferentially jumping to the time starting point corresponding to the sub-target video clip with the earliest time starting point for playing, and performing highlighting processing on the time starting points corresponding to the rest sub-target video clips contained in the target video clip.

8. A video recording, playback and retrieval device, comprising:

the video segment determining module is used for processing the original video based on the acquired first playback instruction of the user and the called preset processing rule and determining a video segment based on the processing result;

the prompt module is used for generating prompt information based on the video clips;

and the target video segment determining module is used for screening the video segments based on the second playback instruction if the second playback instruction fed back by the user based on the prompt information is acquired, determining the target video segments and feeding back and displaying the target video segments.

9. An electronic device, comprising:

At least one processor;

a memory;

at least one application program, wherein the at least one application program is stored in the memory and configured to be executed by the at least one processor, the at least one application program configured to: a video recording and playback retrieval method according to any one of claims 1 to 7.

10. A computer readable storage medium having stored thereon a computer program, characterized in that the computer program, when executed in a computer, causes the computer to perform a video recording, retrieving and playback method according to any one of claims 1 to 7.