CN114564614A - A kind of video clip automatic search method, system, device and readable storage medium - Google Patents
A kind of video clip automatic search method, system, device and readable storage medium Download PDFInfo
- Publication number
- CN114564614A CN114564614A CN202210108784.4A CN202210108784A CN114564614A CN 114564614 A CN114564614 A CN 114564614A CN 202210108784 A CN202210108784 A CN 202210108784A CN 114564614 A CN114564614 A CN 114564614A
- Authority
- CN
- China
- Prior art keywords
- video
- search
- keywords
- client
- storage device
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/735—Filtering based on additional data, e.g. user or group profiles
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/783—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/7867—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings
 
- 
        - G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
 
- 
        - G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
 
- 
        - G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
 
- 
        - G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
 
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Library & Information Science (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Artificial Intelligence (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
技术领域technical field
本发明涉及视频数据搜索技术领域,更具体的说是涉及一种视频片段自动搜索方法、系统、装置及可读存储介质。The present invention relates to the technical field of video data search, and more particularly to an automatic search method, system, device and readable storage medium for video clips.
背景技术Background technique
随着多媒体业务的不断发展,视频的数量越来越多。为了摆脱了传统只能被动观看电视视频的方式,越来越多的人选择通过网络搜索的方式获取想要观看的视频。因此,视频搜索也成了获取视频的一个关键环节。With the continuous development of multimedia services, the number of videos is increasing. In order to get rid of the traditional way of passively watching TV videos, more and more people choose to search for the videos they want to watch through the Internet. Therefore, video search has also become a key link in obtaining videos.
目前,视频搜索功能通常是采用关键字搜索的方式来实现。此种方式仅能搜寻到标题或者描述符合的视频,且需将影片看完才能知道需要的片段位于几分几秒,又或者需将影片看完才能知道是否有需要的内容。但想快速查找资讯时,需看完影片是相当费时的,因此该如何快速搜寻有兴趣的片段是个重要的课题。At present, the video search function is usually implemented by means of keyword search. This method can only search for videos that match the title or description, and you need to watch the video to know the minutes and seconds of the desired segment, or you need to watch the video to know whether you have the desired content. However, when you want to quickly find information, it is time-consuming to watch the video, so how to quickly search for interesting clips is an important issue.
针对视频片段搜索功能的实现,现有技术中主要采用以下三种方式;For the realization of the video clip search function, the following three methods are mainly adopted in the prior art;
1、通过视频创作者自行于影片描述内自行标注时间戳记设定段落,以及重点说明,以供观看用户来直接点阅段落。1. The video creator can set the paragraph by marking the time stamp in the video description, as well as the key description, so that the viewing user can directly click on the paragraph.
2、通过预先人为设定剪辑片段,以供后续观看者进行点选。2. The clips are manually set in advance for subsequent viewers to click.
3、通过分析视频图像内容特征,并记录特征与时间点以供后续搜寻查找片段。3. By analyzing the characteristics of video image content, and recording the characteristics and time points for subsequent searches to find clips.
在上述现有的方法中,通过视频创作者自行标注段落的方法,需要创作者手动标注,并且视频仅能先通过标题被搜寻,接着观看用户点选视频后,才能再点选标注段落;若发生标题与段落内容出现差异,视频则无法被搜寻到。而通过分析视频图像内容特征进行标注的方法,因图像特征的选择与比对较复杂,需要较大的计算资源。In the above-mentioned existing method, the method of labeling paragraphs by the video creator requires the creator to manually label the video, and the video can only be searched through the title first, and then the watching user clicks the video before clicking the labeling paragraph; if If there is a discrepancy between the title and paragraph content, the video cannot be searched. However, the method of labeling by analyzing the content features of video images requires large computing resources due to the complicated selection and comparison of image features.
发明内容SUMMARY OF THE INVENTION
针对以上问题,本发明的目的在于提供一种视频片段自动搜索方法、系统、装置及可读存储介质,通过自动识别语音、语义的方式,记录视频内容,并实现了自动且低运算资源的视频片段记录与搜索。In view of the above problems, the purpose of the present invention is to provide an automatic search method, system, device and readable storage medium for video clips, which can record video content by automatically recognizing voice and semantics, and realize automatic and low computing resource video. Fragment recording and searching.
本发明为实现上述目的,通过以下技术方案实现:一种视频片段自动搜索方法,包括:In order to achieve the above object, the present invention is realized through the following technical solutions: an automatic search method for video clips, comprising:
视频创作者上传视频串流至视频存储装置,并同步通过语音识别AI模型建立串流视频的片段关键字资料库;The video creator uploads the video stream to the video storage device, and simultaneously establishes the segment keyword database of the stream video through the voice recognition AI model;
将视频搜索引擎分别与视频存储装置和关键字资料库关联;associating the video search engine with the video storage device and the keyword database respectively;
观看用户通过客户端登录视频搜索引擎并输入关键字,视频搜索引擎通过关键字从视频存储装置中找到对应的视频片段并推送至客户端;The viewing user logs in to the video search engine through the client and inputs a keyword, and the video search engine finds the corresponding video clip from the video storage device through the keyword and pushes it to the client;
观看用户通过客户端播放推送的视频片段。Watch users play the pushed video clips through the client.
进一步,所述通过语音识别AI模型建立串流视频的片段关键字资料库包括:通过语音识别AI模型对上传的视频串流进行语音识别,以建立具有时间轴的字幕档案;Further, the establishment of a segment keyword database of the streaming video through the voice recognition AI model includes: performing voice recognition on the uploaded video stream through the voice recognition AI model to establish a subtitle file with a timeline;
通过预设的语义识别系统自动筛选出字幕档案中的关键字作为视频片段的搜索关键字;The keywords in the subtitle file are automatically screened out as the search keywords of the video clips through the preset semantic recognition system;
语义识别系统根据搜索关键字建立串流视频的片段关键字资料库。The semantic recognition system builds a segment keyword database of the streaming video according to the search keywords.
进一步,所述语义识别系统根据搜索关键字建立串流视频的片段关键字资料库,包括:Further, the semantic recognition system establishes a segment keyword database of the streaming video according to the search keywords, including:
语义识别系统根据搜索关键字生成相应的视频编号和时间戳记;The semantic recognition system generates corresponding video numbers and timestamps according to the search keywords;
将搜索关键字、视频编号和时间戳记存储至片段关键字资料库。Store search keywords, video numbers and timestamps to the clip keyword database.
进一步,所述视频搜索引擎通过关键字从视频存储装置找到对应的视频片段并推送至客户端,包括:Further, the video search engine finds the corresponding video clips from the video storage device through keywords and pushes them to the client, including:
视频搜索引擎通过关键字在片段关键字资料库中找到相同的搜索关键字;The video search engine finds the same search keyword in the segment keyword database by keyword;
根据找到的搜索关键字取得相应的视频编号和时间戳记;Obtain the corresponding video number and timestamp according to the found search keywords;
根据视频编号和时间戳记在视频存储装置中找到对应的视频片段;Find the corresponding video segment in the video storage device according to the video number and time stamp;
视频存储装置将视频片段对应的视频串流发送至客户端。The video storage device sends the video stream corresponding to the video clip to the client.
进一步,所述观看用户通过客户端登录视频搜索引擎并输入关键字,具体为:Further, the viewing user logs into the video search engine through the client and inputs keywords, specifically:
观看用户通过客户端登录视频搜索引擎并通过AND运算符输入进阶关键字。Watch users log in to the video search engine through the client and enter advanced keywords through the AND operator.
相应的,本发明还公开了一种视频片段自动搜索系统,包括:Correspondingly, the present invention also discloses an automatic search system for video clips, including:
视频串流输入模块,用于视频创作者上传视频串流至视频存储装置,并同步通过语音识别AI模型建立串流视频的片段关键字资料库;The video stream input module is used for video creators to upload video streams to the video storage device, and synchronously establish the segment keyword database of the stream video through the voice recognition AI model;
关联模块,用于将视频搜索引擎分别与视频存储装置和关键字资料库关联;搜索模块,用于观看用户通过客户端登录视频搜索引擎并输入关键字,视频搜索引擎通过关键字从视频存储装置中找到对应的视频片段并推送至客户端;播放模块,用于观看用户通过客户端播放推送的视频片段。The association module is used to associate the video search engine with the video storage device and the keyword database respectively; the search module is used to watch the user log in to the video search engine through the client and input the keyword, and the video search engine uses the keyword from the video storage device. Find the corresponding video clip in the client and push it to the client; the playback module is used to watch the user play the pushed video clip through the client.
进一步,所述视频串流输入模块包括:Further, the video stream input module includes:
字库档案组建单元,用于通过语音识别AI模型对上传的视频串流进行语音识别,以建立具有时间轴的字幕档案;The font file building unit is used to perform speech recognition on the uploaded video stream through the speech recognition AI model to create a subtitle file with a timeline;
关键字筛选单元,用于通过预设的语义识别系统自动筛选出字幕档案中的关键字作为视频片段的搜索关键字;A keyword screening unit for automatically screening out keywords in the subtitle file as search keywords for video clips through a preset semantic recognition system;
片段关键字资料库组建单元,用于通过语义识别系统根据搜索关键字建立串流视频的片段关键字资料库。The segment keyword database building unit is used to establish a segment keyword database of the streaming video according to the search keyword through the semantic recognition system.
进一步,所述搜索模块包括:Further, the search module includes:
比对单元,用于视频搜索引擎通过关键字在片段关键字资料库中找到相同的搜索关键字;The comparison unit is used for the video search engine to find the same search keyword in the segment keyword database through keywords;
标记获取单元,用于根据找到的搜索关键字取得相应的视频编号和时间戳记;视频片段获取单元,用于根据视频编号和时间戳记在视频存储装置中找到对应的视频片段;Mark acquisition unit, for obtaining corresponding video number and time stamp according to the search keyword found; Video segment acquisition unit, for finding corresponding video segment in the video storage device according to video number and time stamp;
视频串流推送单元,用于通过视频存储装置将视频片段对应的视频串流发送至客户端。The video stream push unit is used for sending the video stream corresponding to the video clip to the client through the video storage device.
相应的,本发明公开了一种视频片段自动搜索装置,包括:Correspondingly, the present invention discloses a video clip automatic search device, comprising:
存储器,用于存储视频片段自动搜索程序;Memory, used to store the automatic search program for video clips;
处理器,用于执行所述视频片段自动搜索程序时实现如上文任一项所述视频片段自动搜索方法的步骤。The processor is configured to implement the steps of the automatic video clip search method according to any one of the above when executing the video clip automatic search program.
相应的,本发明公开了一种可读存储介质,所述可读存储介质上存储有视频片段自动搜索程序,所述视频片段自动搜索程序被处理器执行时实现如上文任一项所述视频片段自动搜索方法的步骤。Correspondingly, the present invention discloses a readable storage medium, on which an automatic video segment search program is stored, and when the video segment automatic search program is executed by a processor, the video as described in any of the above is realized. Steps for the Fragment AutoSearch method.
对比现有技术,本发明有益效果在于:Compared with the prior art, the beneficial effects of the present invention are:
1、本发明提供了一种视频片段自动搜索方法、系统、装置及可读存储介质,通过语音识别AI模型和语义识别系统,自动建立串流视频的片段关键字资料库,并能够利用关键字搜索片段关键字资料库,通过视频编号和时间戳记获取相应的视频片段。1. The present invention provides a video clip automatic search method, system, device and readable storage medium. Through the voice recognition AI model and the semantic recognition system, a fragment keyword database of streaming video is automatically established, and keywords can be used. Search the clip keyword database to get the corresponding video clip by video number and timestamp.
2、本发明可自动建立关键片段,无需人工标注。2. The present invention can automatically create key segments without manual annotation.
3、本发明进行智能语音/语义识别演算的计算资源远小于图像识别演算所需的计算资源。3. The computing resources required for the intelligent speech/semantic recognition algorithm in the present invention are far less than the computing resources required for the image recognition algorithm.
4、由于智能语音/语义识别演算的计算资源较小,因此本发明可实时识别高解析高帧率的视频。4. Since the computing resources of the intelligent speech/semantic recognition algorithm are small, the present invention can recognize videos with high resolution and high frame rate in real time.
由此可见,本发明与现有技术相比,具有突出的实质性特点和显著的进步,其实施的有益效果也是显而易见的。It can be seen that, compared with the prior art, the present invention has outstanding substantive features and significant progress, and the beneficial effects of its implementation are also obvious.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据提供的附图获得其他的附图。In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only It is an embodiment of the present invention. For those of ordinary skill in the art, other drawings can also be obtained according to the provided drawings without creative efforts.
附图1是本发明具体实施方式的方法流程图。FIG. 1 is a flow chart of a method of a specific embodiment of the present invention.
附图2是本发明具体实施方式的系统结构图。FIG. 2 is a system structure diagram of a specific embodiment of the present invention.
图中,1为视频串流输入模块;2为关联模块;3为搜索模块;4为播放模块;11为字库档案组建单元;12为关键字筛选单元;13为片段关键字资料库组建单元;31为比对单元;32为标记获取单元;33为视频片段获取单元;34为视频串流推送单元。In the figure, 1 is a video stream input module; 2 is an association module; 3 is a search module; 4 is a playback module; 11 is a font file building unit; 12 is a keyword screening unit; 13 is a segment keyword database building unit; 31 is a comparison unit; 32 is a mark acquisition unit; 33 is a video clip acquisition unit; 34 is a video stream push unit.
具体实施方式Detailed ways
本发明的核心是提供一种视频片段自动搜索方法,现有技术中,通过视频创作者自行标注段落的方法,需要创作者手动标注,并且视频仅能先通过标题被搜寻,接着观看用户点选视频后,才能再点选标注段落;若发生标题与段落内容出现差异,视频则无法被搜寻到。而通过分析视频图像内容特征进行标注的方法,因图像特征的选择与比对较复杂,需要较大的计算资源。The core of the present invention is to provide an automatic search method for video clips. In the prior art, the method of labeling paragraphs by video creators requires manual labeling by the creators, and videos can only be searched through the title first, and then the viewing user clicks After the video, you can click and mark the paragraph again; if there is a discrepancy between the title and the content of the paragraph, the video cannot be searched. However, the method of labeling by analyzing the content features of video images requires large computing resources due to the complicated selection and comparison of image features.
而本发明提供的视频片段自动搜索方法,首先,视频创作者上传视频串流至视频存储装置,并同步通过语音识别AI模型建立串流视频的片段关键字资料库。然后,将视频搜索引擎分别与视频存储装置和关键字资料库关联。此时,观看用户通过客户端登录视频搜索引擎并输入关键字,视频搜索引擎通过关键字从视频存储装置中找到对应的视频片段并推送至客户端;最后,观看用户通过客户端播放推送的视频片段。由此可见,本发明通过自动识别语音、语义的方式,记录视频内容,并实现了自动且低运算资源的视频片段记录与搜索。In the automatic search method for video clips provided by the present invention, firstly, the video creator uploads the video stream to the video storage device, and simultaneously establishes a clip keyword database of the stream video through the voice recognition AI model. Then, the video search engine is associated with the video storage device and the keyword database, respectively. At this point, the viewing user logs in to the video search engine through the client and inputs a keyword, and the video search engine finds the corresponding video clip from the video storage device through the keyword and pushes it to the client; finally, the viewing user plays the pushed video through the client Fragment. It can be seen that the present invention records video content by automatically recognizing voice and semantics, and realizes automatic recording and searching of video clips with low computing resources.
为了使本技术领域的人员更好地理解本发明方案,下面结合附图和具体实施方式对本发明作进一步的详细说明。显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make those skilled in the art better understand the solution of the present invention, the present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments. Obviously, the described embodiments are only some, but not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
实施例一:Example 1:
如图1所示,本实施例提供了一种视频片段自动搜索方法,包括如下步骤:As shown in FIG. 1, this embodiment provides an automatic search method for video clips, including the following steps:
S1:视频创作者上传视频串流至视频存储装置,并同步通过语音识别AI模型建立串流视频的片段关键字资料库。S1: The video creator uploads the video stream to the video storage device, and simultaneously establishes a segment keyword database of the stream video through the voice recognition AI model.
其中,在建立串流视频的片段关键字资料库时,首先通过语音识别AI模型对上传的视频串流进行语音识别,以建立具有时间轴的字幕档案。然后,通过预设的语义识别系统自动筛选出字幕档案中的关键字作为视频片段的搜索关键字。最后,语义识别系统根据搜索关键字建立串流视频的片段关键字资料库。Among them, when building the segment keyword database of the streaming video, firstly, the speech recognition AI model is used to perform speech recognition on the uploaded video stream to create a subtitle file with a timeline. Then, keywords in the subtitle file are automatically screened out as search keywords for video clips through a preset semantic recognition system. Finally, the semantic recognition system builds a segment keyword database of the streaming video according to the search keywords.
作为示例的,通过语义识别系统根据搜索关键字建立串流视频的片段关键字资料库时,首先需要通过语音识别AI模型对上传的视频串流进行语音识别,以建立具有时间轴的字幕档案。然后,通过预设的语义识别系统自动筛选出字幕档案中的关键字作为视频片段的搜索关键字。最后,由语义识别系统根据搜索关键字建立串流视频的片段关键字资料库。As an example, when using a semantic recognition system to create a segment keyword database of streaming videos based on search keywords, it is first necessary to perform speech recognition on the uploaded video stream through a speech recognition AI model to create a subtitle file with a timeline. Then, keywords in the subtitle file are automatically screened out as search keywords for video clips through a preset semantic recognition system. Finally, a segment keyword database of the streaming video is established by the semantic recognition system according to the search keywords.
本步骤基于现今的语音识别AI模型已经相当的成熟且精准,已广泛的利用在自动上字幕的应用上的前提下。因此当视频创作者上传影片串流时,可同时进行语音识别,自动建立具时间轴的字幕。This step is based on the premise that the current speech recognition AI model is quite mature and accurate, and has been widely used in the application of automatic subtitles. Therefore, when a video creator uploads a video stream, speech recognition can be performed at the same time to automatically create subtitles with a timeline.
运行时,通过语义识别系统,可自动选出字幕档案里的关键字作为视频片段的搜寻关键字,而此视频编号以及此片段时间轴(格式为XX:XX:XX,XXX)则建立为此关键字的结果。而语音识别AI模型与语义识别系统可整合为单一的”关键字自动识别”引擎,即可通过此引擎,实时的建立串流视频的片段关键字资料库。When running, through the semantic recognition system, the keyword in the subtitle file can be automatically selected as the search keyword for the video clip, and the video number and the clip timeline (in the format of XX:XX:XX,XXX) are established for this keyword results. The speech recognition AI model and the semantic recognition system can be integrated into a single "keyword automatic recognition" engine, and through this engine, a segment keyword database of streaming video can be created in real time.
S2:将视频搜索引擎分别与视频存储装置和关键字资料库关联。S2: Associate the video search engine with the video storage device and the keyword database respectively.
S3:观看用户通过客户端登录视频搜索引擎并输入关键字,视频搜索引擎通过关键字从视频存储装置中找到对应的视频片段并推送至客户端。S3: The viewing user logs in to the video search engine through the client and inputs a keyword, and the video search engine finds the corresponding video clip from the video storage device through the keyword and pushes it to the client.
具体来说,首先由视频搜索引擎通过关键字在片段关键字资料库中找到相同的搜索关键字。此时,根据找到的搜索关键字取得相应的视频编号和时间戳记。然后,根据视频编号和时间戳记在视频存储装置中找到对应的视频片段。最后,由视频存储装置将视频片段对应的视频串流发送至客户端。Specifically, the video search engine first finds the same search keyword in the segment keyword database through keywords. At this time, the corresponding video number and time stamp are obtained according to the found search keyword. Then, the corresponding video segment is found in the video storage device according to the video number and time stamp. Finally, the video storage device sends the video stream corresponding to the video clip to the client.
另外,在输入关键字时,可通过AND运算符输入进阶关键字。以实现进阶搜索。In addition, when entering keywords, advanced keywords can be entered through the AND operator. for advanced search.
作为示例的,视频片段搜索流程具体如下:As an example, the video clip search process is as follows:
1、用户输入关键字。1. The user enters a keyword.
2、通过视频搜索引擎进行关键字搜索。2. Keyword search through video search engine.
3、在片段关键字资料库中获取符合搜索条件的视频编号和时间戳记。3. Get video numbers and timestamps that match the search criteria in the segment keyword database.
4、从视频存储装置中找到对应的视频片段。4. Find the corresponding video clip from the video storage device.
5、将视频片段推荐给用户。5. Recommend video clips to users.
S4:观看用户通过客户端播放推送的视频片段。S4: Watch the video clips pushed by the user through the client.
本实施例提供了一种视频片段自动搜索方法,通过语音识别AI模型和语义识别系统,自动建立串流视频的片段关键字资料库,并能够利用关键字搜索片段关键字资料库,通过视频编号和时间戳记获取相应的视频片段。本方法通过自动识别语音、语义的方式,记录视频内容,并实现了自动且低运算资源的视频片段记录与搜索。This embodiment provides an automatic search method for video clips. Through the AI model of speech recognition and the semantic recognition system, a clip keyword database of streaming video is automatically established, and keywords can be used to search the clip keyword database, and the video number can be used to search the clip keyword database. and timestamp to get the corresponding video clip. The method records the video content by automatically recognizing voice and semantics, and realizes the automatic recording and searching of video clips with low computing resources.
实施例二:Embodiment 2:
基于实施例一,如图2所示,本发明还公开了一种视频片段自动搜索系统,包括:视频串流输入模块1、关联模块2、搜索模块3和播放模块4。Based on the first embodiment, as shown in FIG. 2 , the present invention also discloses an automatic video clip search system, including: a video stream input module 1 , an association module 2 , a search module 3 and a playback module 4 .
视频串流输入模块1,用于视频创作者上传视频串流至视频存储装置,并同步通过语音识别AI模型建立串流视频的片段关键字资料库。The video stream input module 1 is used for the video creator to upload the video stream to the video storage device, and synchronously establish the segment keyword database of the stream video through the voice recognition AI model.
具体来说,视频串流输入1模块包括:字库档案组建单元11、关键字筛选单元12和片段关键字资料库组建单元13。Specifically, the video stream input 1 module includes: a font file building unit 11 , a keyword screening unit 12 and a segment keyword database building unit 13 .
字库档案组建单元11,用于通过语音识别AI模型对上传的视频串流进行语音识别,以建立具有时间轴的字幕档案。The font file building unit 11 is used to perform speech recognition on the uploaded video stream through the speech recognition AI model, so as to create a subtitle file with a timeline.
关键字筛选单元12,用于通过预设的语义识别系统自动筛选出字幕档案中的关键字作为视频片段的搜索关键字。The keyword screening unit 12 is configured to automatically filter out keywords in the subtitle file as search keywords for video clips through a preset semantic recognition system.
片段关键字资料库组建单元13,用于通过语义识别系统根据搜索关键字建立串流视频的片段关键字资料库。The segment keyword database building unit 13 is configured to establish a segment keyword database of the streaming video according to the search keyword through the semantic recognition system.
关联模块2,用于将视频搜索引擎分别与视频存储装置和关键字资料库关联。The association module 2 is used to associate the video search engine with the video storage device and the keyword database respectively.
搜索模块3,用于观看用户通过客户端登录视频搜索引擎并输入关键字,视频搜索引擎通过关键字从视频存储装置中找到对应的视频片段并推送至客户端。The search module 3 is used to watch the user log in to the video search engine through the client and input a keyword, and the video search engine finds the corresponding video clip from the video storage device through the keyword and pushes it to the client.
具体来说,搜索模块3包括:比对单元31、标记获取单元32、视频片段获取单元33和视频串流推送单元34。Specifically, the search module 3 includes: a comparison unit 31 , a marker acquisition unit 32 , a video segment acquisition unit 33 and a video stream push unit 34 .
比对单元31,用于视频搜索引擎通过关键字在片段关键字资料库中找到相同的搜索关键字。The comparison unit 31 is used for the video search engine to find the same search keyword in the segment keyword database through keywords.
标记获取单元32,用于根据找到的搜索关键字取得相应的视频编号和时间戳记。The tag obtaining unit 32 is configured to obtain the corresponding video number and time stamp according to the found search keyword.
视频片段获取单元33,用于根据视频编号和时间戳记在视频存储装置中找到对应的视频片段。The video clip obtaining unit 33 is configured to find the corresponding video clip in the video storage device according to the video number and the time stamp.
视频串流推送单元34,用于通过视频存储装置将视频片段对应的视频串流发送至客户端。The video stream pushing unit 34 is configured to send the video stream corresponding to the video clip to the client through the video storage device.
播放模块4,用于观看用户通过客户端播放推送的视频片段。The playing module 4 is used to watch the video clips that the user plays and pushes through the client.
本实施例提供了一种视频片段自动搜索系统,能够通过语音识别AI模型和语义识别系统,自动建立串流视频的片段关键字资料库,并利用关键字搜索片段关键字资料库,通过视频编号和时间戳记获取相应的视频片段。This embodiment provides an automatic search system for video clips, which can automatically establish a clip keyword database of streaming videos through a voice recognition AI model and a semantic recognition system, and use keywords to search the clip keyword database. and timestamp to get the corresponding video clip.
实施例三:Embodiment three:
本实施例公开了一种视频片段自动搜索装置,包括处理器和存储器;其中,所述处理器执行所述存储器中保存的视频片段自动搜索程序时实现以下步骤:The present embodiment discloses a video clip automatic search device, including a processor and a memory; wherein, the processor implements the following steps when executing the video clip automatic search program saved in the memory:
1、视频创作者上传视频串流至视频存储装置,并同步通过语音识别AI模型建立串流视频的片段关键字资料库。1. The video creator uploads the video stream to the video storage device, and simultaneously establishes the segment keyword database of the stream video through the voice recognition AI model.
2、将视频搜索引擎分别与视频存储装置和关键字资料库关联。2. Associate the video search engine with the video storage device and the keyword database respectively.
3、观看用户通过客户端登录视频搜索引擎并输入关键字,视频搜索引擎通过关键字从视频存储装置中找到对应的视频片段并推送至客户端。3. The viewing user logs in to the video search engine through the client and inputs a keyword, and the video search engine finds the corresponding video clip from the video storage device through the keyword and pushes it to the client.
4、观看用户通过客户端播放推送的视频片段。4. Watch the video clips that users play through the client.
进一步的,本实施例中的视频片段自动搜索装置,还可以包括:Further, the apparatus for automatically searching for video clips in this embodiment may further include:
输入接口,用于获取外界导入的视频片段自动搜索程序,并将获取到的视频片段自动搜索程序保存至所述存储器中,还可以用于获取外界终端设备传输的各种指令和参数,并传输至处理器中,以便处理器利用上述各种指令和参数展开相应的处理。本实施例中,所述输入接口具体可以包括但不限于USB接口、串行接口、语音输入接口、指纹输入接口、硬盘读取接口等。The input interface is used to obtain the video clip automatic search program imported from the outside world, and save the obtained video clip automatic search program into the memory. It can also be used to obtain various instructions and parameters transmitted by the external terminal equipment, and transmit the into the processor, so that the processor can use the above various instructions and parameters to carry out corresponding processing. In this embodiment, the input interface may specifically include, but is not limited to, a USB interface, a serial interface, a voice input interface, a fingerprint input interface, a hard disk reading interface, and the like.
输出接口,用于将处理器产生的各种数据输出至与其相连的终端设备,以便于与输出接口相连的其他终端设备能够获取到处理器产生的各种数据。本实施例中,所述输出接口具体可以包括但不限于USB接口、串行接口等。The output interface is used to output various data generated by the processor to the terminal equipment connected to it, so that other terminal equipment connected to the output interface can obtain various data generated by the processor. In this embodiment, the output interface may specifically include, but is not limited to, a USB interface, a serial interface, and the like.
通讯单元,用于在视频片段自动搜索装置和外部服务器之间建立远程通讯连接,以便于视频片段自动搜索装置能够将镜像文件挂载到外部服务器中。本实施例中,通讯单元具体可以包括但不限于基于无线通讯技术或有线通讯技术的远程通讯单元。The communication unit is used for establishing a long-distance communication connection between the video clip automatic searching device and the external server, so that the video clip automatic searching device can mount the image file to the external server. In this embodiment, the communication unit may specifically include, but is not limited to, a remote communication unit based on a wireless communication technology or a wired communication technology.
键盘,用于获取用户通过实时敲击键帽而输入的各种参数数据或指令。The keyboard is used to obtain various parameter data or instructions input by the user by hitting the keycap in real time.
显示器,用于运行服务器供电线路短路定位过程的相关信息进行实时显示。The display is used to display the relevant information of the short-circuit positioning process of the power supply line of the server in real time.
鼠标,可以用于协助用户输入数据并简化用户的操作。A mouse can be used to assist the user in entering data and simplify the user's operations.
实施例四:Embodiment 4:
本实施例还公开了一种可读存储介质,这里所说的可读存储介质包括随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动硬盘、CD-ROM或技术领域内所公知的任意其他形式的存储介质。可读存储介质中存储有视频片段自动搜索程序,所述视频片段自动搜索程序被处理器执行时实现以下步骤:This embodiment also discloses a readable storage medium, where the readable storage medium includes random access memory (RAM), internal memory, read only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, Register, hard disk, removable hard disk, CD-ROM, or any other form of storage medium known in the art. The readable storage medium stores a video clip automatic search program, and when the video clip automatic search program is executed by the processor, the following steps are implemented:
1、视频创作者上传视频串流至视频存储装置,并同步通过语音识别AI模型建立串流视频的片段关键字资料库。1. The video creator uploads the video stream to the video storage device, and simultaneously establishes the segment keyword database of the stream video through the voice recognition AI model.
2、将视频搜索引擎分别与视频存储装置和关键字资料库关联。2. Associate the video search engine with the video storage device and the keyword database respectively.
3、观看用户通过客户端登录视频搜索引擎并输入关键字,视频搜索引擎通过关键字从视频存储装置中找到对应的视频片段并推送至客户端。3. The viewing user logs in to the video search engine through the client and inputs a keyword, and the video search engine finds the corresponding video clip from the video storage device through the keyword and pushes it to the client.
4、观看用户通过客户端播放推送的视频片段。4. Watch the video clips that users play through the client.
综上所述,本发明通过自动识别语音、语义的方式,记录视频内容,并实现了自动且低运算资源的视频片段记录与搜索。To sum up, the present invention records video content by automatically recognizing voice and semantics, and realizes automatic recording and searching of video clips with low computing resources.
本说明书中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其它实施例的不同之处,各个实施例之间相同或相似部分互相参见即可。对于实施例公开的方法而言,由于其与实施例公开的系统相对应,所以描述的比较简单,相关之处参见方法部分说明即可。The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments, and the same or similar parts between the various embodiments may be referred to each other. For the method disclosed in the embodiment, since it corresponds to the system disclosed in the embodiment, the description is relatively simple, and the relevant part can be referred to the description of the method.
专业人员还可以进一步意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、计算机软件或者二者的结合来实现,为了清楚地说明硬件和软件的可互换性,在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明的范围。Professionals may further realize that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, computer software, or a combination of the two, in order to clearly illustrate the possibilities of hardware and software. Interchangeability, the above description has generally described the components and steps of each example in terms of functionality. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementations should not be considered beyond the scope of the present invention.
在本发明所提供的几个实施例中,应该理解到,所揭露的系统、系统和方法,可以通过其它的方式实现。例如,以上所描述的系统实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,系统或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided by the present invention, it should be understood that the disclosed systems, systems and methods may be implemented in other manners. For example, the system embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of systems or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
另外,在本发明各个实施例中的各功能模块可以集成在一个处理单元中,也可以是各个模块单独物理存在,也可以两个或两个以上模块集成在一个单元中。In addition, each functional module in each embodiment of the present invention may be integrated into one processing unit, or each module may exist physically alone, or two or more modules may be integrated into one unit.
同理,在本发明各个实施例中的各处理单元可以集成在一个功能模块中,也可以是各个处理单元物理存在,也可以两个或两个以上处理单元集成在一个功能模块中。Similarly, each processing unit in each embodiment of the present invention may be integrated into one functional module, or each processing unit may exist physically, or two or more processing units may be integrated into one functional module.
结合本文中所公开的实施例描述的方法或算法的步骤可以直接用硬件、处理器执行的软件模块,或者二者的结合来实施。软件模块可以置于随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动磁盘、CD-ROM、或技术领域内所公知的任意其它形式的存储介质中。The steps of a method or algorithm described in conjunction with the embodiments disclosed herein may be directly implemented in hardware, a software module executed by a processor, or a combination of the two. A software module can be placed in random access memory (RAM), internal memory, read only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or any other in the art. in any other known form of storage medium.
最后,还需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。Finally, it should also be noted that in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply these entities or there is any such actual relationship or sequence between operations. Moreover, the terms "comprising", "comprising" or any other variation thereof are intended to encompass a non-exclusive inclusion such that a process, method, article or device comprising a list of elements includes not only those elements, but also includes not explicitly listed or other elements inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element.
以上对本发明所提供的视频片段自动搜索方法、系统、装置及可读存储介质进行了详细介绍。本文中应用了具体个例对本发明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法及其核心思想。应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以对本发明进行若干改进和修饰,这些改进和修饰也落入本发明权利要求的保护范围内。The automatic search method, system, device and readable storage medium for video clips provided by the present invention have been described in detail above. The principles and implementations of the present invention are described herein by using specific examples, and the descriptions of the above embodiments are only used to help understand the method and the core idea of the present invention. It should be pointed out that for those skilled in the art, without departing from the principle of the present invention, several improvements and modifications can also be made to the present invention, and these improvements and modifications also fall within the protection scope of the claims of the present invention.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CN202210108784.4A CN114564614A (en) | 2022-01-28 | 2022-01-28 | A kind of video clip automatic search method, system, device and readable storage medium | 
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title | 
|---|---|---|---|
| CN202210108784.4A CN114564614A (en) | 2022-01-28 | 2022-01-28 | A kind of video clip automatic search method, system, device and readable storage medium | 
Publications (1)
| Publication Number | Publication Date | 
|---|---|
| CN114564614A true CN114564614A (en) | 2022-05-31 | 
Family
ID=81714100
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date | 
|---|---|---|---|
| CN202210108784.4A Pending CN114564614A (en) | 2022-01-28 | 2022-01-28 | A kind of video clip automatic search method, system, device and readable storage medium | 
Country Status (1)
| Country | Link | 
|---|---|
| CN (1) | CN114564614A (en) | 
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title | 
|---|---|---|---|---|
| CN108401189A (en) * | 2018-03-16 | 2018-08-14 | 百度在线网络技术(北京)有限公司 | A kind of method, apparatus and server of search video | 
- 
        2022
        - 2022-01-28 CN CN202210108784.4A patent/CN114564614A/en active Pending
 
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title | 
|---|---|---|---|---|
| CN108401189A (en) * | 2018-03-16 | 2018-08-14 | 百度在线网络技术(北京)有限公司 | A kind of method, apparatus and server of search video | 
Similar Documents
| Publication | Publication Date | Title | 
|---|---|---|
| US7653925B2 (en) | Techniques for receiving information during multimedia presentations and communicating the information | |
| EP2901631B1 (en) | Enriching broadcast media related electronic messaging | |
| US9633696B1 (en) | Systems and methods for automatically synchronizing media to derived content | |
| US8103507B2 (en) | Searchable multimedia stream | |
| US8930308B1 (en) | Methods and systems of associating metadata with media | |
| US20080097970A1 (en) | Intelligent Video Summaries in Information Access | |
| CN1723458A (en) | Method and system for utilizing video content to derive textual keywords or phrases to provide content-related links to web-based resources | |
| JP2003157288A (en) | Information association method, terminal device, server device, program | |
| CN111279333B (en) | Language-based search of digital content in a network | |
| JP2015090716A (en) | Video program recommendation method and server | |
| CN100501738C (en) | A search method, system and device for playable media files | |
| CN112383790A (en) | Live broadcast screen recording method and device, electronic equipment and storage medium | |
| CN104731938A (en) | Video searching method and device | |
| CN111629267B (en) | Audio labeling method, device, equipment and computer readable storage medium | |
| WO2021115348A1 (en) | Cloud storage system providing preview function, and preview method and apparatus | |
| KR100916310B1 (en) | Cross recommendation system and method between music and video based on audio signal processing | |
| TW201225669A (en) | System and method for synchronizing with multimedia broadcast program and computer program product thereof | |
| CN113407775B (en) | Video searching method and device and electronic equipment | |
| CN111741333B (en) | Live broadcast data acquisition method, device, computer equipment and storage medium | |
| WO2021101024A1 (en) | Cloud-based video virtual studio service system | |
| CN117493606A (en) | Video retrieval method, device, system, electronic equipment and storage medium | |
| CN114564614A (en) | A kind of video clip automatic search method, system, device and readable storage medium | |
| CN111177463A (en) | Conference record searching method and device, electronic equipment and medium | |
| Castro et al. | Managing Context for Internet Videoconferences: The Multimedia Internet Recorder and Archive | |
| KR101409019B1 (en) | Content creation method and apparatus | 
Legal Events
| Date | Code | Title | Description | 
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | Application publication date: 20220531 |