CN104462262B - Method, device and browser client for implementing voice search - Google Patents
- Publication number
- CN104462262B (application CN201410676462.5A)
- Authority
- CN
- China
- Prior art keywords
- voice
- search
- user
- unit
- server
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Description
技术领域 Technical Field
本发明涉及互联网技术领域,具体涉及一种实现语音搜索的方法、装置和浏览器客户端。The invention relates to the technical field of the Internet, in particular to a method, device and browser client for realizing voice search.
背景技术 Background
移动互联网技术的迅速发展,使得用户的搜索行为可以发生在各种不同的环境和场所,在不同的环境和场所中,用户会产生不同的搜索需求。With the rapid development of mobile Internet technology, users' search behavior can occur in various environments and places, and users will have different search needs in different environments and places.
传统的搜索方式是基于用户手动输入文字的搜索过程,人机交互方式往往是键盘输入或者手写输入,其操作过程繁琐,用户的输入体验差;并且在实施过程中存在很大的局限性,在用户无法进行手动输入的场景下,该搜索方案完全失效,例如,用户在开车时无法进行搜索,盲人用户无法进行搜索等。因此,传统的搜索方式根本无法满足当前用户在各种场景下想要进行搜索的需求。The traditional search method is based on the search process of users manually inputting text. The human-computer interaction method is often keyboard input or handwriting input. The operation process is cumbersome and the user's input experience is poor; and there are great limitations in the implementation process. In scenarios where the user cannot perform manual input, the search solution is completely ineffective. For example, the user cannot search while driving, and blind users cannot perform search. Therefore, traditional search methods cannot meet the needs of current users who want to search in various scenarios.
发明内容 Summary of the Invention
鉴于上述问题,提出了本发明以便提供一种克服上述问题或者至少部分地解决上述问题的一种实现语音搜索的方法、装置和浏览器客户端。In view of the above problems, the present invention is proposed to provide a method, device and browser client for implementing voice search that overcome the above problems or at least partially solve the above problems.
依据本发明的一个方面,提供了一种实现语音搜索的方法,该方法包括:According to one aspect of the present invention, a method for implementing voice search is provided, the method comprising:
获取语音搜索用户输入的语音信号;Obtain the voice signal input by the voice search user;
将所述语音信号发送到语音转换服务器;Send the voice signal to a voice conversion server;
接收语音转换服务器对所述语音信号进行识别后返回的相应的字符信息;Receive the corresponding character information returned by the voice conversion server after it recognizes the voice signal;
以所述字符信息作为搜索词,向搜索服务器发送搜索请求;Using the character information as the search term, send a search request to a search server;
接收和加载显示所述搜索服务器返回的搜索结果。Receive, load and display the search results returned by the search server.
可选地,该方法进一步包括:Optionally, the method further includes:
获取所述语音搜索用户的标签信息;Obtain tag information of the voice search user;
根据该标签信息从语音转换服务器获取所述语音搜索用户的好友分享的与该标签信息匹配的经语音识别后的一个或多个关键字;According to the tag information, obtain from the voice conversion server one or more speech-recognized keywords that were shared by friends of the voice search user and that match the tag information;
当为一个关键字时以该关键字作为搜索词,当为多个关键字时从中选择一个关键字为搜索词,向搜索服务器发送搜索请求;When there is one keyword, use it as the search term; when there are multiple keywords, select one of them as the search term; then send a search request to the search server;
接收和加载显示所述搜索服务器返回的搜索结果。Receive, load and display the search results returned by the search server.
可选地,该方法进一步包括:Optionally, the method further includes:
根据所述语音搜索用户触发的指令,获取所述语音搜索用户输入的语音信号;Acquiring a voice signal input by the voice search user according to an instruction triggered by the voice search user;
将该语音信号与所述语音搜索用户的标签信息一起上传到语音转换服务器,供所述语音转换服务器将该语音信号识别成文字形式的关键字后,将该语音信号、识别出的关键字以及所述语音搜索用户的标签信息对应保存成一个分享信息条目。Upload the voice signal to the voice conversion server together with the tag information of the voice search user, so that the voice conversion server, after recognizing the voice signal as keywords in text form, saves the voice signal, the recognized keywords and the tag information of the voice search user together as one shared information item.
可选地,该方法进一步包括:Optionally, the method further includes:
从所述语音转换服务器下载与所述语音搜索用户的标签信息匹配的分享信息条目,并保存在本地。The shared information item matching the tag information of the voice search user is downloaded from the voice conversion server and stored locally.
可选地,该方法进一步包括:Optionally, the method further includes:
当识别所述字符信息为符合特定格式的号码数据时,展示该号码数据的相应操作的加载入口。When the character information is identified as number data conforming to a specific format, display an entry for loading the operation corresponding to that number data.
可选地,Optionally,
当识别所述字符信息为符合特定格式的号码数据时,还加载搜索服务器返回的该号码数据对应的电子商务网页信息,并提供与该电子商务网页进行交互操作的入口。When the character information is identified as number data conforming to a specific format, the e-commerce web page information corresponding to the number data returned by the search server is also loaded, and an entry for interactive operation with the e-commerce web page is provided.
可选地,该方法进一步包括:Optionally, the method further includes:
获取移动终端的通话录音数据,对所述通话录音数据进行编辑得到用于进行语音搜索的语音信号。The call recording data of the mobile terminal is obtained, and the call recording data is edited to obtain a voice signal for voice search.
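The source does not say what "editing" the call recording involves. As one possible reading, the sketch below trims a recording to a time window so that only the relevant phrase is sent for recognition. It assumes uncompressed mono PCM audio; `trim_pcm` and its parameters are hypothetical names, not part of the patent.

```python
def trim_pcm(pcm: bytes, sample_rate: int, sample_width: int,
             start_s: float, end_s: float) -> bytes:
    """Return the part of a mono PCM recording between start_s and end_s.

    sample_rate is in samples per second, sample_width in bytes per sample.
    """
    if not 0 <= start_s <= end_s:
        raise ValueError("expected 0 <= start_s <= end_s")
    # Convert times to byte offsets aligned on whole samples.
    start = int(start_s * sample_rate) * sample_width
    end = int(end_s * sample_rate) * sample_width
    return pcm[start:end]


# Toy recording: 3 seconds at 4 samples per second, 2 bytes per sample.
recording = bytes(range(24))
clip = trim_pcm(recording, sample_rate=4, sample_width=2, start_s=1.0, end_s=2.0)
```

Here `clip` holds the second second of the toy recording and would be used as the voice signal for the search.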
可选地,该方法进一步包括:Optionally, the method further includes:
添加语音识别模块,对所述语音搜索用户的自身的语音进行识别后记录,并标记为所述语音搜索用户的语音数据,以及对于好友分享或发送来的语音数据进行手动识别或进行机器学习识别后记录,并标记为相应好友的语音数据;Add a voice recognition module that recognizes and records the voice search user's own voice and marks it as the voice data of the voice search user, and that records voice data shared or sent by friends after manual recognition or machine-learning recognition and marks it as the voice data of the corresponding friend;
对所述语音搜索用户的语音数据或其好友的语音数据进行编辑,得到用于进行语音搜索的语音信号。Edit the voice data of the voice search user or of their friends to obtain a voice signal for voice search.
可选地,该方法进一步包括:对所述字符信息进行语义识别,确定所述字符信息作为搜索词的分类;Optionally, the method further includes: performing semantic recognition on the character information, and determining the classification of the character information as a search term;
所述向搜索服务器发送搜索请求包括:向相应分类的搜索服务器发送搜索请求。Sending the search request to the search server includes: sending the search request to the search server of the corresponding category.
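The classification step can be illustrated with a small routing sketch. The patent does not describe how semantic recognition works, so a plain keyword lookup stands in for it here; the categories, hint words and server names are all invented for illustration.

```python
from typing import Dict, Tuple

# Hypothetical hint words per category; a real system would use semantic recognition.
CATEGORY_HINTS: Dict[str, Tuple[str, ...]] = {
    "video": ("movie", "episode"),
    "music": ("song", "album"),
    "novel": ("novel", "chapter"),
}

# Hypothetical names of the category-specific search servers.
CATEGORY_SERVERS: Dict[str, str] = {
    "video": "video-search",
    "music": "music-search",
    "novel": "novel-search",
    "general": "web-search",
}


def classify(search_term: str) -> str:
    """Return the category of a search term, or 'general' if no hint matches."""
    lowered = search_term.lower()
    for category, hints in CATEGORY_HINTS.items():
        if any(hint in lowered for hint in hints):
            return category
    return "general"


def route(search_term: str) -> str:
    """Pick the search server that should receive the request."""
    return CATEGORY_SERVERS[classify(search_term)]
```

With these tables, `route("a martial arts novel by Jin Yong")` selects the novel server, and a term with no hint word falls back to general web search.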
可选地,所述获取输入的语音信号包括:Optionally, said acquiring the input voice signal includes:
在浏览器客户端的首页中显示语音搜索图标;当该语音搜索图标被点击时,跳转到包含语音搜索控件的语音搜索页面;Display the voice search icon on the home page of the browser client; when the voice search icon is clicked, jump to the voice search page containing the voice search control;
响应于所述语音搜索控件被启动的事件,利用浏览器客户端所在移动设备的麦克风开始捕获语音信号;In response to the event that the voice search control is started, utilize the microphone of the mobile device where the browser client is located to start capturing voice signals;
响应于所述语音搜索控件被关闭的事件,停止捕获语音信号,获得输入的语音信号。In response to the event that the voice search control is turned off, stop capturing voice signals and obtain input voice signals.
可选地,Optionally,
在浏览器客户端的首页中显示语音搜索图标包括:在浏览器客户端的地址栏的尾部显示麦克风图标;Displaying the voice search icon on the home page of the browser client includes: displaying a microphone icon at the end of the address bar of the browser client;
所述语音搜索控件被启动的事件为:所述语音搜索控件开始被触摸;The event that the voice search control is started is: the voice search control starts to be touched;
所述语音搜索控件被关闭的事件为:所述语音搜索控件结束被触摸。The event that the voice search control is turned off is: the voice search control ends being touched.
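The press-and-hold behaviour above can be sketched as a small state holder: capture starts when the touch starts and the collected audio is returned when the touch ends. The class and method names are hypothetical, and microphone access is replaced by an `on_audio_chunk` callback that the platform's audio layer would be expected to call.

```python
from typing import List


class HoldToTalkRecorder:
    """Collects audio only while the voice search control is being touched."""

    def __init__(self) -> None:
        self._recording = False
        self._chunks: List[bytes] = []

    def on_touch_start(self) -> None:
        # Control "started": discard old audio and begin capturing.
        self._chunks = []
        self._recording = True

    def on_audio_chunk(self, chunk: bytes) -> None:
        # Audio arriving outside a touch is ignored.
        if self._recording:
            self._chunks.append(chunk)

    def on_touch_end(self) -> bytes:
        # Control "closed": stop capturing and hand back the input signal.
        self._recording = False
        return b"".join(self._chunks)
```

The bytes returned by `on_touch_end` are the voice signal that would then be sent to the voice conversion server.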
可选地,该方法进一步包括:Optionally, the method further includes:
在所述语音搜索页面上展示N个热点搜索词;其中,所述N个热点搜索词是在一个统计周期内的统计得到的搜索排名靠前的N个搜索词,N为自然数;N hot search words are displayed on the voice search page; wherein, the N hot search words are the top N search words obtained by statistics within a statistical period, and N is a natural number;
或者,在所述语音搜索页面上展示搜索提示信息,用于提示用户如何进行语音搜索。Alternatively, search prompt information is displayed on the voice search page to prompt the user how to perform voice search.
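As a sketch of how the N hot search words for one statistical period could be computed, the function below counts the terms in a search log that fall inside the period and keeps the N most frequent. The log format (timestamp, term) and the function name are assumptions; the patent does not specify where the statistics come from.

```python
from collections import Counter
from datetime import datetime, timedelta
from typing import Iterable, List, Tuple


def hot_search_terms(log: Iterable[Tuple[datetime, str]], now: datetime,
                     period: timedelta, n: int) -> List[str]:
    """Return the n most frequent search terms logged within `period` before `now`."""
    start = now - period
    counts = Counter(term for timestamp, term in log if start <= timestamp <= now)
    # most_common sorts by descending count.
    return [term for term, _count in counts.most_common(n)]
```

Terms searched before the start of the period do not count, however popular they once were.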
可选地,该方法进一步包括:Optionally, the method further includes:
在所述语音搜索页面中显示页面关闭控件,当该页面关闭控件被触发时,关闭所述语音搜索页面,返回到浏览器客户端的首页。A page close control is displayed on the voice search page, and when the page close control is triggered, the voice search page is closed and the home page of the browser client is returned.
可选地,该方法进一步包括:Optionally, the method further includes:
响应于所述语音搜索页面上的热点搜索词被点击的事件,跳转到该热点搜索词对应的搜索结果页面。In response to an event that a hot search word on the voice search page is clicked, jump to a search result page corresponding to the hot search word.
可选地,所述接收语音转换服务器对所述语音信号进行识别后返回的相应的字符信息包括:接收语音转换服务器对所述语音信号进行识别后返回的相应的多个字符信息;Optionally, receiving the corresponding character information returned by the voice conversion server after it recognizes the voice signal includes: receiving multiple corresponding pieces of character information returned by the voice conversion server after it recognizes the voice signal;
则该方法进一步包括:展示所述多个字符信息,以供用户进行选择;The method then further includes: displaying the multiple pieces of character information for the user to choose from;
所述以所述字符信息作为搜索词为,以用户选择的字符信息作为搜索词。Using the character information as the search term then means using the character information selected by the user as the search term.
可选地,在将所述语音信号发送到语音转换服务器之前,该方法进一步包括:Optionally, before sending the voice signal to the voice conversion server, the method further includes:
获取移动设备的网络连接状态;Obtain the network connection status of the mobile device;
如果移动设备没有接入网络,或者移动设备接入的是存在问题的网络,则显示表示移动设备的网络连接状态异常的信息;If the mobile device is not connected to the network, or the mobile device is connected to a problematic network, display information indicating that the network connection status of the mobile device is abnormal;
如果移动设备接入的是正常网络,则执行所述将所述语音信号发送到语音转换服务器的步骤。If the mobile device is connected to a normal network, the step of sending the voice signal to the voice conversion server is performed.
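The network check can be sketched as a guard in front of the upload. The three states mirror the cases named above; the enum, the message text and the `send` callback are hypothetical stand-ins for platform facilities that the patent does not name.

```python
from enum import Enum
from typing import Callable, Optional


class NetworkState(Enum):
    NOT_CONNECTED = "not connected"
    PROBLEMATIC = "connected to a problematic network"
    NORMAL = "normal"


NETWORK_ERROR_MESSAGE = "The network connection status is abnormal."


def send_if_network_ok(state: NetworkState, voice_signal: bytes,
                       send: Callable[[bytes], None]) -> Optional[str]:
    """Send the voice signal only on a normal network.

    Returns None when the signal was sent, otherwise the message to display.
    """
    if state is NetworkState.NORMAL:
        send(voice_signal)
        return None
    return NETWORK_ERROR_MESSAGE
```

The caller displays the returned message, if any, instead of proceeding to the recognition step.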
可选地,该方法进一步包括:Optionally, the method further includes:
当接收语音转换服务器对所述语音信号进行识别后返回的表示识别错误的信息时,展示所述表示识别错误的信息。When receiving the information indicating the recognition error returned by the speech conversion server after recognizing the voice signal, the information indicating the recognition error is displayed.
可选地,该方法进一步包括:Optionally, the method further includes:
接收所述语音转换服务器对所述字符信息进行语义识别,并根据识别后的语义返回的指令信息;Receive instruction information that the voice conversion server returns according to the semantics it recognizes by performing semantic recognition on the character information;
根据所述指令信息执行相应的操作。Perform corresponding operations according to the instruction information.
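The patent does not define the format of the instruction information. Assuming, purely for illustration, that the server returns an action name plus an argument, the client side could dispatch it through a table of handlers as sketched below; the field names `action` and `argument` and the handler names are hypothetical.

```python
from typing import Callable, Dict, Mapping


def execute_instruction(instruction: Mapping[str, str],
                        handlers: Dict[str, Callable[[str], None]]) -> bool:
    """Run the handler registered for the instruction's action.

    Returns True if a handler ran, False if the action is unknown.
    """
    handler = handlers.get(instruction.get("action", ""))
    if handler is None:
        return False
    handler(instruction.get("argument", ""))
    return True


# Example: a spoken command recognized by the server as "open the bookmarks page".
opened_pages = []
handlers = {"open_page": opened_pages.append}
handled = execute_instruction({"action": "open_page", "argument": "bookmarks"}, handlers)
```

Returning False for an unknown action lets the client fall back to an ordinary search with the recognized text.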
根据本发明的另一方面,提供了一种实现语音搜索的装置,该装置包括:According to another aspect of the present invention, a device for implementing voice search is provided, the device comprising:
信号获取单元,适于获取语音搜索用户输入的语音信号;A signal acquisition unit adapted to acquire a voice signal input by a voice search user;
语音转换交互单元,适于将所述语音信号发送到语音转换服务器;并且适于接收语音转换服务器对所述语音信号进行识别后返回的相应的字符信息;The voice conversion interaction unit is adapted to send the voice signal to the voice conversion server; and is adapted to receive the corresponding character information returned by the voice conversion server after recognizing the voice signal;
搜索交互单元,适于以所述字符信息作为搜索词,向搜索服务器发送搜索请求;并且适于接收所述搜索服务器返回的搜索结果;The search interaction unit is adapted to use the character information as a search word to send a search request to a search server; and is adapted to receive search results returned by the search server;
展示单元,适于加载显示所述搜索服务器返回的搜索结果。The display unit is adapted to load and display the search results returned by the search server.
可选地,所述信号获取单元,进一步适于获取所述语音搜索用户的标签信息;Optionally, the signal acquisition unit is further adapted to acquire tag information of the voice search user;
所述语音转换交互单元,进一步适于根据该标签信息从语音转换服务器获取所述语音搜索用户的好友分享的与该标签信息匹配的经语音识别后的一个或多个关键字;The voice conversion interaction unit is further adapted to obtain from the voice conversion server according to the tag information one or more speech-recognized keywords shared by friends of the voice search user and matched with the tag information;
所述搜索交互单元,适于当所述语音转换交互单元获取到一个关键字时以该关键字作为搜索词,当所述语音转换交互单元获取到多个关键字时从中选择一个关键字为搜索词,向搜索服务器发送搜索请求;并且适于接收所述搜索服务器返回的搜索结果;The search interaction unit is adapted to use the keyword as the search term when the voice conversion interaction unit obtains one keyword, to select one keyword as the search term when the voice conversion interaction unit obtains multiple keywords, and to send a search request to the search server; and is adapted to receive the search results returned by the search server;
所述展示单元,适于加载显示所述搜索服务器返回的搜索结果。The display unit is adapted to load and display the search results returned by the search server.
可选地,所述信号获取单元,进一步适于根据所述语音搜索用户触发的指令,获取所述语音搜索用户输入的语音信号;Optionally, the signal acquisition unit is further adapted to acquire a voice signal input by the voice search user according to an instruction triggered by the voice search user;
所述语音转换交互单元,进一步适于将该语音信号与所述语音搜索用户的标签信息一起上传到语音转换服务器,供所述语音转换服务器将该语音信号识别成文字形式的关键字后,将该语音信息、识别出的关键字以及所述语音搜索用户的标签信息对应保存成一个分享信息条目。The voice conversion interaction unit is further adapted to upload the voice signal to the voice conversion server together with the tag information of the voice search user, so that the voice conversion server, after recognizing the voice signal as keywords in text form, saves the voice information, the recognized keywords and the tag information of the voice search user together as one shared information item.
可选地,所述语音转换交互单元,进一步适于从所述语音转换服务器下载与所述语音搜索用户的标签信息匹配的分享信息条目,并保存在本地。Optionally, the voice conversion interaction unit is further adapted to download the shared information item matching the voice search user's tag information from the voice conversion server, and save it locally.
可选地,所述展示单元,进一步适于当所述搜索交互单元识别所述字符信息为符合特定格式的号码数据时,展示该号码数据的相应操作的加载入口。Optionally, the display unit is further adapted to display a corresponding operation loading entry for the number data when the search interaction unit recognizes that the character information is number data conforming to a specific format.
可选地,所述展示单元,还适于当所述搜索交互单元识别所述字符信息为符合特定格式的号码数据时,加载搜索服务器返回的该号码数据对应的电子商务网页信息,并提供与该电子商务网页进行交互操作的入口。Optionally, the display unit is further adapted, when the search interaction unit identifies the character information as number data conforming to a specific format, to load the e-commerce web page information corresponding to that number data returned by the search server, and to provide an entry for interacting with that e-commerce web page.
可选地,所述信号获取单元,进一步适于获取移动终端的通话录音数据,对所述通话录音数据进行编辑得到用于进行语音搜索的语音信号。Optionally, the signal acquisition unit is further adapted to acquire call recording data of the mobile terminal, and edit the call recording data to obtain a voice signal for voice search.
可选地,所述信号获取单元,进一步适于添加语音识别模块,对所述语音搜索用户的自身的语音进行识别后记录,并标记为所述语音搜索用户的语音数据,以及对于好友分享或发送来的语音数据进行手动识别或进行机器学习识别后记录,并标记为相应好友的语音数据;并且适于对所述语音搜索用户的语音数据或其好友的语音数据进行编辑,得到用于进行语音搜索的语音信号。Optionally, the signal acquisition unit is further adapted to add a voice recognition module that recognizes and records the voice search user's own voice and marks it as the voice data of the voice search user, and that records voice data shared or sent by friends after manual recognition or machine-learning recognition and marks it as the voice data of the corresponding friend; and is adapted to edit the voice data of the voice search user or of their friends to obtain a voice signal for voice search.
可选地,所述搜索交互单元,进一步适于对所述字符信息进行语义识别,确定所述字符信息作为搜索词的分类;并且适于向相应分类的搜索服务器发送搜索请求。Optionally, the search interaction unit is further adapted to perform semantic recognition on the character information, to determine the classification of the character information as a search term; and to send a search request to a search server of a corresponding classification.
可选地,所述信号获取单元包括:语音搜索跳转单元和语音获取控制单元;Optionally, the signal acquisition unit includes: a voice search jump unit and a voice acquisition control unit;
所述语音搜索跳转单元,适于在浏览器客户端的首页中显示语音搜索图标;当该语音搜索图标被点击时,跳转到包含语音搜索控件的语音搜索页面;The voice search jump unit is suitable for displaying a voice search icon on the homepage of the browser client; when the voice search icon is clicked, it jumps to a voice search page containing a voice search control;
所述语音获取控制单元,适于响应于所述语音搜索控件被启动的事件,利用浏览器客户端所在移动设备的麦克风开始捕获语音信号;并且适于响应于所述语音搜索控件被关闭的事件,停止捕获语音信号,获得输入的语音信号。The voice acquisition control unit is adapted to respond to the event that the voice search control is started, using the microphone of the mobile device where the browser client is located to start capturing voice signals; and is adapted to respond to the event that the voice search control is turned off , stop capturing the voice signal and get the input voice signal.
可选地,所述语音搜索跳转单元,适于在浏览器客户端的地址栏的尾部显示麦克风图标;Optionally, the voice search jump unit is adapted to display a microphone icon at the end of the address bar of the browser client;
所述语音获取控制单元,适于响应于所述语音搜索控件开始被触摸,利用浏览器客户端所在移动设备的麦克风开始捕获语音信号;并且适于响应于所述语音搜索控件结束被触摸,停止捕获语音信号,获得输入的语音信号。The voice acquisition control unit is adapted, in response to the voice search control starting to be touched, to start capturing a voice signal with the microphone of the mobile device on which the browser client runs; and is adapted, in response to the touch on the voice search control ending, to stop capturing the voice signal and obtain the input voice signal.
可选地,所述信号获取单元进一步包括:语音搜索提示单元;Optionally, the signal acquisition unit further includes: a voice search prompt unit;
所述语音搜索提示单元,适于在所述语音搜索页面上展示N个热点搜索词;其中,所述N个热点搜索词是在一个统计周期内的统计得到的搜索排名靠前的N个搜索词,N为自然数;或者,适于在所述语音搜索页面上展示搜索提示信息,用于提示用户如何进行语音搜索。The voice search prompt unit is adapted to display N hot search words on the voice search page, where the N hot search words are the N top-ranked search words according to statistics collected within one statistical period and N is a natural number; or is adapted to display search prompt information on the voice search page to show the user how to perform a voice search.
可选地,所述语音搜索跳转单元,进一步适于在所述语音搜索页面中显示页面关闭控件,当该页面关闭控件被触发时,关闭所述语音搜索页面,返回到浏览器客户端的首页。Optionally, the voice search jump unit is further adapted to display a page close control on the voice search page and, when the page close control is triggered, to close the voice search page and return to the home page of the browser client.
可选地,所述搜索交互单元,进一步适于响应于所述语音搜索页面上的热点搜索词被点击的事件,跳转到该热点搜索词对应的搜索结果页面;Optionally, the search interaction unit is further adapted to jump to the search result page corresponding to the hot search word in response to the event that the hot search word on the voice search page is clicked;
所述展示单元,进一步适于加载显示该热点搜索词对应的搜索结果页面。The display unit is further adapted to load and display a search result page corresponding to the hot search word.
可选地,所述语音转换交互单元,适于接收语音转换服务器对所述语音信号进行识别后返回的相应的多个字符信息;Optionally, the voice conversion interaction unit is adapted to receive the corresponding multiple character information returned by the voice conversion server after recognizing the voice signal;
所述展示单元,进一步适于展示所述多个字符信息,以供用户进行选择;The display unit is further adapted to display the plurality of character information for selection by the user;
所述搜索交互单元,适于以用户选择的字符信息作为搜索词,向搜索服务器发送搜索请求。The search interaction unit is adapted to send a search request to a search server using character information selected by a user as a search word.
可选地,该装置进一步包括:网络检测单元;Optionally, the device further includes: a network detection unit;
所述网络检测单元,适于在所述语音转换交互单元在将所述语音信号发送到语音转换服务器之前,获取移动设备的网络连接状态;The network detection unit is adapted to obtain the network connection status of the mobile device before the voice conversion interaction unit sends the voice signal to the voice conversion server;
所述语音转换交互单元,进一步适于在所述网络检测单元检测到移动设备接入正常网络时,将所述语音信号发送到语音转换服务器;The voice conversion interaction unit is further adapted to send the voice signal to a voice conversion server when the network detection unit detects that the mobile device is connected to a normal network;
所述展示单元,进一步适于在所述网络检测单元检测到移动设备接入存在问题的网络时,显示表示移动设备的网络连接状态异常的消息。The display unit is further adapted to display a message indicating that the network connection status of the mobile device is abnormal when the network detection unit detects that the mobile device has access to a problematic network.
可选地,所述展示单元,进一步适于当所述语音转换交互单元接收到语音转换服务器对所述语音信号进行识别后返回的表示识别错误的信息时,展示所述表示识别错误的信息。Optionally, the presentation unit is further adapted to display the information indicating a recognition error when the voice conversion interaction unit receives information indicating a recognition error returned by a voice conversion server after recognizing the voice signal.
可选地,该装置进一步包括:指令执行单元;Optionally, the device further includes: an instruction execution unit;
所述语音转换交互单元,进一步适于接收所述语音转换服务器对所述字符信息进行语义识别,并根据识别后的语义返回的指令信息;The voice conversion interaction unit is further adapted to receive instruction information that the voice conversion server returns according to the semantics it recognizes by performing semantic recognition on the character information;
所述指令执行单元,适于根据所述指令信息执行相应的操作。The instruction executing unit is adapted to execute corresponding operations according to the instruction information.
依据本发明的又一个方面,提供了一种实现语音搜索的浏览器客户端,包括如上述任一项中所述的实现语音搜索的装置。According to yet another aspect of the present invention, a browser client for implementing voice search is provided, including the device for implementing voice search as described in any one of the above items.
由上述可知,本发明提供的技术方案通过与语音转换服务器的交互以及通过与搜索服务器的交互,完成了对用户输入的语音信号对应的字符信息进行搜索,并向用户展示搜索结果的过程,实现了以语音信号为载体的完整、有效的语音搜索方案,与传统的手动文字搜索相比,简化了操作流程,提高了搜索效率,使得搜索更具场景化和个性化,给用户提供了更为便捷、完美的体验。As can be seen from the above, the technical solution provided by the present invention, through interaction with the voice conversion server and with the search server, searches for the character information corresponding to the voice signal input by the user and displays the search results to the user. This realizes a complete and effective voice search scheme that uses the voice signal as the carrier. Compared with traditional manual text search, it simplifies the operation flow, improves search efficiency, makes search more scenario-aware and personalized, and gives the user a more convenient and better experience.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solution of the present invention. So that the technical means of the present invention can be understood more clearly and implemented according to the contents of the specification, and so that the above and other objects, features and advantages of the present invention are more apparent, specific embodiments of the present invention are set out below.
附图说明 Brief Description of the Drawings
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for the purpose of illustrating the preferred embodiments and are not to be considered as limiting the invention. Throughout the drawings, the same reference symbols designate the same parts. In the drawings:
图1示出了根据本发明一个实施例的一种实现语音搜索的方法的流程图;FIG. 1 shows a flowchart of a method for voice search according to an embodiment of the present invention;
图2示出了根据本发明另一个实施例的一种实现语音搜索的方法的流程图;FIG. 2 shows a flow chart of a method for voice search according to another embodiment of the present invention;
图3示出了根据本发明一个实施例的一种实现语音搜索的装置的示意图;Fig. 3 shows a schematic diagram of a device for implementing voice search according to an embodiment of the present invention;
图4示出了根据本发明一个实施例的一种实现语音搜索的装置的信号获取单元的示意图;FIG. 4 shows a schematic diagram of a signal acquisition unit of a device for realizing voice search according to an embodiment of the present invention;
图5示出了根据本发明另一个实施例的一种实现语音搜索的装置的信号获取单元的示意图;FIG. 5 shows a schematic diagram of a signal acquisition unit of a device for realizing voice search according to another embodiment of the present invention;
图6示出了根据本发明另一个实施例的一种实现语音搜索的装置的示意图;Fig. 6 shows a schematic diagram of a device for implementing voice search according to another embodiment of the present invention;
图7示出了根据本发明又一个实施例的一种实现语音搜索的装置的示意图;Fig. 7 shows a schematic diagram of a device for implementing voice search according to yet another embodiment of the present invention;
图8A示出了根据本发明一个实施例的浏览器客户端的首页的界面示意图;FIG. 8A shows a schematic interface diagram of a home page of a browser client according to an embodiment of the present invention;
图8B示出了根据本发明一个实施例的语音搜索页面的界面示意图。Fig. 8B shows a schematic interface of a voice search page according to an embodiment of the present invention.
具体实施方式 Detailed Description
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.
图1示出了根据本发明一个实施例的一种实现语音搜索的方法的流程图。如图1所示,该方法包括:Fig. 1 shows a flowchart of a method for voice search according to an embodiment of the present invention. As shown in Figure 1, the method includes:
步骤S110,获取语音搜索用户输入的语音信号。Step S110, acquiring a voice signal input by a voice search user.
步骤S120,将语音信号发送到语音转换服务器。Step S120, sending the voice signal to the voice conversion server.
步骤S130,接收语音转换服务器对语音信号进行识别后返回的相应的字符信息。Step S130, receiving the corresponding character information returned by the speech conversion server after recognizing the speech signal.
上述步骤S120和步骤S130完成了本地与语音转换服务器的交互过程。Steps S120 and S130 above complete the interaction between the local client and the voice conversion server.
步骤S140,以字符信息作为搜索词,向搜索服务器发送搜索请求。Step S140, sending a search request to the search server with the character information as the search word.
步骤S150,接收和加载显示搜索服务器返回的搜索结果。Step S150, receiving, loading and displaying the search results returned by the search server.
上述步骤S140和步骤S150完成了本地与搜索服务器的交互过程。Steps S140 and S150 above complete the interaction between the local client and the search server.
可见,图1所示的方法通过与语音转换服务器的交互以及通过与搜索服务器的交互,完成了对用户输入的语音信号对应的字符信息进行搜索,并向用户展示搜索结果的过程,实现了以语音信号为载体的完整、有效的语音搜索方案,与传统的手动文字搜索相比,简化了操作流程,提高了搜索效率,给用户提供了更为便捷、完美的体验。It can be seen that the method shown in FIG. 1, through interaction with the voice conversion server and with the search server, searches for the character information corresponding to the voice signal input by the user and displays the search results to the user. This realizes a complete and effective voice search scheme that uses the voice signal as the carrier. Compared with traditional manual text search, it simplifies the operation flow, improves search efficiency, and gives the user a more convenient and better experience.
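The flow of FIG. 1 can be sketched as one function that chains the two server interactions. The two servers are passed in as callables because the patent does not define their interfaces; `recognize`, `search` and the two stand-in functions below are assumptions made only for this illustration.

```python
from typing import Callable, List


def voice_search(voice_signal: bytes,
                 recognize: Callable[[bytes], str],
                 search: Callable[[str], List[str]]) -> List[str]:
    """Run steps S120 to S150 for a voice signal already captured in step S110."""
    # S120/S130: send the signal to the voice conversion server and receive text.
    character_info = recognize(voice_signal)
    # S140/S150: use the text as the search term and receive the results to display.
    return search(character_info)


# Stand-ins for the two servers so the sketch runs without a network.
def fake_recognize(signal: bytes) -> str:
    return signal.decode("utf-8")


def fake_search(term: str) -> List[str]:
    return [f"result for {term}"]


results = voice_search("weather".encode("utf-8"), fake_recognize, fake_search)
```

Keeping the two interactions behind separate callables mirrors the split between the voice conversion interaction (S120, S130) and the search interaction (S140, S150).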
图2示出了根据本发明另一个实施例的一种实现语音搜索的方法的流程图。如图2所示,该方法包括:Fig. 2 shows a flowchart of a method for voice search according to another embodiment of the present invention. As shown in Figure 2, the method includes:
步骤S210,获取语音搜索用户的标签信息。Step S210, acquiring tag information of voice search users.
本步骤中获取的标签信息能够反映语音搜索用户的兴趣、爱好、个性和需求。例如可以对用户的历史浏览数据进行统计分析确定用户的标签信息,例如该标签可以是体育爱好者、网购达人或者英语爱好者等等。The tag information obtained in this step can reflect the interest, hobbies, personality and needs of the voice search user. For example, the user's historical browsing data can be statistically analyzed to determine the user's label information, for example, the label can be a sports fan, an online shopper or an English lover, etc.
步骤S220,根据该标签信息从语音转换服务器获取语音搜索用户的好友分享的与该标签信息匹配的经语音识别后的一个或多个关键字。In step S220, according to the tag information, one or more speech-recognized keywords shared by friends of the voice search user and matched with the tag information are obtained from the voice conversion server.
本步骤中的关键词是指:每个语音搜索用户在进行语音搜索时,其输入的语音信号可以被语音转换服务器识别字符信息,并且对该字符信息提取关键字。在浏览器系统中,注册用户之间可以像社交网站中那样相互加好友,以便分享一些浏览信息。The keywords in this step arise as follows: when a voice search user performs a voice search, the voice conversion server can recognize the input voice signal as character information and extract keywords from that character information. In the browser system, registered users can add one another as friends, as on a social networking site, in order to share browsing information.
步骤S230,当为一个关键字时以该关键字作为搜索词,当为多个关键字时从中选择一个关键字为搜索词,向搜索服务器发送搜索请求。Step S230, if there is one keyword, the keyword is used as the search word; when there are multiple keywords, one keyword is selected as the search word, and a search request is sent to the search server.
本步骤中,如果在步骤S220中获取的是一个关键字,则直接以该关键字为搜索词,如果步骤S220中获取的是多个关键字,根据用户的选择指令选择一个关键字作为搜索词。In this step, if one keyword was obtained in step S220, that keyword is used directly as the search term; if multiple keywords were obtained in step S220, one keyword is selected as the search term according to the user's selection instruction.
步骤S240,接收和加载显示搜索服务器返回的搜索结果。Step S240, receiving, loading and displaying the search results returned by the search server.
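Steps S210 to S230 can be sketched as follows. The patent does not define how a tag "matches" a keyword, so a small hand-written table of related topics stands in for the matching rule; the table, the class and the function names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Set


@dataclass
class SharedEntry:
    owner: str            # user who shared the voice search
    tag: str              # that user's tag information
    keywords: List[str]   # keywords recognized from the shared voice signal


# Hypothetical matching rule: which keywords count as related to a tag.
RELATED_TOPICS: Dict[str, Set[str]] = {
    "literature": {"martial arts novel", "poetry"},
}


def matching_keywords(user_tag: str, friends: Set[str],
                      entries: List[SharedEntry]) -> List[str]:
    """S220: keywords shared by the user's friends that match the user's tag."""
    related = RELATED_TOPICS.get(user_tag, set())
    found: List[str] = []
    for entry in entries:
        if entry.owner not in friends:
            continue
        for keyword in entry.keywords:
            if keyword in related and keyword not in found:
                found.append(keyword)
    return found


def choose_search_term(keywords: List[str], selected: int = 0) -> Optional[str]:
    """S230: a single keyword is used directly; otherwise the user's pick is used."""
    if not keywords:
        return None
    if len(keywords) == 1:
        return keywords[0]
    return keywords[selected]
```

`selected` stands for the index the user picks when several keywords are offered; the returned term is what would be sent to the search server.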
图2所示的方法通过对语音搜索用户的标签信息和其他语音搜索用户分享的语音搜索信息进行匹配,得到符合语音搜索用户的需求、兴趣和个性的语音搜索方案,进一步地,语音搜索用户可以将自己的语音搜索信息进行分享,以便其他好友用户进行匹配,这样就实现了在好友间分享语音搜索信息;其中,语音搜索用户分享的语音搜索信息是在图1所示方法的基础上,通过以下步骤实现的:The method shown in FIG. 2 matches the tag information of the voice search user against the voice search information shared by other voice search users, yielding a voice search scheme that fits the voice search user's needs, interests and personality. Further, the voice search user can share their own voice search information so that friends can match against it, which realizes sharing of voice search information among friends. The voice search information shared by a voice search user is produced, on the basis of the method shown in FIG. 1, through the following steps:
步骤S221,根据语音搜索用户触发的指令,获取语音搜索用户输入的语音信号。Step S221, according to the instruction triggered by the voice search user, the voice signal input by the voice search user is acquired.
步骤S222,将该语音信号与语音搜索用户的标签信息一起上传到语音转换服务器,供语音转换服务器将该语音信号识别成文字形式的关键字后,将该语音信号、识别出的关键字以及语音搜索用户的标签信息对应保存成一个分享信息条目。Step S222, upload the voice signal and the tag information of the voice search user to the voice conversion server, for the voice conversion server to recognize the voice signal into a keyword in text form, and then use the voice signal, the recognized keyword and the voice The search user's tag information is correspondingly saved as a shared information item.
在本步骤中,每个保存在语音转换服务器侧的分享信息目录均如表1所示,包括一一对应的语音信号、识别出的关键词和语音搜索用户的标签信息,用于标识每个语音搜索用户的每次语音搜索。In this step, each shared information item stored on the voice conversion server side has the form shown in Table 1: it contains a voice signal, the recognized keywords and the tag information of the voice search user in one-to-one correspondence, and identifies one voice search by one voice search user.
表1Table 1
例如,用户A的其标签信息为:“小说爱好者”,用户A输入的语音信号为:“推荐一本金庸的武侠小说”,该用户A的标签信息和语音信号被上传到语音转换服务器,且语音转换服务器识别语音信号“推荐一本金庸的武侠小说”的关键词为:“金庸、武侠小说”,此时,语音转换服务器侧保存的分享信息条目如表2所示:For example, the tag information of user A is: "novel lover", the voice signal input by user A is: "recommend a martial arts novel by Jin Yong", the tag information and voice signal of user A are uploaded to the voice conversion server, And the voice conversion server recognizes the keyword of the voice signal "recommend a Jin Yong's martial arts novel" as: "Jin Yong, martial arts novel". At this time, the shared information items saved on the voice conversion server side are shown in Table 2:
Table 2
User B, a friend of user A, likes literature, and user B's tag information is "literature". When user B performs a voice search, user B's tag information "literature" matches the keyword "martial arts novel" shared by user A; "martial arts novel" is then used as the search term, and the search results returned for it are displayed to user B.
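The matching between a user's tag and the keywords shared by friends can be sketched as follows. This is only an illustration under stated assumptions: the description does not say how a tag such as "literature" is judged to match a keyword such as "martial arts novel", so the `RELATED_TOPICS` table, the `SharedEntry` class and both function names are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class SharedEntry:
    """One shared information entry kept on the voice conversion server."""
    voice_signal: str            # stand-in for the stored audio
    keywords: Tuple[str, ...]    # keywords recognized from the audio
    owner_tag: str               # tag information of the sharing user


# Hypothetical tag-to-topic table (an assumption, not part of the patent).
RELATED_TOPICS = {
    "literature": {"martial arts novel", "novel", "poetry"},
    "food": {"braised pork", "recipe"},
}


def matching_keywords(user_tag: str, entries: List[SharedEntry]) -> List[str]:
    """Collect shared keywords that match the searching user's tag."""
    related = RELATED_TOPICS.get(user_tag, set())
    hits: List[str] = []
    for entry in entries:
        for keyword in entry.keywords:
            if (keyword == user_tag or keyword in related) and keyword not in hits:
                hits.append(keyword)
    return hits


def choose_search_term(user_tag: str, entries: List[SharedEntry]) -> Optional[str]:
    """Use the only match, or pick one (here: the first) when several match."""
    hits = matching_keywords(user_tag, entries)
    return hits[0] if hits else None


entry_a = SharedEntry(
    voice_signal="recommend a martial arts novel by Jin Yong",
    keywords=("Jin Yong", "martial arts novel"),
    owner_tag="novel lover",
)
print(choose_search_term("literature", [entry_a]))
```

With the entry shared by user A, user B's tag "literature" selects "martial arts novel" as the search term, mirroring the example above.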
In addition, in some embodiments of the present invention, the voice search user wishes to download the shared information entries that match his or her own tag information to local storage for later viewing, searching and sharing. The method shown in FIG. 2 may therefore further include: downloading from the voice conversion server the shared information entries that match the voice search user's tag information, and saving them locally. Continuing the example above, user B can also download the shared information entry shown in Table 2 from the voice conversion server and save it locally.
In one embodiment of the present invention, the methods shown in FIG. 1 and FIG. 2 further include: when the character information returned by the voice conversion server is recognized as number data conforming to a specific format, displaying an entry point for loading the operation that corresponds to the number data. Number data in a specific format may be, for example, a telephone number, a WeChat public account or a QQ number. If the character information is recognized as a telephone number, the user may want to make a call or send a message, so an entry point for calling or sending a message is displayed to the user. If the character information is recognized as a WeChat public account, the user may want to follow that account, so an entry point for following the account is displayed. If the character information is recognized as a QQ number, the user may want to add that QQ number, so an entry point for adding the QQ number is displayed. Other cases are analogous and are not listed one by one. This scheme further optimizes the voice search process and brings voice search closer to the user's search habits.
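A minimal sketch of this number recognition step is given below. The regular expressions and the action labels are assumptions chosen for illustration; the description does not define what counts as a "specific format", and a production client would need far more careful patterns.

```python
import re
from typing import List, Optional, Tuple

# Illustrative patterns only (assumptions). They are checked in order, so an
# 11-digit mobile number is treated as a phone number rather than a QQ number.
NUMBER_PATTERNS = [
    ("phone", re.compile(r"^(1\d{10}|0\d{2,3}-?\d{7,8}|[48]00-?\d{3}-?\d{4})$")),
    ("wechat_public_account", re.compile(r"^gh_[0-9a-f]{12}$")),
    ("qq", re.compile(r"^[1-9]\d{4,10}$")),
]

# Entry points to show for each kind of number (labels are hypothetical).
ACTIONS = {
    "phone": ["make a call", "send a message"],
    "wechat_public_account": ["follow the account"],
    "qq": ["add the QQ number"],
}


def classify_number(text: str) -> Optional[str]:
    """Return the kind of number the recognized text represents, if any."""
    candidate = text.strip()
    for kind, pattern in NUMBER_PATTERNS:
        if pattern.match(candidate):
            return kind
    return None


def entry_points(text: str) -> Tuple[Optional[str], List[str]]:
    """Return the number kind and the entry points to display for it."""
    kind = classify_number(text)
    return kind, ACTIONS.get(kind, [])


print(entry_points("13800138000"))
print(entry_points("APEC meeting"))
```

Text that matches none of the patterns yields no entry points, in which case the client would simply proceed with an ordinary search.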
In some embodiments of the present invention, a user who performs a voice search on number data in a specific format wants to access web page information related to that number data. In such cases, the method shown in FIG. 1 further includes: when the character information is recognized as number data conforming to a specific format, loading the e-commerce web page information that the search server returns for the number data, and providing an entry point for interacting with that e-commerce web page. For example, when the number is the hotline number of a courier company, an entry point to the courier company's website is provided so that the user can perform operations such as tracking a parcel.
In one embodiment of the present invention, the voice signal acquisition process of the methods shown in FIG. 1 and FIG. 2 may be: acquiring call recording data of the mobile terminal, and editing the call recording data to obtain the voice signal used for voice search.
In one embodiment of the present invention, the voice signal acquisition process of step S110 of the method shown in FIG. 1 may also be: adding a voice recognition module that recognizes and records the voice search user's own voice and marks it as the voice search user's voice data, and that records voice data shared or sent by friends, after manual identification or machine-learning-based identification, and marks it as the corresponding friend's voice data; and editing the voice data of the voice search user or of a friend to obtain the voice signal used for voice search.
In the two embodiments above, speech need not be input live through the microphone; instead, previously stored voice data is edited to obtain the voice signal used for searching. The stored voice data may be call recording data of the mobile terminal, or voice data that has been recognized and labeled as belonging to a particular user.
In one embodiment of the present invention, when searching on the returned character information, the methods shown in FIG. 1 and FIG. 2 may further include: performing semantic recognition on the returned character information to determine the category of the character information as a search term, and then sending the search request to the search server of the corresponding category. For example, if the character information returned by the voice conversion server is "news of the APEC meeting", semantic recognition determines that its category as a search term is news, so "APEC meeting" is used as the search term and the search request is sent to the news search server. Likewise, when the character information is recognized as belonging to a category such as pictures, music, video or maps, the search request is sent to the picture, music, video or map search server respectively; the search process is the same as above and is not repeated here. This scheme makes voice search more targeted and the search results more accurate and better matched to the user's needs.
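The category routing described above can be illustrated with a deliberately simple keyword-cue classifier. This is a stand-in for real semantic recognition, which the description does not specify; the cue words, the filler-word list and the function name `route_query` are all assumptions.

```python
from typing import Tuple

# Cue words that signal a vertical search category (illustrative assumption).
CATEGORY_CUES = {
    "news": {"news"},
    "image": {"picture", "pictures", "photo", "photos"},
    "music": {"song", "songs", "music"},
    "video": {"video", "videos"},
    "map": {"map", "maps"},
}

# Words dropped from the search term once a category has been found.
FILLER_WORDS = {"of", "the", "a", "an", "about", "for"}


def route_query(text: str) -> Tuple[str, str]:
    """Return (category, search term) for the recognized character information."""
    tokens = text.split()
    lowered = [token.lower() for token in tokens]
    for category, cues in CATEGORY_CUES.items():
        if any(token in cues for token in lowered):
            kept = [
                token
                for token, low in zip(tokens, lowered)
                if low not in cues and low not in FILLER_WORDS
            ]
            # Fall back to the full text if nothing but cue words was spoken.
            return category, " ".join(kept) or text
    # No category cue found: use the general web search server.
    return "web", text


print(route_query("News of the APEC meeting"))
```

The returned category would then select which search server receives the request, with the stripped term as the query.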
In one embodiment of the present invention, the process by which the methods shown in FIG. 1 and FIG. 2 acquire the voice signal input by the voice search user is to capture the user's voice live through a microphone, which specifically includes:
Step S111: display a voice search icon on the home page of the browser client; when the voice search icon is clicked, jump to a voice search page that contains a voice search control.
The voice search page in this step is the interface through which the user interacts while performing a voice search.
Step S112: in response to the event that the voice search control is activated, start capturing the voice signal with the microphone of the mobile device on which the browser client runs.
Step S113: in response to the event that the voice search control is deactivated, stop capturing the voice signal and obtain the input voice signal.
In this embodiment, displaying the voice search icon on the home page of the browser client may be: displaying a microphone icon at the end of the address bar of the browser client. The event in step S112 that the voice search control is activated may be: the voice search control starts being touched. The event in step S113 that the voice search control is deactivated is: the voice search control stops being touched.
FIG. 8A shows a schematic diagram of the home page of a browser client according to an embodiment of the present invention, and FIG. 8B shows a schematic diagram of a voice search page according to an embodiment of the present invention. In step S111 a microphone icon is displayed at the end of the address bar, as shown in FIG. 8A. When the microphone icon is clicked, the client jumps to the voice search page shown in FIG. 8B, which contains the voice search control. In this embodiment the voice search control is also a microphone icon. In response to the voice search control starting to be touched, capture of the voice signal begins using the microphone of the mobile device on which the browser client runs; in response to the voice search control ceasing to be touched, capture stops and the input voice signal is obtained.
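The press-and-hold capture of steps S112 and S113 amounts to a small state machine, sketched below. The class and method names are hypothetical, and audio delivery is modelled as plain byte chunks rather than any real microphone API.

```python
from typing import List


class PressToTalkCapture:
    """Collects audio only while the voice search control is being touched."""

    def __init__(self) -> None:
        self._recording = False
        self._chunks: List[bytes] = []

    def on_touch_start(self) -> None:
        """Step S112: the control starts being touched, so capture begins."""
        self._chunks = []
        self._recording = True

    def on_audio_chunk(self, chunk: bytes) -> None:
        """Called by the platform's microphone callback (assumed to exist)."""
        if self._recording:
            self._chunks.append(chunk)

    def on_touch_end(self) -> bytes:
        """Step S113: the touch ends, capture stops, the signal is returned."""
        self._recording = False
        return b"".join(self._chunks)


capture = PressToTalkCapture()
capture.on_audio_chunk(b"ignored")   # arrives before the touch starts
capture.on_touch_start()
capture.on_audio_chunk(b"hel")
capture.on_audio_chunk(b"lo")
signal = capture.on_touch_end()
print(signal)
```

Audio that arrives outside the touch interval is discarded, so the returned signal covers exactly the period during which the control was held.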
In one embodiment of the present invention, step S111 above may further include: displaying N hot search terms on the voice search page, where the N hot search terms are the N top-ranked search terms according to statistics collected within one statistical period, and N is a natural number; or displaying search prompt information on the voice search page to tell the user how to perform a voice search. As shown in FIG. 8B, the voice search prompt unit 530 displays seven hot search terms on the voice search page, such as "What will the weather be like tomorrow" and "How to make braised pork", and uses font size to distinguish the search rankings of the terms.
Given that N hot search terms are displayed on the voice search page as described in the embodiment above, in one embodiment of the present invention, in response to the event that a hot search term on the voice search page is clicked, the client jumps to the search result page corresponding to that hot search term. For example, on the voice search page shown in FIG. 8B, in response to the hot search term "What will the weather be like tomorrow" being clicked, the client jumps to the weather forecast search result page corresponding to that term, and loads and displays the page to the user. This scheme recommends recent hot search terms to the user and jumps to the corresponding search page when the user clicks one of them, which matches the user's search habits.
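One way to derive the N hot search terms and their display sizes from a query log is sketched below. The linear mapping from rank to font size and the pixel bounds are assumptions; the description only states that font size distinguishes the rankings.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def hot_search_terms(
    query_log: Iterable[str], n: int, max_px: int = 28, min_px: int = 14
) -> List[Tuple[str, int]]:
    """Return the n most frequent queries, each paired with a font size.

    The top-ranked term gets max_px and the last gets min_px, with linear
    interpolation in between (an illustrative choice).
    """
    ranked = Counter(query_log).most_common(n)
    if len(ranked) <= 1:
        return [(term, max_px) for term, _ in ranked]
    step = (max_px - min_px) / (len(ranked) - 1)
    return [
        (term, round(max_px - index * step))
        for index, (term, _) in enumerate(ranked)
    ]


# Queries observed during one statistical period (made-up sample data).
log = ["weather tomorrow"] * 3 + ["braised pork"] * 2 + ["movies"]
print(hot_search_terms(log, 3))
```

Each returned term could then be rendered as a clickable element that opens the corresponding search result page.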
In one embodiment of the present invention, step S111 above may further include: displaying a page close control on the voice search page, and, when the page close control is triggered, closing the voice search page and returning to the home page of the browser client. As shown in FIG. 8B, the page close control is displayed in the lower right corner of the voice search page; when it is triggered, the voice search page is closed and the client returns to the browser client home page shown in FIG. 8A.
In one embodiment of the present invention, the external environment or other factors may make the voice signal unclear or ambiguous, which can lead to the following two situations:
In the first situation, the voice conversion server recognizes the voice signal and obtains multiple pieces of character information. That is, in the methods shown in FIG. 1 and FIG. 2, receiving the corresponding character information returned by the voice conversion server after it recognizes the voice signal may be: receiving multiple corresponding pieces of character information returned by the voice conversion server after it recognizes the voice signal. In this situation, in order to continue the search process, the method further includes: displaying the multiple pieces of character information for the user to choose from, and sending a search request to the search server with the character information selected by the user as the search term.
In the second situation, the voice conversion server cannot recognize the voice signal and returns information indicating a recognition error. When the information indicating a recognition error, returned by the voice conversion server after attempting to recognize the voice signal, is received, it is displayed to the user to inform the user that recognition of the voice signal failed for this voice search.
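The two situations can be handled by one small function on the client side, as in the sketch below. The `RecognitionResult` shape and the callback names `choose` and `notify` are assumptions; the description does not define the format of the server's response.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class RecognitionResult:
    """Assumed shape of the voice conversion server's reply."""
    candidates: List[str] = field(default_factory=list)
    error: Optional[str] = None


def resolve_search_term(
    result: RecognitionResult,
    choose: Callable[[List[str]], str],
    notify: Callable[[str], None],
) -> Optional[str]:
    """Turn a recognition result into a search term, or report the failure."""
    if result.error is not None or not result.candidates:
        # Second situation: show the recognition error to the user.
        notify(result.error or "The voice signal could not be recognized.")
        return None
    if len(result.candidates) == 1:
        return result.candidates[0]
    # First situation: several candidates, so let the user pick one.
    return choose(result.candidates)


messages: List[str] = []
term = resolve_search_term(
    RecognitionResult(candidates=["APEC meeting", "a pack meeting"]),
    choose=lambda options: options[0],   # stands in for the user's selection
    notify=messages.append,
)
print(term)
```

A `None` return value signals that no search request should be sent for this attempt.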
In one embodiment of the present invention, after the methods shown in FIG. 1 and FIG. 2 acquire the voice signal input by the voice search user and before the voice signal is sent to the voice conversion server, the methods further include: obtaining the network connection state of the mobile device. If the mobile device is not connected to a network, or is connected to a network that has problems, information indicating that the network connection state of the mobile device is abnormal is displayed, informing the user that the current network connection is abnormal and that voice search cannot be performed. If the mobile device is connected to a normally functioning network, the step of sending the voice signal to the voice conversion server is performed.
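The connectivity check before upload can be expressed as a simple guard, sketched here. How the network state is actually obtained is platform specific and not covered; the `NetworkState` enum and the callback parameters are hypothetical.

```python
from enum import Enum
from typing import Callable


class NetworkState(Enum):
    OFFLINE = "offline"   # the device is not connected to any network
    FAULTY = "faulty"     # connected, but the network has problems
    NORMAL = "normal"


def submit_voice_signal(
    signal: bytes,
    state: NetworkState,
    upload: Callable[[bytes], None],
    show_message: Callable[[str], None],
) -> bool:
    """Upload the voice signal only when the network is usable."""
    if state is not NetworkState.NORMAL:
        show_message("Network connection is abnormal; voice search is unavailable.")
        return False
    upload(signal)
    return True


uploaded = []
shown = []
print(submit_voice_signal(b"audio", NetworkState.OFFLINE, uploaded.append, shown.append))
print(submit_voice_signal(b"audio", NetworkState.NORMAL, uploaded.append, shown.append))
```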
In one embodiment of the present invention, to better support human-computer interaction during voice search, the voice conversion server, after recognizing the voice signal and obtaining the corresponding character information, may further perform semantic recognition on the character information and return instruction information according to the recognized semantics. Correspondingly, the methods shown in FIG. 1 and FIG. 2 further include: receiving the instruction information that the voice conversion server returns after performing semantic recognition on the character information, and performing the corresponding operation according to the instruction information. For example, the voice signal input by the user is "I want to watch a movie". After recognizing the character information corresponding to the voice signal, the voice conversion server returns, according to its semantics, the instruction information "search for popular movies" or "find nearby cinemas". On receiving the instruction information, the client either starts a search on the video search server with "popular movies" as the search term, or starts a search on the map search server with "cinema" as the search term, and displays the search results to the user.
FIG. 3 shows a schematic diagram of a device for implementing voice search according to an embodiment of the present invention. As shown in FIG. 3, the device 300 for implementing voice search includes:
A signal acquisition unit 310, adapted to acquire the voice signal input by the voice search user.
In one embodiment of the present invention, the signal acquisition unit 310 may be the signal acquisition unit 400 shown in FIG. 4 or the signal acquisition unit 500 shown in FIG. 5, both of which are described in detail below.
A voice conversion interaction unit 320, adapted to send the voice signal to the voice conversion server, and adapted to receive the corresponding character information returned by the voice conversion server after it recognizes the voice signal.
This unit interacts with the voice conversion server.
A search interaction unit 330, adapted to send a search request to the search server with the character information as the search term, and adapted to receive the search results returned by the search server.
This unit interacts with the search server.
A display unit 340, adapted to load and display the search results returned by the search server.
It can be seen that the units of the device 300 shown in FIG. 3 cooperate to interact with the voice conversion server and the search server, search on the character information corresponding to the voice signal input by the user, and display the search results to the user. This realizes a complete and effective voice search scheme with the voice signal as its carrier. Compared with conventional manual text search, it simplifies the operation flow, improves search efficiency, and gives the user a more convenient experience.
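How the four units of device 300 cooperate can be sketched as a pipeline of injected callables. The class name and the use of plain functions for each unit are illustrative assumptions; the stubs stand in for the microphone, the voice conversion server and the search server.

```python
from typing import Callable, List


class VoiceSearchDevice:
    """Sketch of device 300: each unit is modelled as an injected callable."""

    def __init__(
        self,
        acquire_signal: Callable[[], bytes],       # signal acquisition unit 310
        recognize: Callable[[bytes], str],         # voice conversion interaction unit 320
        search: Callable[[str], List[str]],        # search interaction unit 330
        display: Callable[[List[str]], None],      # display unit 340
    ) -> None:
        self._acquire_signal = acquire_signal
        self._recognize = recognize
        self._search = search
        self._display = display

    def run(self) -> List[str]:
        """Run one voice search from captured audio to displayed results."""
        signal = self._acquire_signal()
        text = self._recognize(signal)
        results = self._search(text)
        self._display(results)
        return results


displayed: List[List[str]] = []
device = VoiceSearchDevice(
    acquire_signal=lambda: b"raw-audio",
    recognize=lambda signal: "APEC meeting",                # stub server reply
    search=lambda term: [f"result for {term}"],             # stub search server
    display=displayed.append,
)
print(device.run())
```

Injecting the units keeps the flow testable without real network services, which is a design choice of this sketch rather than something the description prescribes.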
In one embodiment of the present invention, the signal acquisition unit 310 of the device shown in FIG. 3 is further adapted to acquire the tag information of the voice search user; the tag information acquired by this unit reflects the voice search user's interests, hobbies, personality and needs. The voice conversion interaction unit 320 is further adapted to obtain from the voice conversion server, according to the tag information, one or more speech-recognized keywords that were shared by the voice search user's friends and that match the tag information. The search interaction unit 330 is further adapted to send a search request to the search server, using the keyword as the search term when the voice conversion interaction unit 320 obtains one keyword, and selecting one of the keywords as the search term when the voice conversion interaction unit 320 obtains multiple keywords; it is also adapted to receive the search results returned by the search server. The display unit 340 is adapted to load and display the search results returned by the search server.
In this embodiment, the voice conversion interaction unit 320 matches the voice search user's tag information against the voice search information shared by other voice search users, yielding a voice search scheme that fits the user's needs, interests and personality. Further, a voice search user can share his or her own voice search information so that friends can match against it, which realizes the sharing of voice search information among friends. The voice search information shared by a voice search user can be produced by the signal acquisition unit 310 and the voice conversion interaction unit 320 of the device shown in FIG. 3 working together:
The signal acquisition unit 310 is further adapted to acquire, in response to an instruction triggered by the voice search user, the voice signal input by the voice search user.
The voice conversion interaction unit 320 is further adapted to upload the voice signal together with the voice search user's tag information to the voice conversion server, so that the voice conversion server, after recognizing the voice signal as keywords in text form, stores the voice signal, the recognized keywords and the voice search user's tag information in correspondence as one shared information entry.
Each shared information entry that this unit stores on the voice conversion server side is structured as shown in Table 1: it contains a voice signal, the recognized keywords and the voice search user's tag information in one-to-one correspondence, and identifies one voice search performed by one voice search user.
For example, user A likes food and has the tag information "food", and the voice signal input by user A is "recommend a martial arts novel by Jin Yong". User A's tag information and voice signal are uploaded to the voice conversion server, which recognizes the keywords of the voice signal "recommend a martial arts novel by Jin Yong" as "Jin Yong" and "martial arts novel". The shared information entry stored on the voice conversion server side is then as shown in Table 2.
User B, a friend of user A, likes literature, and user B's tag information is "literature". When user B performs a voice search, user B's tag information "literature" matches the keyword "martial arts novel" shared by user A; "martial arts novel" is then used as the search term, and the search results returned for it are displayed to user B.
In addition, in some embodiments of the present invention, the voice search user wishes to download the shared information entries that match his or her own tag information to local storage for later viewing, searching and sharing. The voice conversion interaction unit 320 of the device shown in FIG. 3 may therefore be further adapted to download from the voice conversion server the shared information entries that match the voice search user's tag information, and to save them locally. Continuing the example above, user B can also download the shared information entry shown in Table 2 from the voice conversion server and save it locally.
In one embodiment of the present invention, the display unit 340 of the device shown in FIG. 3 is further adapted to display, when the search interaction unit 330 recognizes the character information as number data conforming to a specific format, an entry point for loading the operation that corresponds to the number data. Number data in a specific format may be, for example, a telephone number, a WeChat public account or a QQ number. If the search interaction unit 330 recognizes the character information as a telephone number, the user may want to make a call or send a message, so the display unit 340 displays an entry point for calling or sending a message. If the search interaction unit 330 recognizes the character information as a WeChat public account, the user may want to follow that account, so the display unit 340 displays an entry point for following the account. If the search interaction unit 330 recognizes the character information as a QQ number, the user may want to add that QQ number, so the display unit 340 displays an entry point for adding the QQ number. Other cases are analogous and are not listed one by one. This further optimizes the voice search scheme.
In some embodiments of the present invention, a user who performs a voice search on number data in a specific format wants to access the web page information corresponding to that number data. In such cases, when the search interaction unit 330 recognizes the character information as number data conforming to the specific format described above, the display unit 340 is further adapted to load the e-commerce web page information that the search server returns for the number data, and to provide an entry point for interacting with that e-commerce web page.
In one embodiment of the present invention, the signal acquisition unit 310 of the device shown in FIG. 3 is further adapted to acquire call recording data of the mobile terminal and to edit the call recording data to obtain the voice signal used for voice search.
In one embodiment of the present invention, the signal acquisition unit 310 of the device shown in FIG. 3 is further adapted to add a voice recognition module that recognizes and records the voice search user's own voice and marks it as the voice search user's voice data, and that records voice data shared or sent by friends, after manual identification or machine-learning-based identification, and marks it as the corresponding friend's voice data; it is also adapted to edit the voice data of the voice search user or of a friend to obtain the voice signal used for voice search.
In one embodiment of the present invention, the search interaction unit 330 of the device shown in FIG. 3 is further adapted to perform semantic recognition on the character information returned by the voice conversion interaction unit 320 to determine the category of the character information as a search term, and is adapted to send the search request to the search server of the corresponding category. For example, if the character information returned by the voice conversion server is "news of the APEC meeting", the search interaction unit 330 performs semantic recognition on it and determines that the search term category is news; the search interaction unit 330 then uses "APEC meeting" as the search term and sends the search request to the news search server. Likewise, when the character information is recognized as belonging to a category such as pictures, music, video or maps, the search process of the search interaction unit 330 is the same as above and is not repeated here. This scheme makes voice search more targeted and the search results more accurate and better matched to the user's needs.
In one embodiment of the present invention, the external environment or other factors may make the voice signal unclear or ambiguous, which can lead to the following two situations:
In the first situation, the voice conversion server recognizes the voice signal and obtains multiple pieces of character information. That is, the voice conversion interaction unit 320 of the device shown in FIG. 3 is adapted to receive multiple corresponding pieces of character information returned by the voice conversion server after it recognizes the voice signal. In this situation, in order to continue the search process, the display unit 340 is further adapted to display the multiple pieces of character information for the user to choose from, and the search interaction unit 330 is adapted to send a search request to the search server with the character information selected by the user as the search term.
In the second situation, the voice conversion server cannot recognize the voice signal and returns information indicating a recognition error. The display unit 340 of the device shown in FIG. 3 is further adapted to display the information indicating a recognition error when the voice conversion interaction unit 320 receives it from the voice conversion server, so as to inform the user that recognition of the voice signal failed for this voice search.
FIG. 4 shows a schematic diagram of the signal acquisition unit of a device for implementing voice search according to an embodiment of the present invention. As shown in FIG. 4, the signal acquisition unit 400 includes:
A voice search jump unit 410, adapted to display a voice search icon on the home page of the browser client, and adapted to jump, when the voice search icon is clicked, to a voice search page that contains a voice search control.
A voice acquisition control unit 420, adapted to start capturing the voice signal with the microphone of the mobile device on which the browser client runs in response to the event that the voice search control is activated, and adapted to stop capturing the voice signal and obtain the input voice signal in response to the event that the voice search control is deactivated.
In one embodiment of the present invention, the voice search jump unit 410 of the signal acquisition unit 400 shown in FIG. 4 is adapted to display a microphone icon at the end of the address bar of the browser client. The voice acquisition control unit 420 of the signal acquisition unit 400 shown in FIG. 4 is adapted to start capturing the voice signal with the microphone of the mobile device on which the browser client runs in response to the voice search control starting to be touched, and is adapted to stop capturing the voice signal and obtain the input voice signal in response to the voice search control ceasing to be touched.
In one embodiment of the present invention, the voice search jump unit 410 displays a microphone icon at the end of the address bar, as shown in FIG. 8A. When the microphone icon is clicked, the client jumps to the voice search page shown in FIG. 8B, which contains the voice search control. In this embodiment the voice search control is also a microphone icon. In response to the voice search control starting to be touched, the voice acquisition control unit 420 starts capturing the voice signal with the microphone of the mobile device on which the browser client runs; in response to the voice search control ceasing to be touched, the voice acquisition control unit 420 stops capturing the voice signal and obtains the input voice signal.
In one embodiment of the present invention, the voice search jump unit 410 shown in FIG. 4 is further adapted to display a page close control on the voice search page and, when the page close control is triggered, to close the voice search page and return to the home page of the browser client. As shown in FIG. 8B, the voice search jump unit 410 displays the page close control in the lower right corner of the voice search page; when the control is triggered, the voice search page is closed and the client returns to the home page of the browser client shown in FIG. 8A.
FIG. 5 shows a schematic diagram of the signal acquisition unit of a device for implementing voice search according to another embodiment of the present invention. As shown in FIG. 5, the signal acquisition unit 500 includes a voice search jump unit 510, a voice acquisition control unit 520, and a voice search prompt unit 530. The voice search jump unit 510 and the voice acquisition control unit 520 are the same as the voice search jump unit 410 and the voice acquisition control unit 420 shown in FIG. 4, respectively, and are not described again here.
The voice search prompt unit 530 is adapted to display N hot search terms on the voice search page, where the N hot search terms are the N top-ranked search terms obtained from statistics over one statistical period and N is a natural number; or is adapted to display search prompt information on the voice search page to tell the user how to perform a voice search. As shown in FIG. 8B, the voice search prompt unit 530 displays seven hot search terms on the voice search page, such as "How is the weather tomorrow" and "How to cook braised pork", and uses font size to distinguish the search ranking of each term.
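As an illustration of how the N hot search terms and their font sizes might be derived, the sketch below counts the queries logged in one statistical period and maps rank to font size linearly. The function name `hot_search_terms`, the font-size bounds, and the linear mapping are assumptions made for this example; the patent only states that font size distinguishes the rankings.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def hot_search_terms(queries: Iterable[str], n: int,
                     max_font: int = 28, min_font: int = 14) -> List[Tuple[str, int]]:
    """Return the n most frequent queries, each paired with a font size.

    `queries` is assumed to hold every query logged in one statistical period.
    """
    top = Counter(queries).most_common(n)
    if not top:
        return []
    # Spread font sizes evenly from max_font (rank 1) down to min_font (rank n).
    step = (max_font - min_font) / max(len(top) - 1, 1)
    return [(term, round(max_font - rank * step))
            for rank, (term, _count) in enumerate(top)]
```

A higher-ranked term receives a larger font, so the page can convey the ranking without showing any numbers.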
Given that the signal acquisition unit 500 shown in FIG. 5 displays N hot search terms on the voice search page, in one embodiment of the present invention the search interaction unit 330 of the device shown in FIG. 3 is further adapted to jump to the search result page corresponding to a hot search term in response to the event that the hot search term on the voice search page is clicked; the display unit 340 is further adapted to load and display that search result page. For example, on the voice search page shown in FIG. 8B, in response to the hot search term "How is the weather tomorrow" being clicked, the search interaction unit 330 jumps to the weather forecast search result page corresponding to that term, and the display unit 340 loads and displays the page to the user. This scheme recommends recent hot search terms to the user and jumps to the corresponding search page when a term is clicked, which matches users' search habits.
FIG. 6 shows a schematic diagram of a device for implementing voice search according to another embodiment of the present invention. As shown in FIG. 6, the device 600 for implementing voice search includes a signal acquisition unit 610, a network detection unit 620, a voice conversion interaction unit 630, a search interaction unit 640, and a display unit 650. The signal acquisition unit 610 and the search interaction unit 640 are the same as the signal acquisition unit 310 and the search interaction unit 330 of the device shown in FIG. 3, respectively. In one embodiment of the present invention, the signal acquisition unit 610 may be the signal acquisition unit 400 shown in FIG. 4 or the signal acquisition unit 500 shown in FIG. 5; these are not described again here.
The network detection unit 620 is adapted to obtain the network connection status of the mobile device before the voice conversion interaction unit 630 sends the voice signal to the voice conversion server.
The voice conversion interaction unit 630 is further adapted to send the voice signal to the voice conversion server when the network detection unit 620 detects that the mobile device is connected to a normal network.
The display unit 650 is further adapted to display a message indicating that the network connection status of the mobile device is abnormal when the network detection unit 620 detects that the mobile device is connected to a problematic network.
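The gating performed by units 620, 630, and 650 can be sketched as follows. The names `NetworkState` and `send_if_network_ok`, the callback parameters, and the message text are hypothetical; the `DISCONNECTED` state reflects clause A16, which also treats a device with no network access as an abnormal state.

```python
from enum import Enum
from typing import Callable


class NetworkState(Enum):
    NORMAL = "normal"
    PROBLEMATIC = "problematic"
    DISCONNECTED = "disconnected"


def send_if_network_ok(state: NetworkState, voice_signal: bytes,
                       send: Callable[[bytes], None],
                       show_message: Callable[[str], None]) -> bool:
    """Send the voice signal only on a normal network; otherwise notify the user.

    `send` stands in for the upload to the voice conversion server and
    `show_message` for the display unit; both are assumptions of this sketch.
    """
    if state is NetworkState.NORMAL:
        send(voice_signal)
        return True
    show_message("Network connection is abnormal; voice search is unavailable.")
    return False
```

Checking the connection before uploading avoids a failed round trip and lets the client report the problem immediately.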
FIG. 7 shows a schematic diagram of a device for implementing voice search according to yet another embodiment of the present invention. As shown in FIG. 7, the device 700 for implementing voice search includes a signal acquisition unit 710, a voice conversion interaction unit 720, a search interaction unit 730, a display unit 740, and an instruction execution unit 750. The signal acquisition unit 710, the search interaction unit 730, and the display unit 740 are the same as the signal acquisition unit 310, the search interaction unit 330, and the display unit 340 of the device shown in FIG. 3, respectively. In one embodiment of the present invention, the signal acquisition unit 710 may be the signal acquisition unit 400 shown in FIG. 4 or the signal acquisition unit 500 shown in FIG. 5; these are not described again here.
The voice conversion interaction unit 720 is further adapted to receive instruction information that the voice conversion server returns after performing semantic recognition on the character information, the instruction information being based on the recognized semantics;
The instruction execution unit 750 is adapted to perform the corresponding operation according to the instruction information.
According to the instruction information returned by the voice conversion server, this unit can invoke a relevant service component or third-party application in the terminal to perform the operation corresponding to the instruction information. For example, the signal acquisition unit 710 acquires the voice signal "I want to watch a movie" from the user. After sending the voice signal to the voice conversion server, the voice conversion interaction unit 720 receives instruction information that the server returns based on the semantics of the character information corresponding to the voice signal, such as "search for popular movies" or "find nearby movie theaters". The instruction execution unit 750 can then invoke the search interaction unit 730 to initiate a search on a video search server with "popular movies" as the search term; alternatively, it can invoke the positioning component of the terminal to look up nearby "movie theaters" and show the search results to the user.
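One way to picture the instruction execution unit is as a dispatch table from instruction type to handler. The patent does not specify the format of the instruction information, so the dictionary layout with `action` and `argument` keys, the function `make_executor`, and the two handlers below are all hypothetical.

```python
from typing import Callable, Dict


def make_executor(handlers: Dict[str, Callable[[str], str]]) -> Callable[[dict], str]:
    """Build an executor that routes instruction information to a handler."""

    def execute(instruction: dict) -> str:
        # Assumed format: {"action": "...", "argument": "..."}.
        handler = handlers.get(instruction.get("action", ""))
        if handler is None:
            return "unsupported instruction"
        return handler(instruction.get("argument", ""))

    return execute


# Hypothetical handlers standing in for the search interaction unit
# and the terminal's positioning component.
execute = make_executor({
    "search": lambda term: f"search request sent for '{term}'",
    "locate": lambda place: f"looking up nearby '{place}'",
})
```

With this layout, supporting another service component or third-party application only requires registering one more handler.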
An embodiment of the present invention also discloses a browser client for implementing voice search, which includes the device for implementing voice search described in any of the above embodiments.
In summary, the technical solution provided by the present invention interacts with a voice conversion server and with a search server to search for the character information corresponding to the voice signal input by the user and to show the search results to the user, forming a complete and effective voice search scheme. Compared with traditional manual search, it simplifies the operation flow and improves search efficiency. In addition, matching the tag information of the voice search user, recommending hot search terms, performing semantic recognition on the voice signal, and checking the current network before a voice search extend the interaction boundary of voice search and optimize the voice search process, so that the voice search of this scheme better matches users' search habits and gives users a more convenient and more satisfying experience.
It should be noted that:
The algorithms and displays provided herein are not inherently related to any particular computer, virtual device, or other apparatus. Various general-purpose devices may also be used with the teachings herein. From the description above, the structure required to construct such devices is apparent. Moreover, the present invention is not directed to any particular programming language. It should be understood that the content of the invention described herein can be implemented in a variety of programming languages, and that the above description of a specific language is provided to disclose the best mode of the invention.
Numerous specific details are set forth in the description provided herein. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures, and techniques are not shown in detail so as not to obscure the understanding of this description.
Similarly, it should be understood that, in order to streamline this disclosure and aid the understanding of one or more of the various inventive aspects, features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof in the above description of exemplary embodiments. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in fewer than all features of a single previously disclosed embodiment. The claims following the detailed description are therefore expressly incorporated into the detailed description, with each claim standing on its own as a separate embodiment of the invention.
Those skilled in the art will understand that the modules of the device in an embodiment can be adaptively changed and arranged in one or more devices different from that embodiment. The modules, units, or components of the embodiments may be combined into one module, unit, or component, and may also be divided into multiple sub-modules, sub-units, or sub-components. Except where at least some of such features, processes, or units are mutually exclusive, all features disclosed in this specification (including the accompanying claims, abstract, and drawings) and all processes or units of any method or device so disclosed may be combined in any combination. Unless expressly stated otherwise, each feature disclosed in this specification (including the accompanying claims, abstract, and drawings) may be replaced by an alternative feature serving the same, an equivalent, or a similar purpose.
Furthermore, those skilled in the art will understand that although some embodiments described herein include certain features that are included in other embodiments and not others, combinations of features of different embodiments are meant to be within the scope of the invention and to form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
The component embodiments of the present invention may be implemented in hardware, in software modules running on one or more processors, or in a combination of the two. Those skilled in the art will understand that a microprocessor or a digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of a device for implementing voice search according to an embodiment of the invention. The invention may also be implemented as a device or apparatus program (for example, a computer program or a computer program product) for performing part or all of the methods described herein. Such a program implementing the invention may be stored on a computer-readable medium or may take the form of one or more signals. Such signals may be downloaded from an Internet site, provided on a carrier signal, or provided in any other form.
It should be noted that the above embodiments illustrate rather than limit the invention, and that those skilled in the art can design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference sign placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third, and so on does not indicate any order; these words may be interpreted as names.
The present invention discloses A1, a method for implementing voice search, the method comprising: acquiring a voice signal input by a voice search user; sending the voice signal to a voice conversion server; receiving corresponding character information returned by the voice conversion server after it recognizes the voice signal; sending a search request to a search server with the character information as the search term; and receiving, loading, and displaying the search results returned by the search server.
A2. The method of A1, further comprising: acquiring tag information of the voice search user; obtaining from the voice conversion server, according to the tag information, one or more speech-recognized keywords that are shared by friends of the voice search user and match the tag information; sending a search request to the search server, using the keyword as the search term when there is one keyword, or selecting one of the keywords as the search term when there are several; and receiving, loading, and displaying the search results returned by the search server.
A3. The method of A2, further comprising: acquiring, according to an instruction triggered by the voice search user, a voice signal input by the voice search user; and uploading the voice signal together with the tag information of the voice search user to the voice conversion server, so that the voice conversion server, after recognizing the voice signal as a keyword in text form, stores the voice signal, the recognized keyword, and the tag information of the voice search user in association as one shared information entry.
A4. The method of A3, further comprising: downloading from the voice conversion server the shared information entries that match the tag information of the voice search user, and saving them locally.
A5. The method of A1, further comprising: when the character information is recognized as number data conforming to a specific format, displaying a loading entry for the operation corresponding to the number data.
A6. The method of A5, wherein, when the character information is recognized as number data conforming to a specific format, e-commerce web page information that corresponds to the number data and is returned by the search server is also loaded, and an entry for interacting with the e-commerce web page is provided.
A7. The method of A1, further comprising: acquiring call recording data of a mobile terminal, and editing the call recording data to obtain a voice signal for voice search.
A8. The method of A7, further comprising: adding a speech recognition module that recognizes and records the voice search user's own voice and marks it as the voice search user's voice data, and that records voice data shared or sent by friends after manual identification or machine-learning identification and marks it as the corresponding friend's voice data; and editing the voice data of the voice search user or of a friend to obtain a voice signal for voice search.
A9. The method of A1, further comprising: performing semantic recognition on the character information to determine the category of the character information as a search term;
wherein sending the search request to the search server comprises: sending the search request to the search server of the corresponding category.
A10. The method of A1, wherein acquiring the input voice signal comprises: displaying a voice search icon on the home page of the browser client; jumping to a voice search page containing a voice search control when the voice search icon is clicked; starting to capture a voice signal with the microphone of the mobile device on which the browser client runs, in response to the event that the voice search control is activated; and stopping the capture in response to the event that the voice search control is turned off, thereby obtaining the input voice signal.
A11. The method of A10, wherein displaying the voice search icon on the home page of the browser client comprises: displaying a microphone icon at the end of the address bar of the browser client;
the event that the voice search control is activated is: the voice search control starts to be touched;
the event that the voice search control is turned off is: the touch on the voice search control ends.
A12. The method of A10, further comprising: displaying N hot search terms on the voice search page, where the N hot search terms are the N top-ranked search terms obtained from statistics over one statistical period and N is a natural number; or displaying search prompt information on the voice search page to tell the user how to perform a voice search.
A13. The method of A10, further comprising:
displaying a page close control on the voice search page and, when the page close control is triggered, closing the voice search page and returning to the home page of the browser client.
A14. The method of A12, further comprising: jumping to the search result page corresponding to a hot search term in response to the event that the hot search term on the voice search page is clicked.
A15. The method of any one of A1-A14, wherein receiving the corresponding character information returned by the voice conversion server after it recognizes the voice signal comprises: receiving a plurality of corresponding pieces of character information returned by the voice conversion server after it recognizes the voice signal;
the method further comprises: displaying the plurality of pieces of character information for the user to choose from;
and using the character information as the search term means using the piece of character information selected by the user as the search term.
A16. The method of any one of A1-A14, wherein, before the voice signal is sent to the voice conversion server, the method further comprises: obtaining the network connection status of the mobile device;
if the mobile device is not connected to a network, or is connected to a problematic network, displaying information indicating that the network connection status of the mobile device is abnormal;
if the mobile device is connected to a normal network, performing the step of sending the voice signal to the voice conversion server.
A17. The method of any one of A1-A14, further comprising:
when information indicating a recognition error is received from the voice conversion server after it attempts to recognize the voice signal, displaying the information indicating the recognition error.
A18. The method of any one of A1-A14, further comprising:
receiving instruction information that the voice conversion server returns after performing semantic recognition on the character information, the instruction information being based on the recognized semantics;
performing the corresponding operation according to the instruction information.
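The basic flow of A1, together with the recognition-error case of A17, can be sketched as a single function. The function `voice_search` and its three callback parameters are hypothetical stand-ins for the round trips to the voice conversion server, the search server, and the display step; real network calls and response formats are not specified in the source and are not modelled here.

```python
from typing import Callable, Optional


def voice_search(voice_signal: bytes,
                 recognize: Callable[[bytes], Optional[str]],
                 search: Callable[[str], list],
                 display: Callable[[object], None]) -> bool:
    """Run one voice search; return True if results were displayed."""
    # Send the voice signal to the conversion server and wait for text.
    text = recognize(voice_signal)
    if not text:
        # A17: show the recognition error instead of searching.
        display("Speech could not be recognized.")
        return False
    # Use the returned character information as the search term.
    results = search(text)
    display(results)
    return True
```

Passing the server interactions in as callables keeps the flow testable without a network and leaves room for variants such as the candidate selection of A15 or the network check of A16.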
The present invention also discloses B19, a device for implementing voice search, the device comprising:
a signal acquisition unit, adapted to acquire a voice signal input by a voice search user;
a voice conversion interaction unit, adapted to send the voice signal to a voice conversion server, and adapted to receive corresponding character information returned by the voice conversion server after it recognizes the voice signal;
a search interaction unit, adapted to send a search request to a search server with the character information as the search term, and adapted to receive the search results returned by the search server;
a display unit, adapted to load and display the search results returned by the search server.
B20. The device of B19, wherein the signal acquisition unit is further adapted to acquire tag information of the voice search user;
the voice conversion interaction unit is further adapted to obtain from the voice conversion server, according to the tag information, one or more speech-recognized keywords that are shared by friends of the voice search user and match the tag information;
the search interaction unit is adapted to send a search request to the search server, using the keyword as the search term when the voice conversion interaction unit obtains one keyword, or selecting one of the keywords as the search term when the voice conversion interaction unit obtains several; and is adapted to receive the search results returned by the search server;
the display unit is adapted to load and display the search results returned by the search server.
B21. The device of B20, wherein
the signal acquisition unit is further adapted to acquire, according to an instruction triggered by the voice search user, a voice signal input by the voice search user;
the voice conversion interaction unit is further adapted to upload the voice signal together with the tag information of the voice search user to the voice conversion server, so that the voice conversion server, after recognizing the voice signal as a keyword in text form, stores the voice signal, the recognized keyword, and the tag information of the voice search user in association as one shared information entry.
B22. The device of B21, wherein
the voice conversion interaction unit is further adapted to download from the voice conversion server the shared information entries that match the tag information of the voice search user, and to save them locally.
B23. The device of B19, wherein
the display unit is further adapted to display a loading entry for the operation corresponding to the number data when the search interaction unit recognizes the character information as number data conforming to a specific format.
B24. The device of B23, wherein
the display unit is further adapted, when the search interaction unit recognizes the character information as number data conforming to a specific format, to load e-commerce web page information that corresponds to the number data and is returned by the search server, and to provide an entry for interacting with the e-commerce web page.
B25. The device of B19, wherein
the signal acquisition unit is further adapted to acquire call recording data of a mobile terminal and to edit the call recording data to obtain a voice signal for voice search.
B26. The device of B25, wherein
the signal acquisition unit is further adapted to add a speech recognition module that recognizes and records the voice search user's own voice and marks it as the voice search user's voice data, and that records voice data shared or sent by friends after manual identification or machine-learning identification and marks it as the corresponding friend's voice data; and is adapted to edit the voice data of the voice search user or of a friend to obtain a voice signal for voice search.
B27. The device of B19, wherein
the search interaction unit is further adapted to perform semantic recognition on the character information to determine the category of the character information as a search term, and is adapted to send the search request to the search server of the corresponding category.
B28、如B19所述的装置,其中,所述信号获取单元包括:语音搜索跳转单元和语音获取控制单元;B28. The device as described in B19, wherein the signal acquisition unit includes: a voice search jump unit and a voice acquisition control unit;
所述语音搜索跳转单元,适于在浏览器客户端的首页中显示语音搜索图标;当该语音搜索图标被点击时,跳转到包含语音搜索控件的语音搜索页面;The voice search jump unit is suitable for displaying a voice search icon on the homepage of the browser client; when the voice search icon is clicked, it jumps to a voice search page containing a voice search control;
所述语音获取控制单元,适于响应于所述语音搜索控件被启动的事件,利用浏览器客户端所在移动设备的麦克风开始捕获语音信号;并且适于响应于所述语音搜索控件被关闭的事件,停止捕获语音信号,获得输入的语音信号。The voice acquisition control unit is adapted to respond to the event that the voice search control is started, using the microphone of the mobile device where the browser client is located to start capturing voice signals; and is adapted to respond to the event that the voice search control is turned off , stop capturing the voice signal and get the input voice signal.
B29. The device according to B28, wherein
the voice search jump unit is adapted to display a microphone icon at the end of the address bar of the browser client;
the voice acquisition control unit is adapted to start capturing a voice signal with the microphone of the mobile device on which the browser client runs in response to the voice search control starting to be touched, and is adapted to stop capturing the voice signal in response to the touch on the voice search control ending, thereby obtaining the input voice signal.
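The press-and-hold capture behaviour described in B28 and B29 amounts to a two-state controller. The sketch below is an assumption-laden illustration: the class name, the method names, and the idea of receiving audio as byte chunks are hypothetical, and no real microphone API is involved.

```python
class VoiceCaptureController:
    """Buffers audio chunks only while the voice search control is held."""

    def __init__(self):
        self._recording = False
        self._chunks = []

    def on_touch_start(self):
        # The control starts being touched: begin a fresh capture.
        self._chunks = []
        self._recording = True

    def on_audio_chunk(self, chunk):
        # Microphone data arriving outside a capture is discarded.
        if self._recording:
            self._chunks.append(chunk)

    def on_touch_end(self):
        # The touch ends: stop capturing and return the input voice signal.
        self._recording = False
        return b"".join(self._chunks)
```

The value returned by `on_touch_end` corresponds to the "input voice signal" that is later sent to the voice conversion server.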
B30. The device according to B28, wherein the voice signal acquisition unit further comprises a voice search prompt unit;
the voice search prompt unit is adapted to display N hot search terms on the voice search page, wherein the N hot search terms are the N top-ranked search terms obtained from statistics collected within one statistical period, and N is a natural number; or is adapted to display search prompt information on the voice search page to prompt the user on how to perform a voice search.
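The selection of the N hot search terms in B30 can be sketched as a frequency count over a time window. The log format of `(timestamp, term)` pairs and the function name are assumptions made for illustration; the claim does not specify how the statistics are gathered.

```python
from collections import Counter
from datetime import datetime, timedelta


def hot_search_terms(query_log, now, period, n):
    """Return the n most frequent search terms logged within `period` before `now`.

    query_log is an iterable of (timestamp, term) pairs (a hypothetical format).
    """
    window_start = now - period
    counts = Counter(
        term for timestamp, term in query_log if window_start <= timestamp <= now
    )
    # most_common orders by descending count; equal counts keep first-seen order.
    return [term for term, _ in counts.most_common(n)]
```

For example, with a seven-day statistical period, terms logged ten days ago do not contribute to the ranking regardless of how often they were searched.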
B31. The device according to B28, wherein
the voice search jump unit is further adapted to display a page closing control on the voice search page and, when the page closing control is triggered, to close the voice search page and return to the home page of the browser client.
B32. The device according to B30, wherein
the search interaction unit is further adapted to jump to the search result page corresponding to a hot search term in response to an event that the hot search term on the voice search page is clicked;
the display unit is further adapted to load and display the search result page corresponding to the hot search term.
B33. The device according to any one of B19-B32, wherein
the voice conversion interaction unit is adapted to receive a plurality of pieces of corresponding character information returned by the voice conversion server after it recognizes the voice signal;
the display unit is further adapted to display the plurality of pieces of character information for the user to select from;
the search interaction unit is adapted to send a search request to the search server with the character information selected by the user as the search term.
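The candidate selection flow of B33 can be sketched as two small steps: list the recognition candidates for the user, then build the search request from the chosen one. The endpoint URL, the function names, and the removal of repeated candidates are assumptions added for illustration only.

```python
from urllib.parse import urlencode

# Hypothetical search endpoint; the claim does not name one.
SEARCH_ENDPOINT = "https://www.search.example/s"


def present_candidates(candidates):
    """Number the recognition candidates for display, dropping repeats."""
    unique = list(dict.fromkeys(candidates))  # keeps first-seen order
    lines = [f"{i}. {text}" for i, text in enumerate(unique, start=1)]
    return lines, unique


def search_request_for(unique_candidates, chosen_number):
    """Build the search URL for the candidate the user picked (1-based)."""
    if not 1 <= chosen_number <= len(unique_candidates):
        raise ValueError("choice is outside the displayed candidates")
    query = unique_candidates[chosen_number - 1]
    return SEARCH_ENDPOINT + "?" + urlencode({"q": query})
```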
B34. The device according to any one of B19-B32, wherein the device further comprises a network detection unit;
the network detection unit is adapted to obtain the network connection status of the mobile device before the voice conversion interaction unit sends the voice signal to the voice conversion server;
the voice conversion interaction unit is further adapted to send the voice signal to the voice conversion server when the network detection unit detects that the mobile device is connected to a normally working network;
the display unit is further adapted to display a message indicating that the network connection status of the mobile device is abnormal when the network detection unit detects that the mobile device is connected to a faulty network.
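The network check of B34 is a gate placed in front of the upload. In the sketch below, the `NetworkState` enum, the callback parameters `send` and `show_message`, and the message text are hypothetical; they stand in for the voice conversion interaction unit and the display unit.

```python
from enum import Enum


class NetworkState(Enum):
    NORMAL = "normal"
    PROBLEM = "problem"


def submit_voice_signal(signal, network_state, send, show_message):
    """Send the signal only on a normal network; otherwise show an error message.

    `send` and `show_message` are caller-supplied callables (hypothetical hooks).
    Returns True if the signal was sent.
    """
    if network_state is NetworkState.NORMAL:
        send(signal)
        return True
    show_message("Network connection is abnormal; please check your network.")
    return False
```

Checking the connection first avoids uploading audio that cannot be converted and gives the user an immediate explanation instead of a silent failure.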
B35. The device according to any one of B19-B32, wherein
the display unit is further adapted to display information indicating a recognition error when the voice conversion interaction unit receives such information from the voice conversion server after the server attempts to recognize the voice signal.
B36. The device according to any one of B19-B32, wherein the device further comprises an instruction execution unit;
the voice conversion interaction unit is further adapted to receive instruction information that the voice conversion server returns according to the semantics it recognizes by performing semantic recognition on the character information;
the instruction execution unit is adapted to perform the corresponding operation according to the instruction information.
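The instruction execution unit of B36 can be pictured as a dispatch table from instruction names to operations. The dictionary-shaped instruction with `action` and `argument` keys and the handler names in the usage note are assumptions; the claim does not define the format of the instruction information.

```python
def make_instruction_executor(handlers):
    """Return a function that runs the handler registered for an instruction.

    `handlers` maps an instruction name to a callable taking the instruction's
    argument; the names and the dict-shaped instruction are hypothetical.
    """

    def execute(instruction):
        handler = handlers.get(instruction["action"])
        if handler is None:
            raise ValueError(f"unsupported instruction: {instruction['action']}")
        return handler(instruction.get("argument"))

    return execute
```

A client might register handlers such as `{"open_url": open_page, "go_back": go_back}` and pass each instruction returned by the server to `execute`; unknown instructions fail loudly rather than being ignored.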
The present invention also discloses C37, a browser client for implementing voice search, comprising the device for implementing voice search according to any one of the above.
Claims (35)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410676462.5A CN104462262B (en) | 2014-11-21 | 2014-11-21 | A kind of method for realizing phonetic search, device and browser client |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410676462.5A CN104462262B (en) | 2014-11-21 | 2014-11-21 | A kind of method for realizing phonetic search, device and browser client |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN104462262A CN104462262A (en) | 2015-03-25 |
| CN104462262B CN104462262B (en) | 2017-10-31 |
Family
ID=52908298
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410676462.5A Active CN104462262B (en) | 2014-11-21 | 2014-11-21 | A kind of method for realizing phonetic search, device and browser client |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN104462262B (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110837334A (en) * | 2019-11-04 | 2020-02-25 | 北京字节跳动网络技术有限公司 | Method, device, terminal and storage medium for interactive control |
Families Citing this family (36)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10083697B2 (en) * | 2015-05-27 | 2018-09-25 | Google Llc | Local persisting of data for selectively offline capable voice action in a voice-enabled electronic device |
| US9966073B2 (en) | 2015-05-27 | 2018-05-08 | Google Llc | Context-sensitive dynamic update of voice to text model in a voice-enabled electronic device |
| CN104951536B (en) * | 2015-06-18 | 2021-01-22 | 百度在线网络技术(北京)有限公司 | Searching method and device |
| CN106407198A (en) * | 2015-07-28 | 2017-02-15 | 百度在线网络技术(北京)有限公司 | Question and answer information processing method and device |
| CN106445940A (en) * | 2015-08-05 | 2017-02-22 | 阿里巴巴集团控股有限公司 | Navigation method and apparatus |
| CN105138618A (en) * | 2015-08-11 | 2015-12-09 | 百度在线网络技术(北京)有限公司 | Method and apparatus for performing secondary search in combination with speech in image search |
| CN105138619B (en) * | 2015-08-12 | 2018-10-19 | 百度在线网络技术(北京)有限公司 | Searching method and system for mobile terminal webpage |
| CN105117195B (en) * | 2015-09-09 | 2018-05-08 | 百度在线网络技术(北京)有限公司 | The bootstrap technique and device of phonetic entry |
| CN105303794A (en) * | 2015-11-30 | 2016-02-03 | 讯美电子科技有限公司 | Comprehensive disposal management system for alarming by means of voiceprint recognition |
| CN105718239B (en) * | 2016-01-27 | 2019-03-08 | 百度在线网络技术(北京)有限公司 | A kind of method and apparatus of voice input |
| CN105843839A (en) * | 2016-02-01 | 2016-08-10 | 乐视移动智能信息技术(北京)有限公司 | Voice type search method and apparatus |
| CN106024013B (en) * | 2016-04-29 | 2022-01-14 | 努比亚技术有限公司 | Voice data searching method and system |
| CN107515884A (en) * | 2016-06-17 | 2017-12-26 | 阿里巴巴集团控股有限公司 | A search method, device and system |
| CN106601254B (en) | 2016-12-08 | 2020-11-06 | 阿里巴巴(中国)有限公司 | Information input method and device and computing equipment |
| CN108228627A (en) * | 2016-12-15 | 2018-06-29 | 百度在线网络技术(北京)有限公司 | For generating the method and apparatus of webpage and terminal device |
| CN107578776B (en) * | 2017-09-25 | 2021-08-06 | 咪咕文化科技有限公司 | A wake-up method, device and computer-readable storage medium for voice interaction |
| CN107622766B (en) * | 2017-09-28 | 2021-01-05 | 百度在线网络技术(北京)有限公司 | Method and apparatus for searching information |
| CN107845384A (en) * | 2017-10-30 | 2018-03-27 | 江西博瑞彤芸科技有限公司 | A kind of audio recognition method |
| CN107835313A (en) * | 2017-11-02 | 2018-03-23 | 奇酷互联网络科技(深圳)有限公司 | Carry out method, terminal, readable storage medium storing program for executing and the communicating terminal of dial-up |
| CN107992587A (en) * | 2017-12-08 | 2018-05-04 | 北京百度网讯科技有限公司 | A kind of voice interactive method of browser, device, terminal and storage medium |
| CN110019679A (en) * | 2017-12-13 | 2019-07-16 | 联易软件有限公司 | Food and drug administration method and apparatus |
| CN108231080A (en) * | 2018-01-05 | 2018-06-29 | 广州蓝豹智能科技有限公司 | Voice method for pushing, device, smart machine and storage medium |
| CN108491178B (en) * | 2018-02-13 | 2021-05-28 | 北京百度网讯科技有限公司 | Information browsing method, browser and server |
| CN108492476A (en) * | 2018-03-06 | 2018-09-04 | 中国邮政集团公司广州市分公司 | A kind of voice recognition pickup intelligent cabinet |
| CN108717853B (en) * | 2018-05-09 | 2020-11-20 | 深圳艾比仿生机器人科技有限公司 | Man-machine voice interaction method, device and storage medium |
| CN108763068B (en) * | 2018-05-15 | 2021-12-28 | 福建天泉教育科技有限公司 | Automatic testing method and terminal based on machine learning |
| CN108959520A (en) * | 2018-06-28 | 2018-12-07 | 百度在线网络技术(北京)有限公司 | Searching method, device, equipment and storage medium based on artificial intelligence |
| CN110750705A (en) * | 2018-07-06 | 2020-02-04 | 北京高德云图科技有限公司 | Search result display method and device and electronic equipment |
| CN109033392A (en) * | 2018-07-31 | 2018-12-18 | 广州新趋士网络科技有限公司 | A kind of commercial products retrieval system |
| CN109815366B (en) * | 2019-01-25 | 2023-07-14 | 浪潮软件科技有限公司 | Method and device for realizing video aggregation search voice docking |
| CN110246494A (en) * | 2019-05-20 | 2019-09-17 | 深圳壹账通智能科技有限公司 | Service request method, device and computer equipment based on speech recognition |
| CN110211587B (en) * | 2019-06-03 | 2024-04-30 | 腾讯大地通途(北京)科技有限公司 | Ranking information acquisition method, device, equipment and medium |
| CN117056622A (en) * | 2019-10-22 | 2023-11-14 | 海信视像科技股份有限公司 | Voice control method and display device |
| CN111243444A (en) * | 2020-03-19 | 2020-06-05 | 闻泰通讯股份有限公司 | Flexible display panel, electronic device, control method, and storage medium |
| TWI752474B (en) * | 2020-04-22 | 2022-01-11 | 莊連豪 | An accessible and intelligent voice recognition system and the control method |
| CN113539252A (en) * | 2020-04-22 | 2021-10-22 | 庄连豪 | Barrier-free intelligent voice system and control method thereof |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101437039A (en) * | 2007-11-15 | 2009-05-20 | 华为技术有限公司 | Mobile searching method, system and equipment |
| CN101505328A (en) * | 2008-02-04 | 2009-08-12 | 台达电子工业股份有限公司 | Network data retrieval method and system applying voice recognition |
| CN102999639A (en) * | 2013-01-04 | 2013-03-27 | 深圳市中兴移动通信有限公司 | Speech recognition character index based method and system for searching |
| CN103354089A (en) * | 2013-06-25 | 2013-10-16 | 天津三星通信技术研究有限公司 | Voice communication management method and apparatus thereof |
| CN104105075A (en) * | 2013-04-01 | 2014-10-15 | 百度在线网络技术(北京)有限公司 | Terminal device warning tone setting method through search engine and terminal device |
Family Cites Families (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP2339576B1 (en) * | 2009-12-23 | 2019-08-07 | Google LLC | Multi-modal input on an electronic device |
Also Published As
| Publication number | Publication date |
|---|---|
| CN104462262A (en) | 2015-03-25 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| CN104462262B (en) | A kind of method for realizing phonetic search, device and browser client | |
| JP6708717B2 (en) | News recommendation method and device | |
| US9524714B2 (en) | Speech recognition apparatus and method thereof | |
| KR101908099B1 (en) | Automated click type selection for content performance optimization | |
| US20170293419A1 (en) | Method and system for context based tab management | |
| US10440435B1 (en) | Performing searches while viewing video content | |
| US10878044B2 (en) | System and method for providing content recommendation service | |
| US20170359446A1 (en) | Computer Application Promotion | |
| CN106096037A (en) | Search Results polymerization based on artificial intelligence, device and search engine | |
| CN109791680A (en) | Key frame of video on online social networks is shown | |
| US12135752B2 (en) | Linking to a search result | |
| US20170060966A1 (en) | Action Recommendation System For Focused Objects | |
| KR20180087456A (en) | Identifying matching applications based on browsing activity | |
| TW201812618A (en) | Search information processing method and apparatus | |
| JP2015118708A (en) | Method and apparatus for providing search results | |
| CN106681598B (en) | Information input method and device | |
| US10061806B2 (en) | Presenting previously selected search results | |
| CN107515870B (en) | Searching method and device and searching device | |
| CN106709073A (en) | Browser notification pushing method and browser terminal | |
| CN105224657B (en) | A kind of information recommendation method and electronic equipment based on search engine | |
| US20160188708A1 (en) | Storing Predicted Search Results on a User Device Based on Software Application Use | |
| CN107515869B (en) | Searching method and device and searching device | |
| CN103530385A (en) | Method and device for searching for information based on vertical searching channels | |
| CN105224652A (en) | A kind of information recommendation method based on video and electronic equipment | |
| CN115687807A (en) | Information display method, device, terminal and storage medium |
Legal Events
| Code | Title | Description |
|---|---|---|
| C06 | Publication | |
| PB01 | Publication | |
| SE01 | Entry into force of request for substantive examination | |
| GR01 | Patent grant | |
| TR01 | Transfer of patent right | Effective date of registration: 20220728. Address after: Room 801, 8th floor, No. 104, floors 1-19, building 2, yard 6, Jiuxianqiao Road, Chaoyang District, Beijing 100015. Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd. Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park). Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.; Qizhi software (Beijing) Co.,Ltd. |