
CN113641439B - Text recognition and display method, device, electronic equipment and medium - Google Patents

Info

Publication number
CN113641439B
Authority
CN
China
Prior art keywords
text recognition
text
scanning
candidate
client
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110938868.6A
Other languages
Chinese (zh)
Other versions
CN113641439A (en)
Inventor
梁霄
张铭阳
蒋峰
唐红羚
柳舒芳
张国旺
杨彦哲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202110938868.6A
Publication of CN113641439A
Application granted
Publication of CN113641439B
Active legal status
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44 Arrangements for executing specific programs
    • G06F9/451 Execution arrangements for user interfaces

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The disclosure provides a text recognition and display method, a device, electronic equipment and a medium, and relates to the technical field of computers, in particular to the technical fields of text recognition, cloud computing and cloud services. The specific implementation scheme is as follows: determining a binding relationship between the candidate clients and the scanning device according to the acquired text recognition instruction; determining a target server from the candidate servers according to the binding relationship; and sending the acquired text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result. The disclosure improves the diversity and richness of text recognition results.

Description

Text recognition and display method, device, electronic equipment and medium
Technical Field
The disclosure relates to the technical field of computers, in particular to the technical fields of text recognition, cloud computing and cloud service, and particularly relates to a text recognition and display method, a device, electronic equipment and a medium.
Background
With the development of science and technology, more and more intelligent learning devices are being integrated into students' learning, greatly improving their learning efficiency. The dictionary pen is an emerging intelligent learning device: by simply scanning text with the dictionary pen, a student can obtain the pronunciation, paraphrase, translation and other information for that text without repeatedly leafing through a dictionary.
Most of the current dictionary pens generate information query results based on a locally mounted processor.
Disclosure of Invention
The disclosure provides a text recognition and display method, a device, electronic equipment and a medium.
According to an aspect of the present disclosure, there is provided a text recognition method including:
determining a binding relation between the candidate client and the scanning equipment according to the acquired text recognition instruction;
and determining a target server from the candidate servers according to the binding relation, and sending the acquired text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
According to another aspect of the present disclosure, there is provided a text display method including:
acquiring a text recognition result from a target server, wherein the target server is determined from the candidate servers according to the binding relationship between the candidate clients and the scanning device;
and displaying the text recognition result.
According to another aspect of the present disclosure, there is provided an electronic device including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of the present disclosure.
According to another aspect of the present disclosure, there is provided a non-transitory computer-readable storage medium storing computer instructions for causing the computer to perform the method of any one of the present disclosure.
It should be understood that the description in this section is not intended to identify key or critical features of the embodiments of the disclosure, nor is it intended to be used to limit the scope of the disclosure. Other features of the present disclosure will become apparent from the following specification.
Drawings
The drawings are for a better understanding of the present solution and are not to be construed as limiting the present disclosure. Wherein:
FIG. 1 is a flow chart of a text recognition method disclosed in accordance with an embodiment of the present disclosure;
FIG. 2A is a flow chart of a text recognition method disclosed in accordance with an embodiment of the present disclosure;
FIG. 2B is a schematic diagram of an open user center interface according to an embodiment of the present disclosure;
FIG. 2C is a schematic diagram of a user center interface disclosed in accordance with an embodiment of the present disclosure;
FIG. 2D is a schematic diagram of one switching pattern disclosed in accordance with an embodiment of the present disclosure;
FIG. 2E is a schematic diagram of a binding hint disclosed in accordance with embodiments of the present disclosure;
FIG. 2F is a schematic diagram of a network disconnection hint disclosed according to an embodiment of the present disclosure;
FIG. 2G is a schematic diagram of a shortcut icon disclosed in accordance with an embodiment of the present disclosure;
FIG. 3A is a flow chart of a text presentation method disclosed in accordance with an embodiment of the present disclosure;
FIG. 3B is an interface schematic of a functional page disclosed in accordance with an embodiment of the present disclosure;
FIG. 3C is a schematic diagram of a text recognition result presentation disclosed in accordance with an embodiment of the present disclosure;
FIG. 3D is a schematic diagram of a text recognition result presentation disclosed in accordance with an embodiment of the present disclosure;
FIG. 4 is a schematic diagram of a structure of a text recognition device according to an embodiment of the present disclosure;
FIG. 5 is a schematic diagram of a text display device according to an embodiment of the present disclosure;
FIG. 6 is a block diagram of an electronic device for implementing the text recognition method and/or the text presentation method disclosed in embodiments of the present disclosure.
Detailed Description
Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and should be considered as merely exemplary. Accordingly, one of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
During research and development, the applicant found that existing text scanning devices generally perform text recognition on the text scanning result using a locally mounted processor to obtain the corresponding text recognition result. This conventional approach has the following disadvantages: 1) The limited computing power of the processor carried by the scanning device makes text recognition slow, so the text recognition result cannot be obtained quickly. 2) The scanning device offers only a single local text recognition method, so the resulting text recognition results lack diversity and richness, and a high-quality text recognition result cannot be provided to the user.
Fig. 1 is a flowchart of a text recognition method according to an embodiment of the present disclosure, which may be applicable to a case of performing text recognition based on a server. The method of the present embodiment may be performed by the text recognition device disclosed in the embodiments of the present disclosure, where the device may be implemented in software and/or hardware, and may be integrated on any electronic device having computing capabilities.
As shown in fig. 1, the text recognition method disclosed in this embodiment may include:
S101, determining the binding relation between the candidate client and the scanning equipment according to the acquired text recognition instruction.
The scanning device is a device with an image acquisition function, and can acquire images of any entity containing text information, such as a text carrier, a book, a test paper, a newspaper and the like, so as to obtain an image acquisition result, namely a text scanning result. The text recognition instruction is an instruction for triggering the scanning device to scan text. The candidate client is a client which establishes mutual trust with the scanning equipment in advance, namely the candidate client has authority to acquire a text recognition result corresponding to the text scanning result. The candidate clients are divided into two types, one type of candidate clients has a binding relationship with the scanning equipment, and the candidate clients can acquire text recognition results based on the binding relationship; the other type of candidate client has no binding relation with the scanning equipment, and the candidate client can not acquire the text recognition result temporarily and needs to be bound with the scanning equipment.
In one embodiment, a user issues a text recognition instruction on the scanning device, for example by clicking a text recognition button on the scanning device's own touch display screen, which triggers generation of the text recognition instruction. The scanning device acquires the text recognition instruction issued by the user and, according to it, verifies whether a binding relationship has been established between each candidate client and the scanning device. For example, the candidate clients are verified by polling: if a binding relationship has been established between a candidate client and the scanning device, that candidate client is marked as bound; correspondingly, if no binding relationship has been established, that candidate client is marked as unbound.
The binding relation between the candidate client and the scanning equipment is determined according to the acquired text recognition instruction, so that a foundation is laid for determining the target server for text recognition according to the binding relation.
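The polling verification described for S101 can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the `CandidateClient` record, the `BindingStatus` labels and the `bound_names` set (standing in for whatever binding store the scanning device consults) are all assumptions introduced for the example.

```python
from dataclasses import dataclass
from enum import Enum


class BindingStatus(Enum):
    BOUND = "bound"
    UNBOUND = "unbound"


@dataclass(frozen=True)
class CandidateClient:
    # Hypothetical record; the disclosure does not define a client schema.
    name: str


def determine_binding_relations(candidates, bound_names):
    """Poll every candidate client and mark it as bound or unbound.

    `bound_names` is an assumed stand-in for the device's binding store.
    """
    relations = {}
    for client in candidates:
        if client.name in bound_names:
            relations[client.name] = BindingStatus.BOUND
        else:
            relations[client.name] = BindingStatus.UNBOUND
    return relations


candidates = [CandidateClient("A"), CandidateClient("B"), CandidateClient("C")]
relations = determine_binding_relations(candidates, bound_names={"A", "C"})
```

Every candidate client receives an explicit mark, so a later step can select target servers without re-checking the binding store.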
S102, determining a target server from the candidate servers according to the binding relation, and sending the acquired text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
The candidate servers are determined according to the candidate clients, and an association exists between them: each candidate client is associated with at least one candidate server. As new candidate clients are added, corresponding servers are added to the existing candidate servers; that is, the candidate servers in this embodiment are extensible.
In one embodiment, the candidate clients are screened according to the determined binding relationships to identify those that have established a binding relationship with the scanning device, and the at least one candidate server associated with each such candidate client is taken as a target server. After the target server is determined, the scanning device prompts the user to scan text; the user moves the scanning device over the text information to be scanned, and the scanning device acquires the text scanning result. Using its network transmission function, the scanning device sends the text scanning result to the total server, which calls the interface of each target server and transmits the text scanning result to each of them.
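The fan-out performed by the total server can be sketched as below. The sketch assumes each target server interface is a plain callable taking the scan result; the names `fan_out`, `server_a` and `server_b`, and the canned results they return, are hypothetical and only illustrate that independent servers may return different fields.

```python
def fan_out(scan_result, target_interfaces):
    """Call every target server interface with the same text scanning result."""
    results = {}
    for server_name, recognize in target_interfaces.items():
        results[server_name] = recognize(scan_result)
    return results


# Hypothetical recognizers returning canned results; real servers would run
# their own recognition algorithms against their own databases.
def server_a(scan_result):
    return {"ocr": "apple", "bytes_received": len(scan_result)}


def server_b(scan_result):
    return {"ocr": "apple", "part_of_speech": "noun"}


scan_result = b"\x00\x01\x02"  # placeholder for the stitched image data
results = fan_out(scan_result, {"server_a": server_a, "server_b": server_b})
```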
After receiving the text scanning result, each target server performs text recognition on it based on the text recognition algorithm and database that the server carries, and determines the corresponding text recognition result. The text recognition result includes, but is not limited to, OCR (optical character recognition) information, text part-of-speech information, text paraphrasing information and text phonetic symbol information; this embodiment does not limit the specific content of the text recognition result. Because the target servers are independent of one another and carry different text recognition algorithms and databases, the text recognition results obtained have higher diversity and richness.
Each target server side respectively sends the determined text recognition result to a total server side, and the total server side sends the text recognition result to a target client side in a network transmission mode, wherein the target client side can be a client side corresponding to the target server side, namely each target client side respectively receives the text recognition result of the corresponding target server side; the target client may also be an independent client, that is, the total server gathers text recognition results obtained by each target server and sends the summarized text recognition results to one target client. Meanwhile, the total server side also sends the text recognition result to the scanning device.
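The two delivery modes described above (one result per corresponding client, or one summarized result for a single independent client) might be sketched as follows. The function names, the server-to-client mapping and the sample results are illustrative assumptions, not part of the disclosure.

```python
def route_per_client(results_by_server, client_for_server):
    """Mode 1: each target client receives only its own server's result."""
    return {
        client_for_server[server]: result
        for server, result in results_by_server.items()
    }


def route_aggregated(results_by_server, independent_client):
    """Mode 2: summarize all results and send them to one independent client."""
    summary = {
        server: results_by_server[server] for server in sorted(results_by_server)
    }
    return {independent_client: summary}


# Hypothetical recognition results keyed by target server.
results_by_server = {
    "server_b": {"part_of_speech": "noun"},
    "server_a": {"ocr": "apple"},
}

per_client = route_per_client(
    results_by_server, {"server_a": "client_a", "server_b": "client_b"}
)
aggregated = route_aggregated(results_by_server, "client_x")
```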
And after receiving the text recognition result, the target client displays the text recognition result in a specified display area. After receiving the text recognition result, the scanning device displays the text recognition result in a specified area of the display screen of the scanning device, so that synchronous display of the client and the scanning device is realized, and the user can check conveniently.
In the present disclosure, the binding relationship between the candidate clients and the scanning device is determined according to the acquired text recognition instruction, the target server is determined from the candidate servers according to the binding relationship, and the acquired text scanning result is sent to the target server so that the target server determines the text recognition result. Because the server is backed by strong computing power, the text recognition speed is improved and the user's waiting time is reduced. In addition, this embodiment provides multiple candidate servers, which improves the diversity and richness of the text recognition results; the candidate servers are extensible, so more candidate servers can be connected later to further improve text recognition quality.
On the basis of the embodiment, optionally, the scanning device is a smart dictionary pen.
The intelligent dictionary pen has a high-speed camera at its tip that captures more than one hundred images per second; these images are stitched together to form the text scanning result. The pen also has a built-in network transmission module for sending the text scanning result to the target server over the network, and an external touch display screen on which the user issues text recognition instructions and views text recognition results.
By taking the intelligent dictionary pen as the scanning equipment, the application range of the text recognition method provided by the embodiment is expanded, the text recognition speed and the text recognition quality of the intelligent dictionary pen are improved, and more perfect learning assistance is provided for users.
Fig. 2A is a flowchart of a text recognition method according to an embodiment of the present disclosure, which is further optimized and expanded based on the above technical solution, and may be combined with the above various alternative embodiments.
As shown in fig. 2A, the text recognition method disclosed in this embodiment may include:
S201, displaying a text recognition function switch, and generating a text recognition instruction according to user control operation of the text recognition function switch.
In one embodiment, a user clicks a "user center" button on a touch display screen of the scanning device, and the touch display screen correspondingly displays a user center interface to the user, where the user center interface includes a text recognition function switch. The user can select to turn on or off the text recognition function switch according to the self requirement, and when the user turns on the text recognition function switch, the text recognition function is turned on, so that a text recognition instruction is generated; correspondingly, when the user turns off the text recognition function switch, the text recognition function is turned off.
Fig. 2B is a schematic diagram of opening a user center interface according to an embodiment of the disclosure, as shown in fig. 2B, 200 represents a desktop page of a touch display screen of a scanning device, 201 represents a "user center" button, and when the button 201 is clicked, the touch display screen displays the user center interface.
Fig. 2C is a schematic diagram of a user center interface according to an embodiment of the present disclosure, as shown in fig. 2C, 202 represents a user center interface, 203 represents a text recognition function switch, a user may select to turn on or off the text recognition function switch 203 according to his own needs, and after the user turns on the text recognition function switch 203, the text recognition function is turned on, so as to generate a text recognition instruction; accordingly, when the user turns off the text recognition function switch 203, the text recognition function is turned off.
Optionally, the "show text recognition function switch" in S201 includes:
controlling the style of the text recognition function switch to be a first style when the text recognition function switch is in an on state and the network state of the scanning device is a disconnected state; controlling the style of the text recognition function switch to be a second style when the text recognition function switch is in an on state and the network state of the scanning device is a connected state; controlling the style of the text recognition function switch to be a third style when the text recognition function switch is in an off state; wherein the first style, the second style and the third style are different from one another.
In one embodiment, the scanning device detects the network state and the state of the text recognition function switch in real time. When the text recognition function switch is in the on state and the network state of the scanning device is the disconnected state, the text recognition function is enabled but data cannot be transmitted to the target server because the network is disconnected, so the text recognition function cannot work normally; the scanning device therefore displays the text recognition function switch to the user in the first style.
When the text recognition function switch is in the on state and the network state of the scanning device is the connected state, the text recognition function can work normally, and the scanning device displays the text recognition function switch to the user in the second style.
When the text recognition function switch is in the off state, the text recognition function is disabled and text recognition cannot be performed, and the scanning device displays the text recognition function switch to the user in the third style.
The first style, the second style, and the third style represent different styles, namely, a text recognition function switch of the first style, a text recognition function switch of the second style, and a text recognition function switch of the third style, having different visual appearances.
FIG. 2D is a schematic diagram of a switch style disclosed in accordance with an embodiment of the present disclosure. As shown in FIG. 2D, 204 denotes a text recognition function switch of the first style, with a striped background fill and the switch button on the right side of the switch; 205 denotes a text recognition function switch of the second style, with a solid background fill and the switch button on the right side of the switch; 206 denotes a text recognition function switch of the third style, with no background fill and the switch button on the left side of the switch. These styles are given only as examples to explain the first style, the second style and the third style, and do not limit them in any way.
By controlling the style of the text recognition function switch to be the first style when the switch is in the on state and the network state of the scanning device is the disconnected state, the second style when the switch is in the on state and the network state is the connected state, and the third style when the switch is in the off state, with the three styles differing from one another, the text recognition function switch is displayed differently according to whether the scanning device can currently perform text recognition normally. The user can thus see at a glance whether text recognition is available, which improves the user experience.
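The three-way style rule can be expressed as a small decision function. This is a sketch only: the `SwitchStyle` enum and the `switch_style` function are names invented for illustration, and the visual appearance attached to each style (for example the fills shown in FIG. 2D) is left to the display layer.

```python
from enum import Enum


class SwitchStyle(Enum):
    FIRST = 1   # switch on, network disconnected: recognition unavailable
    SECOND = 2  # switch on, network connected: recognition available
    THIRD = 3   # switch off


def switch_style(switch_on: bool, network_connected: bool) -> SwitchStyle:
    """Pick the display style from the switch state and the network state."""
    if not switch_on:
        # The network state is irrelevant once the function is switched off.
        return SwitchStyle.THIRD
    return SwitchStyle.SECOND if network_connected else SwitchStyle.FIRST
```

For example, `switch_style(True, False)` yields `SwitchStyle.FIRST`, matching the case where the function is enabled but the scan result cannot reach the target server.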
S202, determining the binding relation between the candidate client and the scanning equipment according to the acquired text recognition instruction.
Optionally, after S202, the method further includes:
in the case where no candidate client has a binding relationship with the scanning device, displaying binding identification information, wherein the binding identification information is used for binding at least one candidate client with the scanning device.
In one embodiment, the scanning device determines the binding relationship between each candidate client and the scanning device. If it determines that none of the candidate clients is bound to the scanning device, binding prompt information is displayed to tell the user that at least one candidate client must be bound to the scanning device before text recognition can be performed. After the user clicks the identification button, the binding identification information is displayed, and the user binds at least one candidate client with the scanning device according to it. The binding identification information includes, but is not limited to, a two-dimensional code, a bar code or an identification code. For example, the user invokes the scanning function of any candidate client to scan the two-dimensional code, thereby binding that candidate client with the scanning device.
Fig. 2E is a schematic diagram of a binding prompt disclosed in an embodiment of the disclosure. As shown in fig. 2E, in the case where no candidate client has a binding relationship with the scanning device, binding prompt information 207 pops up in the user center interface, for example, "Please display the binding identification information in the upper right corner and scan it with the XX client before turning this on". This embodiment uses the binding prompt information 207 only as an example and does not limit the specific content of the binding prompt information.
By displaying binding identification information when no candidate client has a binding relationship with the scanning device, so that at least one candidate client can be bound with the scanning device according to the binding identification information, the user is guided to bind a candidate client and smooth execution of text recognition is ensured.
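A possible shape for this check is sketched below, under the assumption that the binding relationships are available as a mapping from client name to a bound flag. The function name, the returned dictionary and the placeholder payload are hypothetical; the disclosure only says the identification information may be a two-dimensional code, a bar code or an identification code.

```python
def binding_identification_to_show(binding_by_client):
    """Return binding identification info only when no candidate client is bound."""
    if any(binding_by_client.values()):
        # At least one client is bound, so text recognition can proceed.
        return None
    # Placeholder payload; a real device would encode its own identifier.
    return {"type": "two_dimensional_code", "payload": "DEVICE-ID-PLACEHOLDER"}


nothing_bound = binding_identification_to_show({"A": False, "B": False})
one_bound = binding_identification_to_show({"A": True, "B": False})
```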
S203, determining at least one target client with binding relation with the scanning equipment from the candidate clients, and taking a candidate server associated with the target client as the target server.
For example, assume that the candidate clients of the scanning device include a candidate client a, a candidate client B, a candidate client C, and a candidate client D, and the candidate servers associated with the candidate clients are a candidate server a, a candidate server B, a candidate server C, and a candidate server D, respectively. And if the candidate client A and the candidate client B have binding relation with the scanning equipment, taking the candidate client A and the candidate client B as target clients and taking the candidate server A and the candidate server B as target servers.
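The example above can be worked through in a few lines. The dictionary and set below simply encode the four candidate clients, their associated servers and the two bound clients from the example; the variable names are chosen for illustration.

```python
# Each candidate client is associated with one candidate server in this example.
server_for_client = {
    "client A": "server A",
    "client B": "server B",
    "client C": "server C",
    "client D": "server D",
}

# Clients A and B have a binding relationship with the scanning device.
bound_clients = {"client A", "client B"}

target_clients = [c for c in server_for_client if c in bound_clients]
target_servers = [server_for_client[c] for c in target_clients]
```

Here `target_clients` holds clients A and B, and `target_servers` holds servers A and B; servers C and D are never contacted.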
S204, sending the acquired text scanning result to the target server side, and enabling the target server side to determine a text recognition result according to the text scanning result.
Optionally, before S204, the method further includes:
determining a network state of the scanning device; and under the condition that the network state is in a disconnection state, displaying network disconnection prompt information, wherein the network disconnection prompt information is used for controlling the scanning equipment to carry out network connection according to the network disconnection prompt information.
In one embodiment, the scanning device detects whether the network transmission function of the scanning device is normal, and if the network transmission function is abnormal, that is, if the network state is in a disconnection state, the scanning device displays a network disconnection prompting message to prompt a user to control the scanning device to reconnect the network.
Fig. 2F is a schematic diagram of a network disconnection prompting message according to an embodiment of the disclosure. As shown in fig. 2F, in the case where the network state of the scanning device is the disconnected state, a network disconnection prompting message 208 pops up in the user center interface, for example, "Network disconnected. Please connect to a network before turning this on". This embodiment uses the network disconnection prompting message 208 only as an example and does not limit the specific content of the network disconnection prompting message.
By determining the network state of the scanning device and, when the network state is the disconnected state, displaying network disconnection prompting information that prompts the user to connect the scanning device to a network, the user is guided to restore the network connection and smooth execution of text recognition is ensured.
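The check performed before S204 can be sketched as a guard around the send step. The function name, the returned dictionary and the prompt wording are assumptions for illustration; `send` stands in for whatever transport the scanning device actually uses.

```python
def send_if_connected(scan_result, network_connected, send):
    """Send the scan result only when the network is connected.

    When disconnected, nothing is sent and a prompt is returned for display.
    """
    if not network_connected:
        return {
            "sent": False,
            "prompt": "Network disconnected. Please connect to a network.",
        }
    send(scan_result)
    return {"sent": True, "prompt": None}


outbox = []  # stand-in for the network transport
offline = send_if_connected(b"scan", network_connected=False, send=outbox.append)
online = send_if_connected(b"scan", network_connected=True, send=outbox.append)
```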
By displaying the text recognition function switch and generating the text recognition instruction according to the user's control operation on the switch, the user can turn the text recognition function on or off, which improves the user experience. By determining, from the candidate clients, at least one target client having a binding relationship with the scanning device and taking the candidate server associated with the target client as the target server, the server used for text recognition is determined and smooth execution of text recognition is ensured. In addition, this embodiment provides multiple candidate servers, which improves the diversity and richness of the text recognition results.
On the basis of the above embodiment, optionally, the text recognition function switch may also be displayed in the form of a shortcut icon in a shortcut setting page of the scanning device.
Fig. 2G is a schematic diagram of a shortcut icon disclosed in an embodiment of the present disclosure. As shown in fig. 2G, 209 is the shortcut icon of the text recognition function switch in the shortcut setting page, and the user can operate the shortcut icon 209 to turn the text recognition function on or off. When the text recognition function switch is in the on state and the network state of the scanning device is the disconnected state, the shortcut icon 209 is controlled to be in a fourth style; when the text recognition function switch is in the on state and the network state of the scanning device is the connected state, the shortcut icon 209 is controlled to be in a fifth style; when the text recognition function switch is in the off state, the shortcut icon 209 is controlled to be in a sixth style; wherein the fourth style, the fifth style and the sixth style are different from one another.
The shortcut icon of the text recognition function switch is displayed in the shortcut setting page, so that a user can conveniently and quickly control the on or off of the text recognition function, and the efficiency is improved.
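The three-way style selection described for the shortcut icon 209 can be sketched as a small pure function. This is only an illustrative sketch: the names `IconStyle` and `shortcut_icon_style` are hypothetical, and the disclosure does not say what the three styles look like, only that they differ.

```python
from enum import Enum


class IconStyle(Enum):
    """Hypothetical labels for the three distinct styles of shortcut icon 209."""
    FOURTH = "fourth"  # switch on, scanning device network disconnected
    FIFTH = "fifth"    # switch on, scanning device network connected
    SIXTH = "sixth"    # switch off (network state is not considered)


def shortcut_icon_style(switch_on: bool, network_connected: bool) -> IconStyle:
    """Map the switch state and the network state to one of the three styles."""
    if not switch_on:
        return IconStyle.SIXTH
    return IconStyle.FIFTH if network_connected else IconStyle.FOURTH
```

Because the off state ignores the network state, the four input combinations collapse to three styles, which matches the three cases listed above.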
During research and development, the applicant found that existing methods for displaying text recognition results mostly use short-range communication technology, such as Wi-Fi or Bluetooth, to transmit the text recognition result obtained by the scanning device to the client for display. However, this approach requires the scanning device and the terminal on which the client runs to stay close to each other, which is a significant limitation.
Fig. 3A is a flowchart of a text display method according to an embodiment of the present disclosure, where the embodiment may be applicable to a case where a client displays a text recognition result. The method of the present embodiment may be performed by the text display device disclosed in the embodiments of the present disclosure, where the device may be implemented in software and/or hardware, and may be integrated on any electronic device having computing capabilities.
As shown in fig. 3A, the text display method disclosed in this embodiment may include:
S301, acquiring a text recognition result from a target server, where the target server is determined from the candidate servers according to the binding relationship between the candidate clients and the scanning device.
In one embodiment, at least one target client having a binding relationship with the scanning device is determined from the candidate clients, the candidate server associated with the target client is taken as the target server, and the client obtains the text recognition result from the target server.
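How the target server follows from the binding relationship can be sketched as below. The data model is an assumption made for illustration: `CandidateClient`, its fields, and `resolve_target_servers` are hypothetical names, and the disclosure does not specify how bindings are stored.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List


@dataclass(frozen=True)
class CandidateClient:
    """Hypothetical record for one candidate client."""
    client_id: str
    server_id: str                    # candidate server associated with this client
    bound_device_ids: FrozenSet[str]  # scanning devices bound to this client


def resolve_target_servers(device_id: str,
                           candidates: Iterable[CandidateClient]) -> List[str]:
    """Return the servers associated with the clients bound to the device."""
    # Target clients are the candidate clients bound to this scanning device.
    targets = [c for c in candidates if device_id in c.bound_device_ids]
    # Each target client contributes its associated server; duplicates are
    # dropped while the original order is preserved.
    return list(dict.fromkeys(c.server_id for c in targets))
```

An empty return value corresponds to the case in which no candidate client is bound to the scanning device, so no target server can be determined.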
In an actual scenario, the user clicks a "real-time display" button in a function page of the client, and prompt information pops up, for example, "Please confirm that the bound scanning device is connected to the network". After the user clicks the confirm button, the client determines whether the client and the scanning device are each in an available state.
Fig. 3B is an interface schematic diagram of a function page according to an embodiment of the present disclosure. As shown in Fig. 3B, 300 is the function interface of the client. After the user clicks the real-time display button 301, prompt information pops up, for example, "Please confirm that the bound scanning device is connected to the network", and after the user clicks the confirm button, the client determines whether the client and the scanning device are each in an available state.
If the network state of the terminal on which the client runs is connected, the client is determined to be in an available state; if the text recognition function switch of the scanning device is turned on, the scanning device is determined to be in an available state.
When the scanning device is in an unavailable state, guidance information is displayed, for example, "Please check whether the text recognition function switch of the scanning device is turned on". When the client itself is in an unavailable state, guidance information is displayed, for example, "Please check the mobile phone network and try again".
When the client and the scanning device are both in the available state, guidance information, such as "Start scanning to look up words", is displayed, and the recognition result is displayed synchronously. The client then starts to acquire the text recognition result from the target server.
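The availability check performed after the user confirms the prompt can be sketched as follows. `Availability` and `check_availability` are hypothetical names, and the message strings are illustrative wording. The disclosure does not say which message takes priority when both sides are unavailable; checking the scanning device first is an assumption of this sketch.

```python
from typing import NamedTuple


class Availability(NamedTuple):
    ready: bool    # True only if both the client and the scanning device are available
    guidance: str  # guidance information to display to the user


def check_availability(client_network_connected: bool,
                       device_switch_on: bool) -> Availability:
    """Decide whether result acquisition may start and which guidance to show."""
    # The scanning device is available only if its text recognition switch is on.
    # Assumption: the device is checked before the client.
    if not device_switch_on:
        return Availability(False, "Please check whether the text recognition "
                                   "function switch of the scanning device is turned on")
    # The client is available only if its terminal's network is connected.
    if not client_network_connected:
        return Availability(False, "Please check the mobile phone network and try again")
    return Availability(True, "Start scanning")
```

Only when `ready` is true would the client go on to acquire the text recognition result from the target server.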
S302, displaying the text recognition result.
In one embodiment, the client first obtains the OCR result from the target server and displays it on screen, and then obtains the other types of text recognition results from the target server. Displaying the OCR result first gives the user a fast display of the recognition result and reduces the user's waiting time.
Fig. 3C is a schematic diagram of text recognition result display according to an embodiment of the present disclosure. As shown in Fig. 3C, the OCR result 302 is displayed first in the client, together with prompt information 303, "Querying", which prompts the user to wait patiently for the other types of text recognition results.
After the client acquires the other types of text recognition results from the target server, it displays them in full.
Fig. 3D is a schematic diagram of text recognition result display according to an embodiment of the present disclosure. As shown in Fig. 3D, the client displays the complete text recognition result.
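The two-stage display of Fig. 3C and Fig. 3D can be sketched as a function that renders twice: once with the OCR result and a waiting prompt, and once with everything. The callables `fetch_ocr`, `fetch_other`, and `render`, and the dictionary layout of a frame, are hypothetical stand-ins for the client's network and UI layers, which the disclosure does not specify.

```python
from typing import Any, Callable, Dict


def present_results(fetch_ocr: Callable[[], str],
                    fetch_other: Callable[[], Dict[str, Any]],
                    render: Callable[[Dict[str, Any]], None]) -> None:
    """Show the OCR result first, then the complete recognition result."""
    # Stage 1 (Fig. 3C): show the OCR text immediately with a waiting prompt.
    ocr_text = fetch_ocr()
    render({"ocr": ocr_text, "details": None, "prompt": "Querying"})
    # Stage 2 (Fig. 3D): once the other result types arrive, show everything
    # and clear the waiting prompt.
    details = fetch_other()
    render({"ocr": ocr_text, "details": details, "prompt": None})
```

For example, passing `frames.append` as `render` collects the two frames in order, which makes the staging easy to inspect.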
During display, the network of the scanning device or the network of the terminal on which the client runs may be disconnected. In that case, the client displays prompt information, for example, "Please check the network states of the mobile phone and the scanning device". After the user has restored both the terminal on which the client runs and the scanning device to a connected state, the user clicks a "re-acquire" button, which triggers the client to acquire the text recognition result from the target server again and display it.
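The disconnect-and-retry behavior can be sketched as a small page object. `ResultPage`, `both_connected`, and `fetch_result` are hypothetical names; how the client actually learns the scanning device's network state is not described in this passage, so it is abstracted here as a callable.

```python
from typing import Callable, Optional

RECHECK_PROMPT = "Please check the network states of the mobile phone and the scanning device"


class ResultPage:
    """Minimal model of the result page with a 're-acquire' button."""

    def __init__(self, both_connected: Callable[[], bool],
                 fetch_result: Callable[[], str]) -> None:
        self._both_connected = both_connected
        self._fetch_result = fetch_result
        self.prompt: Optional[str] = None
        self.result: Optional[str] = None

    def acquire(self) -> bool:
        """Fetch and store the result, or show the prompt if a network is down."""
        if not self._both_connected():
            self.prompt = RECHECK_PROMPT
            return False
        self.result = self._fetch_result()
        self.prompt = None
        return True

    def on_reacquire_clicked(self) -> bool:
        """Handler for the 're-acquire' button: simply try again."""
        return self.acquire()
```

The retry is user-triggered rather than automatic, mirroring the description, in which the user restores both connections before clicking the button.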
In this embodiment, the text recognition result is acquired from the target server, where the target server is determined from the candidate servers according to the binding relationship between the candidate clients and the scanning device, and the text recognition result is then displayed. The text recognition result is thereby displayed to the user in the client, so that the user can view it more conveniently. Because the scanning device does not need to send the text recognition result to the client, the terminal on which the client runs does not have to stay close to the scanning device, which greatly widens the applicability of the method and reduces its limitations. In addition, because this embodiment provides a plurality of candidate servers, the diversity and richness of the text recognition results are improved.
Fig. 4 is a schematic structural diagram of a text recognition device according to an embodiment of the present disclosure, which may be suitable for a case of performing text recognition based on a server. The device of the embodiment can be implemented by software and/or hardware, and can be integrated on any electronic equipment with computing capability.
As shown in fig. 4, the text recognition device 40 disclosed in this embodiment may include a binding relationship determining module 41 and a scan result transmitting module 42, where:
the binding relationship determining module 41 is configured to determine a binding relationship between the candidate client and the scanning device according to the acquired text recognition instruction.
the scan result sending module 42 is configured to determine a target server from the candidate servers according to the binding relationship, and to send the acquired text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
Optionally, the scan result sending module 42 is specifically configured to:
determining at least one target client having a binding relationship with the scanning device from the candidate clients;
and taking the candidate server associated with the target client as the target server.
Optionally, the device further comprises a switch display module, specifically configured to:
displaying a text recognition function switch;
and generating the text recognition instruction according to the user control operation of the text recognition function switch.
Optionally, the switch display module is specifically further configured to:
controlling the pattern of the text recognition switch to be in a first pattern when the text recognition function switch is in an on state and the network state of the scanning equipment is in an off state;
controlling the pattern of the text recognition switch to be in a second pattern when the text recognition function switch is in an on state and the network state of the scanning device is in a connection state;
controlling the pattern of the text recognition switch to be in a third pattern under the condition that the text recognition function switch is in an off state;
wherein the first pattern, the second pattern, and the third pattern are different.
Optionally, the device further comprises a binding identification information display module, specifically configured to:
in the case where no candidate client has a binding relationship with the scanning device, displaying binding identification information, wherein the binding identification information is used for controlling at least one candidate client to bind with the scanning device according to the binding identification information.
Optionally, the device further includes a prompt information display module, specifically configured to:
determining a network state of the scanning device;
and under the condition that the network state is in a disconnection state, displaying network disconnection prompt information, wherein the network disconnection prompt information is used for controlling the scanning equipment to carry out network connection according to the network disconnection prompt information.
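The way the optional modules above fit around the sending step can be sketched as one device-side function. Everything named here (`ScanningDevice`, `handle_scan_result`, the `bindings` mapping, and the placeholder messages) is hypothetical. Checking the binding before the network state is also an assumption: the disclosure only places the binding check after the binding relationship is determined and the network check before sending.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

BINDING_INFO = "binding identification information"
DISCONNECT_PROMPT = "network disconnection prompt information"


@dataclass
class ScanningDevice:
    device_id: str
    network_connected: bool
    # Hypothetical storage: bound client id -> associated candidate server id.
    bindings: Dict[str, str]
    shown: List[str] = field(default_factory=list)  # messages displayed so far

    def show(self, message: str) -> None:
        self.shown.append(message)


def handle_scan_result(device: ScanningDevice, scan_result: str,
                       send: Callable[[str, str], None]) -> List[str]:
    """Send the scan result to every target server; return the servers used."""
    if not device.bindings:
        # No candidate client is bound: show information that lets one bind.
        device.show(BINDING_INFO)
        return []
    if not device.network_connected:
        # Guide the user to connect the scanning device to a network first.
        device.show(DISCONNECT_PROMPT)
        return []
    servers = list(dict.fromkeys(device.bindings.values()))
    for server_id in servers:
        send(server_id, scan_result)
    return servers
```

The `send` callable stands in for whatever transport the scanning device uses to reach a server, which this passage leaves open.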
Optionally, the scanning device is a smart dictionary pen.
The text recognition device 40 disclosed in the embodiments of the present disclosure can perform the text recognition method disclosed in the embodiments of the present disclosure, and has the functional modules and beneficial effects corresponding to that method. For details not explicitly described in this embodiment, reference is made to the description of any method embodiment of the disclosure.
Fig. 5 is a schematic structural diagram of a text display device according to an embodiment of the present disclosure, which may be suitable for a case where a client displays a text recognition result. The device of the embodiment can be implemented by software and/or hardware, and can be integrated on any electronic equipment with computing capability.
As shown in fig. 5, the text display device 50 disclosed in this embodiment may include a recognition result obtaining module 51 and a recognition result display module 52, where:
the recognition result obtaining module 51 is configured to obtain a text recognition result from the target server, where the target server is determined from the candidate servers according to the binding relationship between the candidate clients and the scanning device;
and the recognition result display module 52 is configured to display the text recognition result.
The text display device 50 disclosed in the embodiments of the present disclosure can perform the text display method disclosed in the embodiments of the present disclosure, and has the functional modules and beneficial effects corresponding to that method. For details not explicitly described in this embodiment, reference is made to the description of any method embodiment of the disclosure.
In the technical solution of the present disclosure, the collection, storage, use, processing, transmission, provision, and disclosure of users' personal information all comply with the relevant laws and regulations and do not violate public order and good morals.
According to embodiments of the present disclosure, the present disclosure also provides an electronic device, a readable storage medium and a computer program product.
Fig. 6 illustrates a schematic block diagram of an example electronic device 600 that may be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital assistants, cellular telephones, smartphones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the disclosure described and/or claimed herein.
As shown in Fig. 6, the device 600 includes a computing unit 601 that can perform various appropriate actions and processes according to a computer program stored in a Read Only Memory (ROM) 602 or a computer program loaded from a storage unit 608 into a Random Access Memory (RAM) 603. In the RAM 603, various programs and data required for the operation of the device 600 may also be stored. The computing unit 601, ROM 602, and RAM 603 are connected to each other by a bus 604. An input/output (I/O) interface 605 is also connected to bus 604.
Various components in the device 600 are connected to the I/O interface 605, including: an input unit 606 such as a keyboard, mouse, etc.; an output unit 607 such as various types of displays, speakers, and the like; a storage unit 608, such as a magnetic disk, optical disk, or the like; and a communication unit 609 such as a network card, modem, wireless communication transceiver, etc. The communication unit 609 allows the device 600 to exchange information/data with other devices via a computer network, such as the internet, and/or various telecommunication networks.
The computing unit 601 may be a variety of general and/or special purpose processing components having processing and computing capabilities. Some examples of computing unit 601 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various specialized Artificial Intelligence (AI) computing chips, various computing units running machine learning model algorithms, a Digital Signal Processor (DSP), and any suitable processor, controller, microcontroller, etc. The computing unit 601 performs the various methods and processes described above, such as a text recognition method and/or a text presentation method. For example, in some embodiments, the text recognition method and/or the text presentation method may be implemented as a computer software program tangibly embodied on a machine-readable medium, such as storage unit 608. In some embodiments, part or all of the computer program may be loaded and/or installed onto the device 600 via the ROM 602 and/or the communication unit 609. When the computer program is loaded into the RAM 603 and executed by the computing unit 601, one or more steps of the text recognition method and/or the text presentation method described above may be performed. Alternatively, in other embodiments, the computing unit 601 may be configured to perform the text recognition method and/or the text presentation method in any other suitable manner (e.g., by means of firmware).
Various implementations of the systems and techniques described above may be implemented in digital electronic circuitry, integrated circuit systems, field programmable gate arrays (FPGAs), application specific integrated circuits (ASICs), application specific standard products (ASSPs), systems on chip (SOCs), complex programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various implementations may include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, and which may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
Program code for carrying out methods of the present disclosure may be written in any combination of one or more programming languages. The program code may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus such that the program code, when executed by the processor or controller, causes the functions/operations specified in the flowchart and/or block diagram to be implemented. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package partly on the machine and partly on a remote machine, or entirely on the remote machine or server.
In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. The machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and pointing device (e.g., a mouse or trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include local area networks (LANs), wide area networks (WANs), blockchain networks, and the internet.
The computer system may include a client and a server. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. The server may be a cloud server, a server of a distributed system, or a server incorporating a blockchain.
It should be appreciated that steps may be reordered, added, or deleted using the various forms of flows shown above. For example, the steps recited in the present disclosure may be performed in parallel, sequentially, or in a different order, provided that the desired results of the technical solutions of the present disclosure are achieved, which is not limited herein.
The above detailed description should not be taken as limiting the scope of the present disclosure. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present disclosure are intended to be included within the scope of the present disclosure.

Claims (16)

1. A text recognition method, comprising:
determining a binding relation between a candidate client and scanning equipment according to an acquired text recognition instruction, wherein the text recognition instruction is an instruction for triggering the scanning equipment to perform text scanning;
determining at least one target client having a binding relationship with the scanning device from the candidate clients;
taking the candidate server associated with the target client as a target server;
and sending the acquired text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
2. The method of claim 1, further comprising, prior to determining the binding relationship between the candidate client and the scanning device according to the obtained text recognition instruction:
displaying a text recognition function switch;
and generating the text recognition instruction according to the user control operation of the text recognition function switch.
3. The method of claim 2, wherein displaying the text recognition function switch comprises:
controlling the pattern of the text recognition switch to be in a first pattern when the text recognition function switch is in an on state and the network state of the scanning equipment is in an off state;
controlling the pattern of the text recognition switch to be in a second pattern when the text recognition function switch is in an on state and the network state of the scanning device is in a connection state;
controlling the pattern of the text recognition switch to be in a third pattern under the condition that the text recognition function switch is in an off state;
wherein the first pattern, the second pattern, and the third pattern are different.
4. The method of claim 1, further comprising, after the determining the binding relationship between the candidate client and the scanning device:
in the case where no candidate client has a binding relationship with the scanning device, displaying binding identification information, wherein the binding identification information is used for controlling at least one candidate client to bind with the scanning device according to the binding identification information.
5. The method of claim 1, further comprising, before the sending the obtained text scan result to the target server:
determining a network state of the scanning device;
and under the condition that the network state is in a disconnection state, displaying network disconnection prompt information, wherein the network disconnection prompt information is used for controlling the scanning equipment to carry out network connection according to the network disconnection prompt information.
6. The method of claim 1, wherein the scanning device is a smart dictionary pen.
7. A text display method, comprising:
acquiring a text recognition result from a target server, wherein the target server is determined by: determining at least one target client having a binding relationship with the scanning device from the candidate clients; and taking the candidate server associated with the target client as the target server;
wherein the text recognition result is determined by: the target server recognizing the scanning result obtained by the scanning device and determining the text recognition result corresponding to the scanning result;
and displaying the text recognition result.
8. A text recognition device, comprising:
the binding relation determining module is used for determining the binding relation between the candidate client and the scanning equipment according to the acquired text recognition instruction, wherein the text recognition instruction is an instruction for triggering the scanning equipment to perform text scanning;
a scanning result sending module, configured to: determine at least one target client having a binding relationship with the scanning device from the candidate clients;
take the candidate server associated with the target client as a target server;
and send the acquired text scanning result to the target server, so that the target server determines a text recognition result according to the text scanning result.
9. The apparatus of claim 8, further comprising a switch presentation module, in particular for:
displaying a text recognition function switch;
and generating the text recognition instruction according to the user control operation of the text recognition function switch.
10. The apparatus of claim 9, wherein the switch presentation module is further specifically configured to:
controlling the pattern of the text recognition switch to be in a first pattern when the text recognition function switch is in an on state and the network state of the scanning equipment is in an off state;
controlling the pattern of the text recognition switch to be in a second pattern when the text recognition function switch is in an on state and the network state of the scanning device is in a connection state;
controlling the pattern of the text recognition switch to be in a third pattern under the condition that the text recognition function switch is in an off state;
wherein the first pattern, the second pattern, and the third pattern are different.
11. The apparatus of claim 8, further comprising a binding identification information presentation module, in particular for:
in the case where no candidate client has a binding relationship with the scanning device, displaying binding identification information, wherein the binding identification information is used for controlling at least one candidate client to bind with the scanning device according to the binding identification information.
12. The apparatus of claim 8, further comprising a hint information presentation module, specifically configured to:
determining a network state of the scanning device;
and under the condition that the network state is in a disconnection state, displaying network disconnection prompt information, wherein the network disconnection prompt information is used for controlling the scanning equipment to carry out network connection according to the network disconnection prompt information.
13. The apparatus of claim 8, wherein the scanning device is a smart dictionary pen.
14. A text display device comprising:
the identification result acquisition module is used for:
acquiring a text recognition result from a target server, wherein the target server is determined by: determining at least one target client having a binding relationship with the scanning device from the candidate clients; and taking the candidate server associated with the target client as the target server;
wherein the text recognition result is determined by: the target server recognizing the scanning result obtained by the scanning device and determining the text recognition result corresponding to the scanning result;
and the recognition result display module is used for displaying the text recognition result.
15. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-7.
16. A non-transitory computer readable storage medium storing computer instructions for causing the computer to perform the method of any one of claims 1-7.
CN202110938868.6A 2021-08-16 2021-08-16 Text recognition and display method, device, electronic equipment and medium Active CN113641439B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110938868.6A CN113641439B (en) 2021-08-16 2021-08-16 Text recognition and display method, device, electronic equipment and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110938868.6A CN113641439B (en) 2021-08-16 2021-08-16 Text recognition and display method, device, electronic equipment and medium

Publications (2)

Publication Number Publication Date
CN113641439A CN113641439A (en) 2021-11-12
CN113641439B true CN113641439B (en) 2023-08-29

Family

ID=78422151

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110938868.6A Active CN113641439B (en) 2021-08-16 2021-08-16 Text recognition and display method, device, electronic equipment and medium

Country Status (1)

Country Link
CN (1) CN113641439B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115086501A (en) * 2022-05-19 2022-09-20 阿波罗智联(北京)科技有限公司 Scanning method, scanning device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102904909A (en) * 2011-07-25 2013-01-30 上海博路信息技术有限公司 OCR (Optical Character Recognition) method based on cloud model
CN102902968A (en) * 2011-07-25 2013-01-30 上海博路信息技术有限公司 Method for quickly acquiring publication content by scanning with mobile phone
CN103176964A (en) * 2011-12-21 2013-06-26 上海博路信息技术有限公司 Translation auxiliary system based on OCR
CN110611685A (en) * 2019-10-30 2019-12-24 南宁市指搜信息技术有限公司 Internet site login system based on intelligent equipment monitoring and user identity recognition
CN111862940A (en) * 2020-07-15 2020-10-30 百度在线网络技术(北京)有限公司 Earphone-based translation method, device, system, equipment and storage medium
CN112382286A (en) * 2020-11-10 2021-02-19 苏州思必驰信息科技有限公司 Method, device and system for realizing online evaluation of learning effect based on intelligent voice control

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9037450B2 (en) * 2012-12-14 2015-05-19 Microsoft Technology Licensing, Llc Text overlay techniques in realtime translation
US9870357B2 (en) * 2013-10-28 2018-01-16 Microsoft Technology Licensing, Llc Techniques for translating text via wearable computing device

CN110531952B (en) Voice skill processing method, device and equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant