[go: up one dir, main page]

CN106506437B - Audio data processing method and device - Google Patents

Audio data processing method and device Download PDF

Info

Publication number
CN106506437B
CN106506437B CN201510566251.0A CN201510566251A CN106506437B CN 106506437 B CN106506437 B CN 106506437B CN 201510566251 A CN201510566251 A CN 201510566251A CN 106506437 B CN106506437 B CN 106506437B
Authority
CN
China
Prior art keywords
audio data
parameters
audio
configuration information
adjustment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510566251.0A
Other languages
Chinese (zh)
Other versions
CN106506437A (en
Inventor
赵娜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tencent Technology Shenzhen Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tencent Technology Shenzhen Co Ltd filed Critical Tencent Technology Shenzhen Co Ltd
Priority to CN201510566251.0A priority Critical patent/CN106506437B/en
Publication of CN106506437A publication Critical patent/CN106506437A/en
Application granted granted Critical
Publication of CN106506437B publication Critical patent/CN106506437B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/762Media network packet handling at the source 

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Telephone Function (AREA)

Abstract

The embodiment of the invention discloses an audio data processing method and equipment, wherein the method comprises the following steps: acquiring original audio data and acquiring adjustment parameters; adjusting the original audio data according to the adjustment parameters to obtain target audio data; and sending the target audio data to the target equipment in the form of a short voice message. And adjusting the original audio data through the acquired adjustment parameters to obtain the target audio. The target audio is a short voice message, can be applied to a plurality of application scenes of audio data carried in a short message form, and can flexibly control the attribute of the sent audio data, so that the audio has richer expression forms.

Description

Audio data processing method and device
Technical Field
The present invention relates to the field of communications technologies, and in particular, to an audio data processing method and device.
Background
Audio is a term of art and all sounds that a human being can hear are referred to as audio, which may include noise, etc. After the sound is recorded, either the speech sound, the singing sound, the musical instrument can be processed by software, or it is made into a file of audio data and stored in a storage device, all the sounds being unchanged at this time. Files of audio data may be stored in the computer. If a computer is provided with a corresponding audio card, namely a sound card which is often called, all sounds can be recorded, and acoustic characteristics of the sounds, such as sound level and the like, can be stored in a hard disk file of the computer. Conversely, the stored audio file can be played by a certain audio program to restore the previously recorded sound.
Audio data is typically transmitted in a network from one end of the device to the other as communication signals, such as digital/analog/optical signals. Digital audio data is not easily lost. And analog audio can be heard, but the modification loss of the audio data of the analog signal during the recording process is too large.
The transmission of audio data has wide application in the field of communication technology, for example: WeChat voice function, Internet phone, etc. in the instant communication tool.
At present, the voice transmission function is single, and usually, the voice data transmission to the opposite terminal is finished.
Disclosure of Invention
The embodiment of the invention provides an audio data processing method and equipment, which are used for enriching the processing of short audio data, so that audio can be flexibly controlled to have richer expression forms.
An audio data processing method, comprising:
acquiring original audio data and acquiring adjustment parameters;
adjusting the original audio data according to the adjustment parameters to obtain target audio data;
and sending the target audio data to the target equipment in the form of a short voice message.
An audio data processing device comprising:
the audio acquisition unit is used for acquiring original audio data;
a parameter obtaining unit for obtaining an adjustment parameter;
the audio adjusting unit is used for adjusting the original audio data according to the adjusting parameters to obtain target audio data;
and the audio sending unit is used for sending the target audio data to the target equipment in the form of a short voice message.
According to the technical scheme, the embodiment of the invention has the following advantages: and adjusting the original audio data through the acquired adjustment parameters to obtain the target audio. The target audio is a short voice message, can be applied to a plurality of application scenes of audio data carried in a short message form, and can flexibly control the attribute of the sent audio data, so that the audio has richer expression forms.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.
FIG. 1 is a schematic flow chart of a method according to an embodiment of the present invention;
FIG. 2 is a schematic flow chart of a method according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of an apparatus according to an embodiment of the present invention;
FIG. 4 is a schematic structural diagram of an apparatus according to an embodiment of the present invention;
FIG. 5 is a schematic diagram of an apparatus according to an embodiment of the present invention;
fig. 6 is a schematic structural diagram of an apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention clearer, the present invention will be described in further detail with reference to the accompanying drawings, and it is apparent that the described embodiments are only a part of the embodiments of the present invention, not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
An embodiment of the present invention provides an audio data processing method, as shown in fig. 1, including:
101: acquiring original audio data and acquiring adjustment parameters;
the execution main body of the embodiment may be any device, especially a terminal device, for example: mobile terminal devices such as mobile phones and PADs. Or may be a terminal device such as a personal computer. It is known from the functions it performs that the device needs to have a speech acquisition device. In a broad sense, the audio acquisition may be directly acquiring audio information from an acquisition device side, or may be receiving audio information from other devices or audio acquisition devices. The original audio data is relative to the adjusted target audio data, and it is not necessary that the audio acquisition device acquires the directly obtained audio data.
102: adjusting the original audio data according to the adjustment parameters to obtain target audio data;
different audio adjustment parameters, such as frequency, loudness, sampling rate, background type, etc., of the audio may be used depending on the requirements of the target audio data.
103: and sending the target audio data to the target equipment in the form of a short voice message.
In this embodiment, the original audio data is adjusted by the obtained adjustment parameter, so as to obtain the target audio. The target audio is a short voice message, can be applied to a plurality of application scenes of audio data carried in a short message form, and can flexibly control the attribute of the sent audio data, so that the audio has richer expression forms.
The embodiment of the invention also provides a starting scene for acquiring the adjustment parameters, which specifically comprises the following steps: before the obtaining of the adjustment parameter, the method further includes:
and receiving an audio adjusting instruction, or determining that audio parameters need to be adjusted according to the read configuration information.
The former approach can be applied when the user specifies that the audio needs to be adjusted, for example: in the WeChat voice process, a user inputs an instruction for selecting to be adjusted after recording a section of audio. The latter method is more intelligent, and the user can pre-configure the information whether the voice transmission of the phrase voice message class needs to be adjusted in the application software, so that the device automatically calls the parameters to determine whether to start the adjustment process. According to different requirements, the two schemes can be combined and selected.
In this embodiment, an alternative form of configuration information is given, where the configuration information includes:
the configuration information is configured to correspond to a receiving object of the target audio; alternatively, the configuration information may be configured to correspond to a transmission method.
In this embodiment, the former method, in which the configuration information corresponds to the receiving object, means that the configuration information is configured to which voice information of the receiving object needs to be adjusted, for example: to a specific group, or user. The latter method uses configuration information corresponding to the transmission method, for example: corresponding to the transmission modes such as group transmission, broadcast, specific user designation and the like, one or more transmission modes are designated as required to be adjusted by default.
In this embodiment, the receiving manner of the adjustment parameter may be as follows: the acquiring of the adjustment parameter includes:
and displaying an adjustment parameter input interface, and acquiring adjustment parameters from the adjustment parameter input interface.
The parameter input interface in this embodiment may be configured according to an adjustment type supported by audio adjustment, and a preferred implementation scheme is provided in this embodiment of the present invention, which is specifically as follows: the displaying an adjustment parameter input interface, acquiring an adjustment parameter from the adjustment parameter input interface, includes:
and displaying a selection identifier on a displayed parameter input interface, and reading a preset adjustment parameter corresponding to the selection identifier according to the received selection identifier.
In this embodiment, the user only needs to select the selection identifier, and the selection identifier may be some identifying prompt information, for example: male/female voices, children, etc.; it may also be a background sound: clapping, music selection, etc. The pre-configured adjusting parameters are pre-configured based on the selection identifier, so that the audio attribute can be adjusted to the parameters corresponding to the selection identifier, and the parameters can be combined. For example: loudness, sound ray, frequency, etc.
Based on the above description, it can be seen that: the selection identifier includes: sound parameters and/or sound effect parameters. If there are other adjustment requirements, the selection identifier may also include other parameter types, and the embodiment of the present invention is not limited uniquely. The sound parameter may be loudness, sound ray, frequency, or other parameters.
The embodiment of the present invention further provides an application example based on WeChat, please refer to the flow shown in FIG. 2, which includes:
201: receiving an instruction for selecting a certain voice message;
in this step, the voice message carries original audio data collected by the microphone of the mobile phone through the WeChat.
202: inquiring whether a special effect needs to be added, if so, entering the next step, and if not, entering 204;
203: executing the adjustment operation of changing sound or adding sound effect to obtain target audio;
special effects can be classified into 2 types:
short voice change: after the frequency is switched, the frequency can be switched into boys, girls, children and the like;
and (3) sound effect addition: clapping, kissing, background music, etc.
204: and sending the voice message to the target user.
In this step, if 202 is entered, it is the voice message selected by the user, and if 203 is entered, it is the voice message carrying the target audio.
In the above embodiment, an "phrase sound special effect" entry needs to be set in the application of the phrase sound, and a specific flow of the operation at the user side is as follows: and (4) recording a short voice with a friend chatting window, entering a 'phrase voice special effect', setting an effect, and sending the friend. After listening to the voice message, the friend clicks to play the voice message, and plays the voice message with the special effect.
The embodiment provides a way and a method for using the voice special effect of the instant chat tool user by publishing a 'phrase voice special effect' mechanism in the instant chat tool.
The present invention also provides an audio data processing apparatus, as shown in fig. 3, including:
an audio acquisition unit 301, configured to acquire original audio data;
a parameter obtaining unit 302, configured to obtain an adjustment parameter;
an audio adjusting unit 303, configured to adjust the original audio data according to the adjustment parameter to obtain target audio data;
an audio sending unit 304, configured to send the target audio data to the target device in the form of a short voice message.
The execution main body of the embodiment may be any device, especially a terminal device, for example: mobile terminal devices such as mobile phones and PADs. Or may be a terminal device such as a personal computer. It is known from the functions it performs that the device needs to have a speech acquisition device. In a broad sense, the audio acquisition may be directly acquiring audio information from an acquisition device side, or may be receiving audio information from other devices or audio acquisition devices. The original audio data is relative to the adjusted target audio data, and it is not necessary that the audio acquisition device acquires the directly obtained audio data.
Different audio adjustment parameters, such as frequency, loudness, sampling rate, background type, etc., of the audio may be used depending on the requirements of the target audio data.
In this embodiment, the original audio data is adjusted by the obtained adjustment parameter, so as to obtain the target audio. The target audio is a short voice message, can be applied to a plurality of application scenes of audio data carried in a short message form, and can flexibly control the attribute of the sent audio data, so that the audio has richer expression forms.
The embodiment of the invention also provides a starting scene for acquiring the adjustment parameters, which specifically comprises the following steps: as shown in fig. 4, the above apparatus further includes:
the adjustment starting unit 401 is configured to receive an audio adjustment instruction before the adjustment parameter is obtained, or determine that the audio parameter needs to be adjusted according to the read configuration information.
The former approach can be applied when the user specifies that the audio needs to be adjusted, for example: in the WeChat voice process, a user inputs an instruction for selecting to be adjusted after recording a section of audio. The latter method is more intelligent, and the user can pre-configure the information whether the voice transmission of the phrase voice message class needs to be adjusted in the application software, so that the device automatically calls the parameters to determine whether to start the adjustment process. According to different requirements, the two schemes can be combined and selected.
In this embodiment, an alternative form of configuration information is given, where the configuration information includes:
the configuration information is configured to correspond to a receiving object of the target audio; alternatively, the configuration information may be configured to correspond to a transmission method.
In this embodiment, the former method, in which the configuration information corresponds to the receiving object, means that the configuration information is configured to which voice information of the receiving object needs to be adjusted, for example: to a specific group, or user. The latter method uses configuration information corresponding to the transmission method, for example: corresponding to the transmission modes such as group transmission, broadcast, specific user designation and the like, one or more transmission modes are designated as required to be adjusted by default.
In this embodiment, the receiving manner of the adjustment parameter may be as follows: the parameter obtaining unit 302 is configured to control a display to display an adjustment parameter input interface, and obtain an adjustment parameter from the adjustment parameter input interface.
The parameter input interface in this embodiment may be configured according to an adjustment type supported by audio adjustment, and a preferred implementation scheme is provided in this embodiment of the present invention, which is specifically as follows: the parameter obtaining unit 302 is configured to control the display to display a selection identifier on a displayed parameter input interface, and read a pre-configured adjustment parameter corresponding to the selection identifier according to the received selection identifier.
In this embodiment, the user only needs to select the selection identifier, and the selection identifier may be some identifying prompt information, for example: male/female voices, children, etc.; it may also be a background sound: clapping, music selection, etc. The pre-configured adjusting parameters are pre-configured based on the selection identifier, so that the audio attribute can be adjusted to the parameters corresponding to the selection identifier, and the parameters can be combined. For example: loudness, sound ray, frequency, etc.
Based on the above description, it can be seen that: the selection identifier includes: sound parameters and/or sound effect parameters. If there are other adjustment requirements, the selection identifier may also include other parameter types, and the embodiment of the present invention is not limited uniquely. The sound parameter may be loudness, sound ray, frequency, or other parameters.
An embodiment of the present invention further provides another audio data processing device, as shown in fig. 5, including: a receiver 501, a transmitter 502, a processor 503, and a memory 504;
the processor 503 is configured to control and execute: acquiring original audio data and acquiring adjustment parameters; adjusting the original audio data according to the adjustment parameters to obtain target audio data; and sending the target audio data to the target equipment in the form of a short voice message.
The execution main body of the embodiment may be any device, especially a terminal device, for example: mobile terminal devices such as mobile phones and PADs. Or may be a terminal device such as a personal computer. It is known from the functions it performs that the device needs to have a speech acquisition device. In a broad sense, the audio acquisition may be directly acquiring audio information from an acquisition device side, or may be receiving audio information from other devices or audio acquisition devices. The original audio data is relative to the adjusted target audio data, and it is not necessary that the audio acquisition device acquires the directly obtained audio data.
Different audio adjustment parameters, such as frequency, loudness, sampling rate, background type, etc., of the audio may be used depending on the requirements of the target audio data.
In this embodiment, the original audio data is adjusted by the obtained adjustment parameter, so as to obtain the target audio. The target audio is a short voice message, can be applied to a plurality of application scenes of audio data carried in a short message form, and can flexibly control the attribute of the sent audio data, so that the audio has richer expression forms.
The embodiment of the invention also provides a starting scene for acquiring the adjustment parameters, which specifically comprises the following steps: the processor 503 is further configured to control and execute: before the adjustment parameters are obtained, an audio adjustment instruction is received, or the audio parameters needing to be adjusted are determined according to the read configuration information.
The former approach can be applied when the user specifies that the audio needs to be adjusted, for example: in the WeChat voice process, a user inputs an instruction for selecting to be adjusted after recording a section of audio. The latter method is more intelligent, and the user can pre-configure the information whether the voice transmission of the phrase voice message class needs to be adjusted in the application software, so that the device automatically calls the parameters to determine whether to start the adjustment process. According to different requirements, the two schemes can be combined and selected.
In this embodiment, an alternative form of configuration information is given, where the configuration information includes:
the configuration information is configured to correspond to a receiving object of the target audio; alternatively, the configuration information may be configured to correspond to a transmission method.
In this embodiment, the former method, in which the configuration information corresponds to the receiving object, means that the configuration information is configured to which voice information of the receiving object needs to be adjusted, for example: to a specific group, or user. The latter method uses configuration information corresponding to the transmission method, for example: corresponding to the transmission modes such as group transmission, broadcast, specific user designation and the like, one or more transmission modes are designated as required to be adjusted by default.
In this embodiment, the receiving manner of the adjustment parameter may be as follows: the processor 503 is configured to control and execute: the acquiring of the adjustment parameter comprises: and displaying an adjustment parameter input interface, and acquiring adjustment parameters from the adjustment parameter input interface.
The parameter input interface in this embodiment may be configured according to an adjustment type supported by audio adjustment, and a preferred implementation scheme is provided in this embodiment of the present invention, which is specifically as follows: the processor 503 is configured to control and execute: displaying an adjustment parameter input interface, wherein acquiring adjustment parameters from the adjustment parameter input interface comprises:
and displaying a selection identifier on a displayed parameter input interface, and reading a preset adjustment parameter corresponding to the selection identifier according to the received selection identifier.
In this embodiment, the user only needs to select the selection identifier, and the selection identifier may be some identifying prompt information, for example: male/female voices, children, etc.; it may also be a background sound: clapping, music selection, etc. The pre-configured adjusting parameters are pre-configured based on the selection identifier, so that the audio attribute can be adjusted to the parameters corresponding to the selection identifier, and the parameters can be combined. For example: loudness, sound ray, frequency, etc.
Based on the above description, it can be seen that: the selection identifier includes: sound parameters and/or sound effect parameters. If there are other adjustment requirements, the selection identifier may also include other parameter types, and the embodiment of the present invention is not limited uniquely. The sound parameter may be loudness, sound ray, frequency, or other parameters.
Another audio data processing device is provided in the embodiment of the present invention, as shown in fig. 6, for convenience of description, only the parts related to the embodiment of the present invention are shown, and details of the specific technology are not disclosed, please refer to the method part in the embodiment of the present invention. The terminal may be any terminal device including a mobile phone, a tablet computer, a PDA (Personal Digital Assistant), a POS (Point of Sales), a vehicle-mounted computer, etc., taking the terminal as the mobile phone as an example:
fig. 6 is a block diagram illustrating a partial structure of a mobile phone related to a terminal provided in an embodiment of the present invention. Referring to fig. 6, the handset includes: radio Frequency (RF) circuit 610, memory 620, input unit 630, display unit 640, sensor 650, audio circuit 660, wireless fidelity (WiFi) module 670, processor 680, and power supply 690. Those skilled in the art will appreciate that the handset configuration shown in fig. 6 is not intended to be limiting and may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components.
The following describes each component of the mobile phone in detail with reference to fig. 6:
the RF circuit 610 may be used for receiving and transmitting signals during information transmission and reception or during a call, and in particular, receives downlink information of a base station and then processes the received downlink information to the processor 680; in addition, the data for designing uplink is transmitted to the base station. In general, RF circuit 610 includes, but is not limited to, an antenna, at least one Amplifier, a transceiver, a coupler, a Low Noise Amplifier (LNA), a duplexer, and the like. In addition, the RF circuitry 610 may also communicate with networks and other devices via wireless communications. The wireless communication may use any communication standard or protocol, including but not limited to Global system for Mobile communication (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), email, Short Messaging Service (SMS), and the like.
The memory 620 may be used to store software programs and modules, and the processor 680 may execute various functional applications and data processing of the mobile phone by operating the software programs and modules stored in the memory 620. The memory 620 may mainly include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the cellular phone, and the like. Further, the memory 620 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
The input unit 630 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function control of the cellular phone. Specifically, the input unit 630 may include a touch panel 631 and other input devices 632. The touch panel 631, also referred to as a touch screen, may collect touch operations of a user (e.g., operations of the user on the touch panel 631 or near the touch panel 631 by using any suitable object or accessory such as a finger or a stylus) thereon or nearby, and drive the corresponding connection device according to a preset program. Alternatively, the touch panel 631 may include two parts of a touch detection device and a touch controller. The touch detection device detects the touch direction of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch sensing device, converts the touch information into touch point coordinates, sends the touch point coordinates to the processor 680, and can receive and execute commands sent by the processor 680. In addition, the touch panel 631 may be implemented using various types, such as resistive, capacitive, infrared, and surface acoustic wave. The input unit 630 may include other input devices 632 in addition to the touch panel 631. In particular, other input devices 632 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys, switch keys, etc.), a trackball, a mouse, a joystick, and the like.
The display unit 640 may be used to display information input by the user or information provided to the user and various menus of the mobile phone. The Display unit 640 may include a Display panel 641, and optionally, the Display panel 641 may be configured in the form of a Liquid Crystal Display (LCD), an Organic Light-Emitting Diode (OLED), or the like. Further, the touch panel 631 can cover the display panel 641, and when the touch panel 631 detects a touch operation thereon or nearby, the touch panel is transmitted to the processor 680 to determine the type of the touch event, and then the processor 680 provides a corresponding visual output on the display panel 641 according to the type of the touch event. Although in fig. 6, the touch panel 631 and the display panel 641 are two independent components to implement the input and output functions of the mobile phone, in some embodiments, the touch panel 631 and the display panel 641 may be integrated to implement the input and output functions of the mobile phone.
The handset may also include at least one sensor 650, such as a light sensor, motion sensor, and other sensors. Specifically, the light sensor may include an ambient light sensor that adjusts the brightness of the display panel 641 according to the brightness of ambient light, and a proximity sensor that turns off the display panel 641 and/or the backlight when the mobile phone is moved to the ear. As one of the motion sensors, the accelerometer sensor can detect the magnitude of acceleration in each direction (generally, three axes), can detect the magnitude and direction of gravity when stationary, and can be used for applications of recognizing the posture of a mobile phone (such as horizontal and vertical screen switching, related games, magnetometer posture calibration), vibration recognition related functions (such as pedometer and tapping), and the like; as for other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which can be configured on the mobile phone, further description is omitted here.
Audio circuit 660, speaker 661, and microphone 662 can provide an audio interface between a user and a cell phone. The audio circuit 660 may transmit the electrical signal converted from the received audio data to the speaker 661, and convert the electrical signal into an audio signal through the speaker 661 for output; on the other hand, the microphone 662 converts the collected sound signals into electrical signals, which are received by the audio circuit 660 and converted into audio data, which are processed by the audio data output processor 680 and then transmitted via the RF circuit 610 to, for example, another cellular phone, or output to the memory 620 for further processing.
WiFi belongs to short-distance wireless transmission technology, and the mobile phone can help a user to receive and send e-mails, browse webpages, access streaming media and the like through the WiFi module 670, and provides wireless broadband Internet access for the user. Although fig. 6 shows the WiFi module 670, it is understood that it does not belong to the essential constitution of the handset, and can be omitted entirely as needed within the scope not changing the essence of the invention.
The processor 680 is a control center of the mobile phone, and connects various parts of the entire mobile phone by using various interfaces and lines, and performs various functions of the mobile phone and processes data by operating or executing software programs and/or modules stored in the memory 620 and calling data stored in the memory 620, thereby performing overall monitoring of the mobile phone. Optionally, processor 680 may include one or more processing units; preferably, the processor 680 may integrate an application processor, which mainly handles operating systems, user interfaces, application programs, etc., and a modem processor, which mainly handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into processor 680.
The handset also includes a power supply 690 (e.g., a battery) for powering the various components, which may preferably be logically connected to the processor 680 via a power management system, such that the power management system may be used to manage charging, discharging, and power consumption.
Although not shown, the mobile phone may further include a camera, a bluetooth module, etc., which are not described herein.
In the embodiment of the present invention, the processor 680 included in the terminal further has the following functions:
the processor 680 is configured to control and execute: acquiring original audio data and acquiring adjustment parameters; adjusting the original audio data according to the adjustment parameters to obtain target audio data; and sending the target audio data to the target equipment in the form of a short voice message.
The execution main body of the embodiment may be any device, especially a terminal device, for example: mobile terminal devices such as mobile phones and PADs. Or may be a terminal device such as a personal computer. It is known from the functions it performs that the device needs to have a speech acquisition device. In a broad sense, the audio acquisition may be directly acquiring audio information from an acquisition device side, or may be receiving audio information from other devices or audio acquisition devices. The original audio data is relative to the adjusted target audio data, and it is not necessary that the audio acquisition device acquires the directly obtained audio data.
Different audio adjustment parameters, such as frequency, loudness, sampling rate, background type, etc., of the audio may be used depending on the requirements of the target audio data.
In this embodiment, the original audio data is adjusted by the obtained adjustment parameter, so as to obtain the target audio. The target audio is a short voice message, can be applied to a plurality of application scenes of audio data carried in a short message form, and can flexibly control the attribute of the sent audio data, so that the audio has richer expression forms.
The embodiment of the invention also provides a starting scene for acquiring the adjustment parameters, which specifically comprises the following steps: the processor 680 is further configured to control and execute: before the adjustment parameters are obtained, an audio adjustment instruction is received, or the audio parameters needing to be adjusted are determined according to the read configuration information.
The former approach can be applied when the user specifies that the audio needs to be adjusted, for example: in the WeChat voice process, a user inputs an instruction for selecting to be adjusted after recording a section of audio. The latter method is more intelligent, and the user can pre-configure the information whether the voice transmission of the phrase voice message class needs to be adjusted in the application software, so that the device automatically calls the parameters to determine whether to start the adjustment process. According to different requirements, the two schemes can be combined and selected.
In this embodiment, an alternative form of configuration information is given, where the configuration information includes:
the configuration information is configured to correspond to a receiving object of the target audio; alternatively, the configuration information may be configured to correspond to a transmission method.
In this embodiment, the former method, in which the configuration information corresponds to the receiving object, means that the configuration information is configured to which voice information of the receiving object needs to be adjusted, for example: to a specific group, or user. The latter method uses configuration information corresponding to the transmission method, for example: corresponding to the transmission modes such as group transmission, broadcast, specific user designation and the like, one or more transmission modes are designated as required to be adjusted by default.
In this embodiment, the receiving manner of the adjustment parameter may be as follows: the processor 680 is configured to control and execute: the acquiring of the adjustment parameter comprises: and displaying an adjustment parameter input interface, and acquiring adjustment parameters from the adjustment parameter input interface.
The parameter input interface in this embodiment may be configured according to an adjustment type supported by audio adjustment, and a preferred implementation scheme is provided in this embodiment of the present invention, which is specifically as follows: the processor 680 is configured to control and execute: displaying an adjustment parameter input interface, wherein acquiring adjustment parameters from the adjustment parameter input interface comprises:
and displaying a selection identifier on a displayed parameter input interface, and reading a preset adjustment parameter corresponding to the selection identifier according to the received selection identifier.
In this embodiment, the user only needs to select the selection identifier, and the selection identifier may be some identifying prompt information, for example: male/female voices, children, etc.; it may also be a background sound: clapping, music selection, etc. The pre-configured adjusting parameters are pre-configured based on the selection identifier, so that the audio attribute can be adjusted to the parameters corresponding to the selection identifier, and the parameters can be combined. For example: loudness, sound ray, frequency, etc.
It should be noted that, in the above device embodiment, each included unit is only divided according to functional logic, but is not limited to the above division as long as the corresponding function can be implemented; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention.
In addition, it is understood by those skilled in the art that all or part of the steps in the above method embodiments may be implemented by related hardware, and the corresponding program may be stored in a computer readable storage medium, where the above mentioned storage medium may be a read-only memory, a magnetic disk or an optical disk.
The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the embodiment of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (6)

1. A method of audio data processing, comprising:
collecting original audio data;
determining audio parameters to be adjusted according to the read configuration information, displaying an adjustment parameter input interface and acquiring adjustment parameters from the adjustment parameter input interface;
performing sound changing or sound effect adding on the original audio data according to the adjusting parameters to obtain target audio data;
sending the target audio data to a target device in the form of a short voice message;
the configuration information corresponds to a sending mode, the configuration information corresponds to the sending mode and indicates that the configuration information is configured to be the audio data needing to be adjusted, and the sending mode is group sending or broadcasting.
2. The method of claim 1, wherein displaying the tuning parameter input interface and obtaining tuning parameters from the tuning parameter input interface comprises:
and displaying a selection identifier on a displayed parameter input interface, and reading a preset adjustment parameter corresponding to the selection identifier according to the received selection identifier.
3. The method of claim 2, wherein selecting the identifier comprises:
sound parameters and/or sound effect parameters.
4. An audio data processing apparatus, characterized by comprising:
the audio acquisition unit is used for acquiring original audio data;
the parameter acquisition unit is used for determining audio parameters to be adjusted according to the read configuration information, displaying an adjustment parameter input interface and acquiring adjustment parameters from the adjustment parameter input interface;
the audio adjusting unit is used for executing sound changing or sound effect adding on the original audio data according to the adjusting parameters to obtain target audio data;
the audio sending unit is used for sending the target audio data to the target equipment in the form of short voice messages;
the configuration information corresponds to a sending mode, the configuration information corresponds to the sending mode and indicates that the configuration information is configured to be the audio data needing to be adjusted, and the sending mode is group sending or broadcasting.
5. The apparatus of claim 4,
the parameter acquisition unit is used for controlling the display to display a selection identifier on a displayed parameter input interface, and reading a pre-configured adjustment parameter corresponding to the selection identifier according to the received selection identifier.
6. The apparatus of claim 5, wherein the selection identifier comprises:
sound parameters and/or sound effect parameters.
CN201510566251.0A 2015-09-07 2015-09-07 Audio data processing method and device Active CN106506437B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510566251.0A CN106506437B (en) 2015-09-07 2015-09-07 Audio data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510566251.0A CN106506437B (en) 2015-09-07 2015-09-07 Audio data processing method and device

Publications (2)

Publication Number Publication Date
CN106506437A CN106506437A (en) 2017-03-15
CN106506437B true CN106506437B (en) 2021-03-16

Family

ID=58286841

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510566251.0A Active CN106506437B (en) 2015-09-07 2015-09-07 Audio data processing method and device

Country Status (1)

Country Link
CN (1) CN106506437B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2019041186A1 (en) * 2017-08-30 2019-03-07 深圳传音通讯有限公司 Audio voice changing method, intelligent device, and storage medium
CN107818787B (en) * 2017-10-31 2021-02-05 努比亚技术有限公司 Voice information processing method, terminal and computer readable storage medium
CN109246308A (en) * 2018-10-24 2019-01-18 维沃移动通信有限公司 A kind of method of speech processing and terminal device
CN111128204A (en) * 2018-11-01 2020-05-08 阿里巴巴集团控股有限公司 Comment method and device, terminal device and computer storage medium
CN110075518A (en) * 2019-04-22 2019-08-02 网易(杭州)网络有限公司 Voice information processing method and device, storage medium, electronic equipment
CN110830368B (en) * 2019-11-22 2022-05-06 维沃移动通信有限公司 Instant messaging message sending method and electronic device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101123830A (en) * 2006-08-09 2008-02-13 索尼株式会社 Device, method and program for processing audio frequency signal
CN103617797A (en) * 2013-12-09 2014-03-05 腾讯科技(深圳)有限公司 Voice processing method and device

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7123695B2 (en) * 2002-05-21 2006-10-17 Bellsouth Intellectual Property Corporation Voice message delivery over instant messaging
CN1941747A (en) * 2005-09-27 2007-04-04 腾讯科技(深圳)有限公司 Demand telecommunicating method and system
CN101304391A (en) * 2008-06-30 2008-11-12 腾讯科技(深圳)有限公司 Voice call method and system based on instant communication system
CN103839550A (en) * 2012-11-26 2014-06-04 阿尔卡特朗讯 Method, apparatus and system for processing voice in conversation
CN104780091B (en) * 2014-01-13 2019-06-25 北京发现角科技有限公司 A kind of instant communicating method and system with speech audio processing function
CN104200824B (en) * 2014-08-25 2019-05-03 努比亚技术有限公司 Audio recording method and apparatus
CN104167213B (en) * 2014-08-26 2019-07-30 努比亚技术有限公司 Audio-frequency processing method and device
CN104811540A (en) * 2015-03-26 2015-07-29 努比亚技术有限公司 Mobile terminal and sound setting method and device for voice communication of mobile terminal

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101123830A (en) * 2006-08-09 2008-02-13 索尼株式会社 Device, method and program for processing audio frequency signal
CN103617797A (en) * 2013-12-09 2014-03-05 腾讯科技(深圳)有限公司 Voice processing method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
即时通信系统中的音视频处理技术;苏育挺;《电视技术》;20120102;第36卷(第01期);全文 *

Also Published As

Publication number Publication date
CN106506437A (en) 2017-03-15

Similar Documents

Publication Publication Date Title
US10649720B2 (en) Sound effect configuration method and system and related device
CN105872253B (en) A kind of live sound processing method and mobile terminal
CN106506437B (en) Audio data processing method and device
CN104393848B (en) Volume adjusting method and device
CN103488939B (en) Method, device and terminal for prompting user
CN108886653B (en) An earphone channel control method, related equipment and system
CN105955700A (en) Sound effect adjusting method and user terminal
CN106210755B (en) A kind of methods, devices and systems playing live video
CN106973330B (en) Screen live broadcasting method, device and system
EP3416040A1 (en) Sound effect processing method and mobile terminal
WO2017215652A1 (en) Sound effect parameter adjustment method, and mobile terminal
CN103618514A (en) Volume adjustment method, device and electronic equipment
CN106126160B (en) A sound effect adjustment method and user terminal
CN112019929A (en) Volume adjusting method and device
WO2017215661A1 (en) Scenario-based sound effect control method and electronic device
CN107562406A (en) A kind of volume adjusting method, mobile terminal and computer-readable recording medium
CN106445457A (en) Headphone sound channel switching method and device
CN106126170A (en) A terminal sound effect setting method and terminal
CN108600906B (en) Playing control method and device for multiple sound boxes
CN107391732B (en) Function guiding method and device of application program and terminal equipment
WO2015117550A1 (en) Method and apparatus for acquiring reverberated wet sound
CN106713668B (en) Method and device for controlling vibration of terminal
CN109119101B (en) A kind of audio data processing method, device and mobile terminal
CN105159655B (en) Behavior event playing method and device
CN106231499A (en) A kind of audio Enhancement Method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant