EP2285025A1 - Method and apparatus for coding/decoding a stereo audio signal into a mono audio signal - Google Patents
Method and apparatus for coding/decoding a stereo audio signal into a mono audio signal Download PDFInfo
- Publication number
- EP2285025A1 EP2285025A1 EP09305680A EP09305680A EP2285025A1 EP 2285025 A1 EP2285025 A1 EP 2285025A1 EP 09305680 A EP09305680 A EP 09305680A EP 09305680 A EP09305680 A EP 09305680A EP 2285025 A1 EP2285025 A1 EP 2285025A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- audio signal
- signal
- coding information
- averaged
- input
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000005236 sound signal Effects 0.000 title claims abstract description 215
- 238000000034 method Methods 0.000 title claims abstract description 27
- 239000002131 composite material Substances 0.000 claims abstract description 37
- 238000012935 Averaging Methods 0.000 claims abstract description 5
- 238000004891 communication Methods 0.000 claims description 4
- 230000002194 synthesizing effect Effects 0.000 claims 2
- 238000001914 filtration Methods 0.000 description 7
- 230000008901 benefit Effects 0.000 description 4
- 230000006978 adaptation Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000000354 decomposition reaction Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
- 208000016254 weariness Diseases 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H20/00—Arrangements for broadcast or for distribution combined with broadcast
- H04H20/86—Arrangements characterised by the broadcast information itself
- H04H20/88—Stereophonic broadcast systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
Definitions
- the technical domain of the invention is the domain of transmission of audio signal, and particularly stereo audio signal.
- the invention concerns a method to code and decode a stereo audio signal, comprising two separate audio signals into a single composite audio signal, in order to transmit it over a single link designed to transmit a mono audio signal.
- an audio signal such as voice, music or any other sound
- a communication through a communication system e.g. phone communication, be it RTC, TDM or IP
- an audio signal such as voice, music or any other sound
- a stereo audio signal can be a really interesting feature to enhance user hearing and for services such as e.g. vocal message, voice guide, music on hold, conferencing system and so on.
- a stereo audio signal comprises two audio signals, and would then need twice the amount of capacities to be transmitted.
- most of the existing systems are designed to manage and to transmit only single mono signals.
- the idea of the invention is then to use an existing mono audio link to transmit a stereo audio signal. This needs a special coding of the stereo audio signal, composed of a first input audio signal and a second input audio signal, into a single composite audio signal.
- the invention thus concerns a method for coding a stereo audio signal, composed of its two components comprising a first input audio signal and a second input audio signal, into a single composite audio signal, comprising the steps of:
- an adaptative filter may be used to determine said first and second coding information.
- the signal may be digitalised applying G.711 codec and arranged into bytes using HDLC protocol, and the first and second coding information may be embedded by replacing the less significant bit of some of said bytes.
- the invention also concerns a corresponding method for decoding a stereo audio signal, from a single composite audio signal, into its two components comprising a first output audio signal and a second output audio signal, comprising the steps of:
- the invention also concerns a transmitter able to code and transmit such a stereo audio signal.
- the invention also concerns a receiver able to receive and decode such a stereo audio signal.
- the invention also concerns a transmitting system comprising said transmitter and said receiver.
- the proposed solution advantageously offers the capability to transmit a stereo signal over a mono audio transmitting link.
- Another advantage of the proposed solution is to offer transparency in that a classical mono receiver can receive said stereo signal coded as a composite audio signal and still proceed to retrieve an audio signal.
- a stereo audio signal initially composed of its two components comprising a first input audio signal 1 and a second input audio signal 2, is coded into a single composite audio signal 3.
- Said composite audio signal 3 is tailored to be transmitted over an audio link or channel able to transmit a mono audio signal.
- Said coding can be made by following several steps.
- the two input audio signals 1, 2 are averaged into an averaged audio signal 4.
- the averaged audio signal 4 is preferably an audio signal of the same kind as the two input audio signals 1, 2. For instance it is preferably an analogical signal if the input signals 1, 2 are analogical signals, and it is preferably a digital signal if the input signals 1, 2 are digital signals. It also preferably shares with the input signals 1, 2, the same bandwidth or coding scheme.
- Said averaged audio signal 4 may be obtained by adding the two signals 1, 2, and dividing the result by two. Said averaging operation realised by averaging module 20 can be done analogically or digitally.
- each initial input audio signal 1, 2, is compared to said averaged audio signal 4.
- a first comparator/analyser 21 compares said averaged audio signal 4 to said first input audio signal 1. From that comparison it determines a first coding information 7, representative of the first input audio signal 1 or of the difference signal 5 obtained by subtracting the averaged signal 4 out of the first input audio signal 1.
- a second comparator/analyser 22 similarly, compares said averaged audio signal 4 to said second input audio signal 2. From that comparison it determines a second coding information 8, representative of the second input audio signal 2 or of the difference signal 6 obtained by subtracting the averaged signal 4 out of the second input audio signal 2.
- Each of said first and second coding information 7, 8, may be either analogical or digital. However it is advantageously in a form that takes less occupancy/bandwidth/size, than the initial input audio signal 1, 2, in order to ease the following embedding step, but nonetheless containing the essential features of each respective original input audio signal 1, 2, so as to be able to synthesise said original signals out of said coding information 7, 8, during a further decoding stage.
- the comparing/analysing operations can be done either analogically or digitally.
- the now reduced coding information 7, 8, respectively representative of the input audio signals 1, 2, or of the two difference signals 5, 6, are embedded into the averaged audio signal 4.
- Said operation is done by an embedding module 25.
- the resulting signal 3 is the composite audio signal 3.
- Said composite audio signal 3 is preferably an audio signal of the same kind as the difference signals 5, 6 or the averaged audio signal 4. It can be either an analogical signal or a digital signal.
- the embedding operation can be done either analogically or digitally.
- One preferred embodiment advantageously uses adaptative filtering and will be described with reference to figure 3 , showing a zoomed view of the comparator/analyser modules 21 and 22 of figure 1 .
- An upper first comparator/analyser 21 determines a first coding information 7 as output. It receives as input the first input audio signal 1 and the averaged audio signal 4.
- a first comparator 40 determines a first difference signal 5 by subtracting said first input audio signal 1 out of said averaged audio signal 4. Said first difference signal 5 is feedback into a first adaptative filter 42 to adapt its filtering. Said first adaptative filter 42 is used to filter the averaged audio signal 4.
- Said first filter 42 is self adapted to reach its optimum value to get its output signal very close to the first input audio signal 1, in order to get the first difference signal 5 very close to zero.
- the signal 7 obtained at output of the first comparator 40 is used as the first coding information 7.
- a lower second comparator/analyser 22 determines a second coding information 8 as output. It receives as input the second input audio signal 2 and the averaged audio signal 4.
- a second comparator determines a second difference signal 6 by subtracting said second input audio signal 2 out of said averaged audio signal 4. Said second difference signal 6 is feedback into a second similar adaptative filter 43 to adapt its filtering. Said second adaptative filter 43 is used to filter the averaged audio signal 4. Said second filter 43 is self adapted to reach its optimum value to get its output signal very close to the second input audio signal 2, in order to get the second difference signal 6 very close to zero.
- the signal 8 obtained at output of the second comparator is used as the second coding information 8.
- said adaptative filters 42, 43 are advantageously initialised to zero at start-up.
- Said adaptative filters 42, 43 may be based either on least mean square, LMS, family or on recursive least square, RLS, family kind of adaptative filters.
- Said first comparator/analyser module 21, respectively second comparator/analyser module 22, may also comprises, between the output of said comparator 40 and the branching of a feedback line 44, an additional quantifier 41.
- the aim of said quantifier 41 is to drastically reduce the occupancy of its output signal with respect to its input signal, by compressing the first difference signal 5, respectively the second difference signal 6, into the first coding information 7, respectively the second coding information 8. This may be done by analogically or digitally compressing and/or digitalising said signal.
- said quantifier 41 is used to reduce the number of bits used to code the coding signal 7, 8. This is possible because the signal should be close to zero.
- the coding signal 7, 8, may be coded using very few bits.
- the convergence time of the adaptative filters 42, 43 depends on the number of bits used to code the coding signal 7, 8. 2x40 bits can then be enough to code the coding signals 7, 8, of two channels of a stereo audio signal, as shown in a later described digital embodiment.
- Another possible embodiment of the determining/analysing step comprises spectral analysis of the difference signal 5, 6, thus producing a decomposition over a given set of frequencies. Since said frequencies are given or predefined in accordance with the decoding stage, the coefficients of said decomposition may be sufficient to wholly define the difference signal 5, 6. Said coefficients may be retained as the coding information 7, 8.
- the first and second coding information 7, 8, whatever their shape or kind, are then embedded into the averaged audio signal 4, by replacing a removed part of said averaged audio signal 4.
- Said removed or replaced part of the averaged audio signal 4 is advantageously chosen among a least significant part of said averaged audio signal 4.
- Another important advantage of said feature is that the eventual decoding of a so coded signal by a classical, e.g. existing prior art mono, receiver remains possible without too much distorting the audio signal. This provides a transparency of the method which allows a progressive deployment of the invention, where the old existing mono receivers can remain in service.
- the embedding step can be done analogically by including a signal coding said first and second coding information 7, 8, into the averaged signal 4.
- the least significant part of the signal may be e.g. a frequency not used by the signal, such as a non audible frequency, however contained in the bandwidth of the transmitting/receiving link.
- digitalisation of signal can be done as early as at input stage, applying to input audio signals 1, 2, or as late as at output stage, applying to composite audio signal 3, or at any stage in between.
- digitalisation is applied by a digitalisation module 26 on an averaged audio signal 4 to produce a digitalised averaged audio signal 9.
- the embedding step can also be done digitally by including some bits coding the first and second coding information 7, 8, into the bits coding the averaged signal 9.
- the least significant part of the signal 9 is some of the least significant bits.
- the digitalisation is made using G.711 codec, and before the transmission the digitalised signal 9 is arranged according to HDLC protocol into bytes. Since the G.711 codec uses a rather large bandwidth, the least significant bit of each of said bytes is quite not relevant to the so coded audio signal. In a HDLC frame, said least significant bits, may then be considered a least significant part of the signal and may be used to code the first and second coding information 7, 8.
- the resulting composite audio signal 3 comprising the averaged audio signal 9 and so embedded first and second coding information 7, 8, is thus not modified in its size/type and can be transmitted as it previously used to be over one single mono transmitting link. Moreover, it is quite unaffected audiophonically when decoded and rendered.
- G.711 codec is used for public switched telephone network, PSTN, for integrated services data network, ISDN and also for voice transport over IP networks with little compression.
- the invention may then directly be applied to these applications.
- Said coding scheme may obviously be implemented in a transmitter for transmitting said stereo audio signal, before a transmitting module that only needs to be able to transmit a single mono audio signal, since the composite audio signal 3 presents all the characteristics of a mono audio signal.
- Said coding stage may be advantageously used in conjunction with a corresponding decoding stage described now with reference to figure 2 .
- the aim is to retrieve the two components of the initial stereo audio signal. This can be done applying the following steps.
- a first step the first and second coding information 7, 8, embedded into the composite audio signal 3 are separated out of said composite audio signal 3.
- Said separation step which may be realised by a separating module 30, extracts first and second coding information 7, 8, on one side, and a central audio signal 10, 11, on the other side.
- the method used here to separate depends on the corresponding method used to embed, and is its corresponding reverse method.
- Said central audio signal 10, 11, generally compares to the averaged audio signal 4, 9, of the coding stage.
- a first output audio signal 14 and a second output audio signal 15 are synthesised respectively out of first and second coding information 7, 8.
- Said synthesising step is made applying a process reverse from the one uses in the comparing/analysing step of the coding stage.
- the thus obtained first output audio signal 14 and second output audio signal 15, normally compare respectively to the first input audio signal 1 and to the second input audio signal 2 inputted in the coding stage.
- all steps of the decoding stage may indifferently apply to either analogical signals of digital signals, and may be done either analogically or digitally.
- figure 2 illustrates a digital to analogical converter 31 for converting a digital central audio signal 10 into an analogical central audio signal 11.
- first and second coding information 7, 8, embedded into the composite audio signal 3 were coefficients coded from adaptative filtering, said coefficients are retrieved during the separating step of the decoding stage, and are then used accordingly, in synthesise step, to synthesise the two output audio signals 14, 15, out of said coefficients 7, 8.
- the synthesising step corresponding to the analysing step of the coding process described with reference to figure 3 will now be described with reference to figure 4 .
- Figure 4 shows in detail the synthesising modules 34 and 35 of figure 2 , in an embodiment adapted to decode the first and second coding information 7, 8, when said first and second coding information 7, 8, were obtained, at coding stage, through adaptative filtering, as previously described.
- An upper first synthesiser 34 determines a first output audio signal 14 as output. It receives as input the first coding information 7 and the central audio signal 10, 11.
- a first adaptative filter 45 filters said central audio signal 10, 11, the first adaptative filter 45 being adapted by said first coding information 7. Since the coding process produced said first coding information 7 as previously described, said filtering adapted by said first coding information 7, synthesises at its output a first output audio signal 14.
- the adaptative filter 45 is initialised to zero. Since it uses the same central audio signal 10, 11 as input and the same signal 7 for adaptation, the resulting output signal 14 compares to the first input audio signal 1, when convergence of filters are good.
- a lower second synthesiser 35 determines a second output audio signal 15 as output. It receives as input the second coding information 8 and the central audio signal 10, 11.
- a second adaptative filter 46 filters said central audio signal 10, 11, the second adaptative filter 46 being adapted by said second coding information 8. Since the coding process produced said second coding information 8 as previously described, said filtering adapted by said second coding information 8, synthesises at its output a second output audio signal 15.
- the adaptative filter 46 is initialised to zero. Since it uses the same signal 10, 11 as input and the same signal 8 for adaptation, the resulting output signal 15 compares to second input audio signal 2, when convergence of filters are good.
- Said adaptative filters 45, 46 may be based either on LMS family or on RLS family kind of adaptative filters.
- All of said previously described decoding schemes may obviously be implemented in a receiver of said stereo audio signal, after a receiving module, able to receive the composite audio signal 3, as a single mono audio signal, decoding it so as to provide the first output audio signal 14 and the second output audio signal 15, components of said stereo audio signal.
- Said two output audio signals 14, 15, may then be audiophonically rendered by any known stereo player such as a pair of loudspeakers or a stereo headset generally reproducing the initial stereo signal composed of the two input audio signals 1, 2.
- any known stereo player such as a pair of loudspeakers or a stereo headset generally reproducing the initial stereo signal composed of the two input audio signals 1, 2.
- the only differences are only caused by the process losses.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
- averaging said first input audio signal (1) and said second input audio signal (2) into an averaged audio signal (4, 9),
- determining a first coding information (7) by comparing said averaged audio signal (4, 9) to said first input audio signal (1),
- determining a second coding information (8) by comparing said averaged audio signal (4, 9) to said second input audio signal (2),
- embedding said first and second coding information (7, 8) into said averaged audio signal (4, 9) to obtain the composite audio signal (3).
Description
- The technical domain of the invention is the domain of transmission of audio signal, and particularly stereo audio signal.
- More particularly, the invention concerns a method to code and decode a stereo audio signal, comprising two separate audio signals into a single composite audio signal, in order to transmit it over a single link designed to transmit a mono audio signal.
- During a communication through a communication system e.g. phone communication, be it RTC, TDM or IP, an audio signal, such as voice, music or any other sound, is generally transmitted in mono, over a single audio link between users. This leads to a loss of a real spatial dimension in which each user is moving and can be a source of involuntary weariness. Furthermore, a stereo audio signal can be a really interesting feature to enhance user hearing and for services such as e.g. vocal message, voice guide, music on hold, conferencing system and so on.
- However, a stereo audio signal comprises two audio signals, and would then need twice the amount of capacities to be transmitted. Moreover, most of the existing systems are designed to manage and to transmit only single mono signals.
- The idea of the invention is then to use an existing mono audio link to transmit a stereo audio signal. This needs a special coding of the stereo audio signal, composed of a first input audio signal and a second input audio signal, into a single composite audio signal.
- The invention thus concerns a method for coding a stereo audio signal, composed of its two components comprising a first input audio signal and a second input audio signal, into a single composite audio signal, comprising the steps of:
- averaging said first input audio signal and said second input audio signal into an averaged audio signal,
- determining a first coding information by comparing said averaged audio signal to said first input audio signal,
- determining a second coding information by comparing said averaged audio signal to said second input audio signal,
- embedding said first and second coding information into said averaged audio signal to obtain a composite audio signal.
- According to another feature of the invention, an adaptative filter may be used to determine said first and second coding information.
- According to another feature of the invention, the signal may be digitalised applying G.711 codec and arranged into bytes using HDLC protocol, and the first and second coding information may be embedded by replacing the less significant bit of some of said bytes.
- The invention also concerns a corresponding method for decoding a stereo audio signal, from a single composite audio signal, into its two components comprising a first output audio signal and a second output audio signal, comprising the steps of:
- separating first and second coding information embedded into said composite audio signal out of said composite audio signal, to extract said first and second coding information and a central audio signal,
- synthesising said first output audio signal out of said first coding information and said central audio signal,
- synthesising said second output audio signal out of said second coding information and said central audio signal.
- The invention also concerns a transmitter able to code and transmit such a stereo audio signal.
- The invention also concerns a receiver able to receive and decode such a stereo audio signal.
- The invention also concerns a transmitting system comprising said transmitter and said receiver.
- The proposed solution advantageously offers the capability to transmit a stereo signal over a mono audio transmitting link.
- Another advantage of the proposed solution is to offer transparency in that a classical mono receiver can receive said stereo signal coded as a composite audio signal and still proceed to retrieve an audio signal.
- Others features, details and advantages of the invention will become more apparent from the detailed illustrating description given hereafter with respect to the drawings on which:
-
figure 1 is a synoptic of the coding/emitting stage, -
figure 2 is a synoptic of the receiving/decoding stage, -
figure 3 is a synoptic detailing a particular embodiment of a part of the coding/emitting stage, -
figure 4 is a synoptic detailing a particular embodiment of a part of the receiving/decoding stage. - According to
figure 1 , presenting a synoptic of the coding stage, a stereo audio signal, initially composed of its two components comprising a firstinput audio signal 1 and a secondinput audio signal 2, is coded into a singlecomposite audio signal 3. Saidcomposite audio signal 3 is tailored to be transmitted over an audio link or channel able to transmit a mono audio signal. Said coding can be made by following several steps. - In a first step, the two
1, 2 are averaged into an averagedinput audio signals audio signal 4. The averagedaudio signal 4 is preferably an audio signal of the same kind as the two 1, 2. For instance it is preferably an analogical signal if theinput audio signals 1, 2 are analogical signals, and it is preferably a digital signal if theinput signals 1, 2 are digital signals. It also preferably shares with theinput signals 1, 2, the same bandwidth or coding scheme. Said averagedinput signals audio signal 4 may be obtained by adding the two 1, 2, and dividing the result by two. Said averaging operation realised by averagingsignals module 20 can be done analogically or digitally. - In a second step, each initial
1, 2, is compared to said averagedinput audio signal audio signal 4. A first comparator/analyser 21 compares said averagedaudio signal 4 to said firstinput audio signal 1. From that comparison it determines afirst coding information 7, representative of the firstinput audio signal 1 or of the difference signal 5 obtained by subtracting the averagedsignal 4 out of the firstinput audio signal 1. A second comparator/analyser 22 similarly, compares said averagedaudio signal 4 to said secondinput audio signal 2. From that comparison it determines asecond coding information 8, representative of the secondinput audio signal 2 or of thedifference signal 6 obtained by subtracting the averagedsignal 4 out of the secondinput audio signal 2. - Each of said first and
7, 8, may be either analogical or digital. However it is advantageously in a form that takes less occupancy/bandwidth/size, than the initialsecond coding information 1, 2, in order to ease the following embedding step, but nonetheless containing the essential features of each respective originalinput audio signal 1, 2, so as to be able to synthesise said original signals out of saidinput audio signal 7, 8, during a further decoding stage. The comparing/analysing operations can be done either analogically or digitally.coding information - In a third step, the now reduced
7, 8, respectively representative of thecoding information 1, 2, or of the twoinput audio signals difference signals 5, 6, are embedded into the averagedaudio signal 4. Said operation is done by anembedding module 25. The resultingsignal 3 is thecomposite audio signal 3. Saidcomposite audio signal 3 is preferably an audio signal of the same kind as the difference signals 5, 6 or the averagedaudio signal 4. It can be either an analogical signal or a digital signal. Depending on the 4, 7, 8, the embedding operation can be done either analogically or digitally.ingress signals - Many ways can be used to analyse said
1, 2, or saidinput audio signals difference signals 5, 6, into first and 7, 8.second coding information - One preferred embodiment advantageously uses adaptative filtering and will be described with reference to
figure 3 , showing a zoomed view of the comparator/ 21 and 22 ofanalyser modules figure 1 . An upper first comparator/analyser 21 determines afirst coding information 7 as output. It receives as input the firstinput audio signal 1 and the averagedaudio signal 4. Afirst comparator 40 determines a first difference signal 5 by subtracting said firstinput audio signal 1 out of said averagedaudio signal 4. Said first difference signal 5 is feedback into a firstadaptative filter 42 to adapt its filtering. Said firstadaptative filter 42 is used to filter the averagedaudio signal 4. Saidfirst filter 42 is self adapted to reach its optimum value to get its output signal very close to the firstinput audio signal 1, in order to get the first difference signal 5 very close to zero. Thesignal 7 obtained at output of thefirst comparator 40 is used as thefirst coding information 7. - Similarly, a lower second comparator/
analyser 22 determines asecond coding information 8 as output. It receives as input the secondinput audio signal 2 and the averagedaudio signal 4. A second comparator determines asecond difference signal 6 by subtracting said secondinput audio signal 2 out of said averagedaudio signal 4. Saidsecond difference signal 6 is feedback into a second similaradaptative filter 43 to adapt its filtering. Said secondadaptative filter 43 is used to filter the averagedaudio signal 4. Saidsecond filter 43 is self adapted to reach its optimum value to get its output signal very close to the secondinput audio signal 2, in order to get thesecond difference signal 6 very close to zero. Thesignal 8 obtained at output of the second comparator is used as thesecond coding information 8. In a digital embodiment, said adaptative filters 42, 43, are advantageously initialised to zero at start-up. Said 42, 43, may be based either on least mean square, LMS, family or on recursive least square, RLS, family kind of adaptative filters.adaptative filters - Said first comparator/
analyser module 21, respectively second comparator/analyser module 22, may also comprises, between the output of saidcomparator 40 and the branching of afeedback line 44, anadditional quantifier 41. The aim of saidquantifier 41 is to drastically reduce the occupancy of its output signal with respect to its input signal, by compressing the first difference signal 5, respectively thesecond difference signal 6, into thefirst coding information 7, respectively thesecond coding information 8. This may be done by analogically or digitally compressing and/or digitalising said signal. - In a digital embodiment, said
quantifier 41 is used to reduce the number of bits used to code the 7, 8. This is possible because the signal should be close to zero. Thecoding signal 7, 8, may be coded using very few bits. The convergence time of the adaptative filters 42, 43, depends on the number of bits used to code thecoding signal 7, 8. 2x40 bits can then be enough to code the coding signals 7, 8, of two channels of a stereo audio signal, as shown in a later described digital embodiment.coding signal - Another possible embodiment of the determining/analysing step comprises spectral analysis of the
difference signal 5, 6, thus producing a decomposition over a given set of frequencies. Since said frequencies are given or predefined in accordance with the decoding stage, the coefficients of said decomposition may be sufficient to wholly define thedifference signal 5, 6. Said coefficients may be retained as the 7, 8.coding information - The first and
7, 8, whatever their shape or kind, are then embedded into the averagedsecond coding information audio signal 4, by replacing a removed part of said averagedaudio signal 4. Said removed or replaced part of the averagedaudio signal 4 is advantageously chosen among a least significant part of said averagedaudio signal 4. - Due to that feature, the removed, and modified by the insertion of
7, 8, part of the averagedcoding information signal 4, thus lacking at the decoding stage, would not impact to much the signal when decoded and rendered. - Another important advantage of said feature is that the eventual decoding of a so coded signal by a classical, e.g. existing prior art mono, receiver remains possible without too much distorting the audio signal. This provides a transparency of the method which allows a progressive deployment of the invention, where the old existing mono receivers can remain in service.
- The embedding step can be done analogically by including a signal coding said first and
7, 8, into the averagedsecond coding information signal 4. In the analogical case, the least significant part of the signal may be e.g. a frequency not used by the signal, such as a non audible frequency, however contained in the bandwidth of the transmitting/receiving link. - It is important to notice that the invention may be carried over either analogically or digitally. When referring to
figure 1 , digitalisation of signal can be done as early as at input stage, applying to input 1, 2, or as late as at output stage, applying toaudio signals composite audio signal 3, or at any stage in between. For instance, onfigure 1 , the digitalisation is applied by adigitalisation module 26 on an averagedaudio signal 4 to produce a digitalised averagedaudio signal 9. - The embedding step can also be done digitally by including some bits coding the first and
7, 8, into the bits coding the averagedsecond coding information signal 9. In the digital case, the least significant part of thesignal 9 is some of the least significant bits. - According to a preferred embodiment of the invention, the digitalisation is made using G.711 codec, and before the transmission the digitalised
signal 9 is arranged according to HDLC protocol into bytes. Since the G.711 codec uses a rather large bandwidth, the least significant bit of each of said bytes is quite not relevant to the so coded audio signal. In a HDLC frame, said least significant bits, may then be considered a least significant part of the signal and may be used to code the first and 7, 8. The resultingsecond coding information composite audio signal 3 comprising the averagedaudio signal 9 and so embedded first and 7, 8, is thus not modified in its size/type and can be transmitted as it previously used to be over one single mono transmitting link. Moreover, it is quite unaffected audiophonically when decoded and rendered.second coding information - For instance, a 20 ms G.711 frame comprises a payload of 160 bytes. This could provide at most 160 least significant bits that could be used to embed first and
7, 8. When using HDLC protocol, this number decreases to 126 due to protocol bytes. This is rather enough to embed first andsecond coding information 7, 8, comprising 8 coefficients with 5 bits of precision each for twosecond coding information difference signals 5, 6, that is: 8*5*2 = 80 bits. - Said preferred embodiment is particularly advantageous since the G.711 codec is used for public switched telephone network, PSTN, for integrated services data network, ISDN and also for voice transport over IP networks with little compression. The invention may then directly be applied to these applications.
- Said coding scheme may obviously be implemented in a transmitter for transmitting said stereo audio signal, before a transmitting module that only needs to be able to transmit a single mono audio signal, since the
composite audio signal 3 presents all the characteristics of a mono audio signal. - Said coding stage may be advantageously used in conjunction with a corresponding decoding stage described now with reference to
figure 2 . - When receiving a so coded
composite audio signal 3, the aim is to retrieve the two components of the initial stereo audio signal. This can be done applying the following steps. - In a first step, the first and
7, 8, embedded into thesecond coding information composite audio signal 3 are separated out of saidcomposite audio signal 3. Said separation step, which may be realised by a separatingmodule 30, extracts first and 7, 8, on one side, and asecond coding information 10, 11, on the other side. The method used here to separate depends on the corresponding method used to embed, and is its corresponding reverse method. Saidcentral audio signal 10, 11, generally compares to the averagedcentral audio signal 4, 9, of the coding stage.audio signal - In a second step, e.g. respectively realized by a
first synthesising module 34 and asecond synthesising module 35, a firstoutput audio signal 14 and a secondoutput audio signal 15 are synthesised respectively out of first and 7, 8. Said synthesising step is made applying a process reverse from the one uses in the comparing/analysing step of the coding stage.second coding information - The thus obtained first
output audio signal 14 and secondoutput audio signal 15, normally compare respectively to the firstinput audio signal 1 and to the secondinput audio signal 2 inputted in the coding stage. - As noted for the coding stage, all steps of the decoding stage may indifferently apply to either analogical signals of digital signals, and may be done either analogically or digitally.
- For instance,
figure 2 illustrates a digital toanalogical converter 31 for converting a digitalcentral audio signal 10 into an analogicalcentral audio signal 11. - Obviously, the steps of the decoding stage must correspond to those of the coding stage. If first and
7, 8, embedded into thesecond coding information composite audio signal 3, were coefficients coded from adaptative filtering, said coefficients are retrieved during the separating step of the decoding stage, and are then used accordingly, in synthesise step, to synthesise the two output audio signals 14, 15, out of said 7, 8. The synthesising step corresponding to the analysing step of the coding process described with reference tocoefficients figure 3 , will now be described with reference tofigure 4 . -
Figure 4 shows in detail the synthesising 34 and 35 ofmodules figure 2 , in an embodiment adapted to decode the first and 7, 8, when said first andsecond coding information 7, 8, were obtained, at coding stage, through adaptative filtering, as previously described.second coding information - An upper
first synthesiser 34 determines a firstoutput audio signal 14 as output. It receives as input thefirst coding information 7 and the 10, 11. A firstcentral audio signal adaptative filter 45 filters said 10, 11, the firstcentral audio signal adaptative filter 45 being adapted by saidfirst coding information 7. Since the coding process produced saidfirst coding information 7 as previously described, said filtering adapted by saidfirst coding information 7, synthesises at its output a firstoutput audio signal 14. At start-up theadaptative filter 45 is initialised to zero. Since it uses the same 10, 11 as input and thecentral audio signal same signal 7 for adaptation, the resultingoutput signal 14 compares to the firstinput audio signal 1, when convergence of filters are good. - Similarly, a lower
second synthesiser 35 determines a secondoutput audio signal 15 as output. It receives as input thesecond coding information 8 and the 10, 11. A secondcentral audio signal adaptative filter 46 filters said 10, 11, the secondcentral audio signal adaptative filter 46 being adapted by saidsecond coding information 8. Since the coding process produced saidsecond coding information 8 as previously described, said filtering adapted by saidsecond coding information 8, synthesises at its output a secondoutput audio signal 15. At start-up theadaptative filter 46 is initialised to zero. Since it uses the 10, 11 as input and thesame signal same signal 8 for adaptation, the resultingoutput signal 15 compares to secondinput audio signal 2, when convergence of filters are good. - Said
45, 46, may be based either on LMS family or on RLS family kind of adaptative filters.adaptative filters - It will be apparent to those skilled in the art, that the process would benefit of said first
adaptative filter 45 used for decoding being identical to the firstadaptative filter 42 used to code the firstinput audio signal 1 into saidcomposite audio signal 3, and of said secondadaptative filter 46 used for decoding being identical to the secondadaptative filter 43 used to code the secondinput audio signal 2 into saidcomposite audio signal 3. - All of said previously described decoding schemes may obviously be implemented in a receiver of said stereo audio signal, after a receiving module, able to receive the
composite audio signal 3, as a single mono audio signal, decoding it so as to provide the firstoutput audio signal 14 and the secondoutput audio signal 15, components of said stereo audio signal. - Said two output audio signals 14, 15, may then be audiophonically rendered by any known stereo player such as a pair of loudspeakers or a stereo headset generally reproducing the initial stereo signal composed of the two
1, 2. In fact, the only differences are only caused by the process losses.input audio signals
Claims (13)
- A method for coding a stereo audio signal, composed of its two components comprising a first input audio signal (1) and a second input audio signal (2), into a single composite audio signal (3), characterised in that it comprises the steps of:- averaging said first input audio signal (1) and said second input audio signal (2) into an averaged audio signal (4, 9),- determining a first coding information (7) by comparing said averaged audio signal (4, 9) to said first input audio signal (1),- determining a second coding information (8) by comparing said averaged audio signal (4, 9) to said second input audio signal (2),- embedding said first and second coding information (7, 8) into said averaged audio signal (4, 9) to obtain the composite audio signal (3).
- The method of claim 1, wherein said determining a first coding information (7) step adaptatively filters said averaged audio signal (4, 9) through a first adaptative filter (42) in order to zeroing a first difference signal (5), obtained by subtracting said first input audio signal (1) out of said averaged audio signal (4, 9), the first coding information (7) being said first difference signal (5), and wherein said determining a second coding information (8) step adaptatively filters said averaged audio signal (4, 9) through a second adaptative filter (43) in order to zeroing a second difference signal (5), obtained by subtracting said second input audio signal (2) out of said averaged audio signal (4, 9), the second coding information (8) being said second difference signal (8).
- The method of claim 2, wherein each determining step respectively further comprises:- quantifying said first (5), respectively second (6), difference signal in order to compress said difference signal (5, 6), the first (7), respectively second, coding information (8), being said first (5), respectively second, difference signal (6) compressed.
- The method of any one of claims 1 to 3, wherein said embedding step comprises replacing into said averaged audio signal (4, 9) a least significant part of the signal by said first and second coding information (7, 8).
- The method of claim 4, wherein the averaged audio signal (4) is digitalised, and wherein said least significant part of the signal comprises some less significant bits of said digitalised signal (9).
- The method of claim 5, wherein said digitalisation applies G.711 codec and arranges the digitalised signal (9) into bytes using HDLC protocol, and wherein said least significant part of the signal comprises the less significant bit of some of said bytes.
- A composite audio signal (3) characterised in that it is produced by the method for coding according to any one of claims 1 to 6.
- A method for decoding a stereo audio signal, from a single composite audio signal (3), into its two components comprising a first output audio signal (14) and a second output audio signal (15), characterised in that it comprises the steps of:- separating first and second coding information (7, 8) embedded into said composite audio signal (3) out of said composite audio signal (3), to extract said first and second coding information (7, 8) and a central audio signal (10, 11),- synthesising said first output audio signal (14) out of said first coding information (7) and said central audio signal (10, 11),- synthesising said second output audio signal (15) out of said second coding information (8) and said central audio signal (10, 11).
- The method of claim 8, wherein synthesizing said first output audio signal (14) step adaptatively filters said central audio signal (10, 11) through a first adaptative filter (45) adapted by said first coding information (7), and wherein synthesizing said second output audio signal (15) step adaptatively filters said central audio signal (10, 11) through a second adaptative filter (46) adapted by said second coding information (8).
- The method of claim 9, wherein said first adaptative filter (45) is identical to a first adaptative filter (42) used to code a first input audio signal (1) into said composite audio signal (3), and wherein said second adaptative filter (46) is identical to a second adaptative filter (43) used to code a second input audio signal (2) into said composite audio signal (3).
- A transmitter able to transmit a stereo audio signal, composed of a first input audio signal (1) and a second input audio signal (2), as a single composite audio signal (3), characterised in that it comprises means for implementing the steps of the coding method of any one of claims 1 to 6 and transmitting means for transmitting said composite audio signal (3).
- A receiver able to receive a stereo audio signal, transmitted as a single composite audio signal (3), and to deliver its components comprising a first output audio signal (14) and a second output audio signal (15), characterised in that it comprises receiving means for receiving said composite audio signal (3) and means for implementing the steps of the decoding method of any one of claims 8 to 10.
- A communication system able to transmit a stereo audio signal, composed of a first input audio signal (1) and a second input audio signal (2), as a single composite audio signal (3), and to deliver its components comprising a first output audio signal (14) and a second output audio signal (15), a transmitter according to claim 11 and a receiver according to claim 12.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP09305680A EP2285025A1 (en) | 2009-07-16 | 2009-07-16 | Method and apparatus for coding/decoding a stereo audio signal into a mono audio signal |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP09305680A EP2285025A1 (en) | 2009-07-16 | 2009-07-16 | Method and apparatus for coding/decoding a stereo audio signal into a mono audio signal |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| EP2285025A1 true EP2285025A1 (en) | 2011-02-16 |
Family
ID=42357918
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP09305680A Withdrawn EP2285025A1 (en) | 2009-07-16 | 2009-07-16 | Method and apparatus for coding/decoding a stereo audio signal into a mono audio signal |
Country Status (1)
| Country | Link |
|---|---|
| EP (1) | EP2285025A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3301673A1 (en) * | 2016-09-30 | 2018-04-04 | Nxp B.V. | Audio communication method and apparatus |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20030220800A1 (en) * | 2002-05-21 | 2003-11-27 | Budnikov Dmitry N. | Coding multichannel audio signals |
| US20070053521A1 (en) * | 1998-10-13 | 2007-03-08 | Victor Company Of Japan, Ltd. | Audio signal processing apparatus |
| EP1926082A1 (en) * | 2006-11-25 | 2008-05-28 | Deutsche Telekom AG | Process for scaleable encoding of stereo signals |
-
2009
- 2009-07-16 EP EP09305680A patent/EP2285025A1/en not_active Withdrawn
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20070053521A1 (en) * | 1998-10-13 | 2007-03-08 | Victor Company Of Japan, Ltd. | Audio signal processing apparatus |
| US20030220800A1 (en) * | 2002-05-21 | 2003-11-27 | Budnikov Dmitry N. | Coding multichannel audio signals |
| EP1926082A1 (en) * | 2006-11-25 | 2008-05-28 | Deutsche Telekom AG | Process for scaleable encoding of stereo signals |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP3301673A1 (en) * | 2016-09-30 | 2018-04-04 | Nxp B.V. | Audio communication method and apparatus |
| US10964332B2 (en) | 2016-09-30 | 2021-03-30 | Nxp B.V. | Audio communication method and apparatus for watermarking an audio signal with spatial information |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR100299528B1 (en) | Apparatus and method for encoding / decoding audio signal using intensity-stereo process and prediction process | |
| US7046695B2 (en) | System and method of disharmonic frequency multiplexing | |
| JP3168012B2 (en) | Method and apparatus for coding, manipulating and decoding audio signals | |
| US7272567B2 (en) | Scalable lossless audio codec and authoring tool | |
| US8340959B2 (en) | Method and apparatus for transmitting wideband speech signals | |
| TWI545931B (en) | Systems, methods and apparatus for transmitting data over a voice channel of a wireless telephone network | |
| KR20040048978A (en) | Method of transmission of wideband audio signals on a transmission channel with reduced bandwidth | |
| JP2001500344A (en) | Method and apparatus for improving the sound quality of a tandem vocoder | |
| US20090006104A1 (en) | Method of configuring codec and codec using the same | |
| JP3887651B2 (en) | Data transmission in multirate networks. | |
| EP2228791B1 (en) | Scalable lossless audio codec and authoring tool | |
| KR100465318B1 (en) | Transmiiter and receiver for wideband speech signal and method for transmission and reception | |
| EP2285025A1 (en) | Method and apparatus for coding/decoding a stereo audio signal into a mono audio signal | |
| JP5006975B2 (en) | Background noise information decoding method and background noise information decoding means | |
| JPS63110830A (en) | Frequency band dividing and encoding system | |
| Ding | Wideband audio over narrowband low-resolution media | |
| JP4936688B2 (en) | Relay device, communication terminal device, signal decoding device, signal processing method, and signal processing program | |
| US7181000B2 (en) | Voice transmission device and voice transmission system | |
| US20060143001A1 (en) | Method for the adaptation of comfort noise generation parameters | |
| CN113302686B (en) | Multipoint control method, multipoint telephone connection system and recording medium | |
| CN113366568B (en) | Multi-point control method, system and recording medium | |
| CN113302687B (en) | Multipoint control method, multipoint telephone connection system and recording medium | |
| JP2010034630A (en) | Sound transmission system | |
| JP3765615B2 (en) | Audio transmission device | |
| JP3217237B2 (en) | Loop type band division audio conference circuit |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR |
|
| AX | Request for extension of the european patent |
Extension state: AL BA RS |
|
| 17P | Request for examination filed |
Effective date: 20110816 |
|
| RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: ALCATEL LUCENT |
|
| 111Z | Information provided on other rights and legal means of execution |
Free format text: AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR Effective date: 20130410 |
|
| RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: ALCATEL LUCENT |
|
| D11X | Information provided on other rights and legal means of execution (deleted) | ||
| 17Q | First examination report despatched |
Effective date: 20160610 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
| 18D | Application deemed to be withdrawn |
Effective date: 20161021 |