Title of Invention

APPARATUSES AND METHODS FOR GENERATING A PARAMETER REPRESENTATION OF A MULTI-CHANNEL INPUT SIGNAL

Abstract The invention relates to an apparatus for generating a parameter representation of a multi-channel input signal having original channels, the original channels including a left channel (B), a right channel (D), a center channel (C), a rear left channel (A), and a rear right channel (E), comprising a parameter generator (203) for generating a first balance parameter (r1), a first coherence parameter or a first time difference parameter between a first channel pair, and for generating a second balance parameter (r2), between a second channel pair, and for generating a third balance parameter (r3) between a third channel pair, the balance parameters, coherence parameters or time parameters forming the parameter representation, wherein each channel of the two channel pair is one of the original channels or a weight or unweighted combination of the original channels, and wherein the first balance parameter (r1) is a left/right balance parameter, and wherein the first channel pair includes, as a first channel, a left-channel or a left down-mix channel and, as a second channel, a right channel, or a right down-mix channel, wherein the second balance parameter (r2) is a center balance parameter and the second channel pair includes, as a first channel, the center channel or a channel combination of original channels including the center channel, and, as a second channel, a channel combination including the left channel land the right channel, and wherein the third balance parameter (r3) is a front/back balance parameter and the third channel pair has, as a first channel, a channel combination including the rear-left channel and the rear-right channel and, as a second channel, a channel combination including a left channel and a right channel.
Full Text

FIELD OF THE INVENTION
The present invention relates to coding of multi-channel representations of audio
signals using spatial parameters. The present invention teaches new methods for
estimating and defining proper parameters for recreating a multi-channel signal
from a number of channels being less than the number of output channels. In
particular it aims at minimizing the bit rate for the multi-channel representation,
and providing a coded representation of the multi-channel signal enabling easy
encoding and decoding of the data for all possible channel configurations.
BACKGROUND OF THE INVENTION
It has been shown in US 7382886B2 "Efficient and scalable Parametric Stereo
Coding for Low Bit rate Audio Coding Applications", that it is possible to re-create
a stereo image that closely resembles to original stereo image, from a mono
signal given a very compact representation of the stereo image. The basic
principle is to divide the input signal into frequency bands and time segments,
and for these frequency bands and time segments, estimate inter-channel
intensity difference (IID), and inter-channel coherence (ICC). The first parameter
is a measurement of the power distribution between the two channels in the
specific frequency band and the second parameter is an estimation of the
correlation between the two channels for the specific frequency band. On the
decoder side the stereo image is recreated from the mono signal by distributing
the mono signal between the two output channels in accordance with the IID-
data, and by adding a decorrelated signal in order to retain the channel
correlation of the original stereo channels.

Document HERRE JET AL: "INTENSITY STEREO CODING" PREPRINTS OF
PAPERS PRESENTED AT THE AES CONVENTION, vol.96, no.3799, 26 February
1994, pages 1-10 XP009025131 discloses an apparatus for coding a multi-
channel audio signal in which left and center channel are combined and rear left
and rear right channel are combined in order to form a parameter representation
of the original channels.
For a multi-channel case (multi-channel in this context meaning more than two
output channels), several additional issues have to be accounted for. Several
multi-channel configurations exist. The most commonly known is the 5.1
configuration (center channel, front left/right, surround left/right, and the LFE
channel). However, many other configurations exist. From the complete
encoder/decoder systems point-of-view, it is desirable to have a system that can
use the same parameter set (e.g. IID and ICC) or sub-sets thereof for all channel
configurations. ITU-R BS.775 defines several down-mix schemes to be able to
obtain a channel configuration comprising fewer channels from a given channel
configuration. Instead of always having to decode all channels and rely on a
down-mix, it can be desirable to have a multi-channel representation that
enables a receiver to extract the parameters relevant for the channel
configuration at hand, prior to decoding the channels. Further, a parameter set
that is inherently scaleable is desirable from a scalable or embedded coding point

of view, where it is e.g. possible to store the data corresponding to the surround
channels in an enhancement layer in the bitstream.
Contrary to the above it can also be desirable to be able to use different
parameter definitions based on the characteristics of the signal being processed,
in order to switch between the parameterization that results in the lowest bit rate
overhead for the current signal segment being processed.
Another representation of multi-channel signals using a sum signal or down mix signal and additional parametric side information is known in the art as binaural
cue coding (BCC). This technique is described in "Binaural Cue Coding - Part 1:
Psycho-Acoustic Fundamentals and Design Principles", IEEE Transactions on
Speech and Audio Processing, vol.11, No.6, November 2003, F. Baumgarte

"Binaural Cue Coding. Part II: Schemes and Applications",
IEEE Transactions on Speech and Audio Processing vol. 11,
No. 6, November 2003, C. Faller and F. Baumgarte.
Generally, binaural cue coding is a method for multi-
channel spatial rendering based on one down-mixed audio
channel and side information. Several parameters to be cal-
culated by a BCC encoder and to be used by a BCC decoder
for audio reconstruction or audio rendering include inter-
channel level differences, inter-channel time differences,
and inter-channel coherence parameters. These inter-channel
cues are the determining factor for the perception of a
spatial image. These parameters are given for blocks of
time samples of the original multi-channel signal and are
also given frequency-selective so that each block of multi-
channel signal samples have several cues for several fre-
quency bands. In the general case of C playback channels,
the inter-channel level differences and the inter-channel
time differences are considered in each subband between
pairs of channels, i.e., for each channel relative to a
reference channel. One channel is defined as the reference
channel for each inter-channel level difference. With the
inter-channel level differences and the inter-channel time
differences, it is possible to render a source to any di-
rection between one of the loudspeaker pairs of a playback
set-up that is used. For determining the width or diffuse-
ness of a rendered source, it is enough to consider one pa-
rameter per subband for all audio channels. This parameter
is the inter-channel coherence parameter. The width of the
rendered source is controlled by modifying the subband sig-
nals such that all possible channel pairs have the same in-
ter-channel coherence parameter.
In BCC coding, all inter-channel level differences are de-
termined between the reference channel 1 and any other
channel. When, for example, the center channel is deter-
mined to be the reference channel, a first inter-channel
level difference between the left channel and the centre

channel, a second inter-channel level difference between
the right channel and the centre channel, a third inter-
channel level difference between the left surround channel
and the center channel, and a forth inter-channel level
difference between the right surround channel and the cen-
ter channel are calculated. This scenario describes a five-
channel scheme. When the five-channel scheme additionally
includes a low frequency enhancement channel, which is also
known as a "sub-woofer" channel, a fifth inter-channels
level difference between the low frequency enhancement
channel and the center channel, which is the single refer-
ence channel, is calculated.
When reconstructing the original multi-channel using the
single down mix channel, which is also termed as the "mono"
channel, and the transmitted cues such as ICLD (Interchan-
nel Level Difference), ICTD (Interchannel Time Difference),
and ICC (Interchannel Coherence), the spectral coefficients
of the mono signal are modified using these cues. The level
modification is performed using a positive real number de-
termining the level modification for each spectral coeffi-
cient. The inter-channel time difference is generated using
a complex number of magnitude of one determining a phase
modification for each spectral coefficient. Another func-
tion determines the coherence influence. The factors for
level modifications of each channel are computed by firstly
calculating the factor for the reference channel. The fac-
tor for the reference channel is computed such that for
each frequency partition, the sum of the power of all chan-
nels is the same as the power of the sum signal. Then,
based on the level modification factor for the reference
channel, the level modification factors for the other chan-
nels are calculated using the respective ICLD parameters.
Thus, in order to perform BCC synthesis, the level modifi-
cation factor for the reference channel is to be calcu-
lated. For this calculation, all ICLD parameters for a fre-
quency band are necessary. Then, based on this level modi-

fication for the single channel, the level modification
factors for the other channels, i.e., the channels, which
are not the reference channel, can be calculated.
This approach is disadvantageous in that, for a perfect re-
construction, one needs each and every inter-channel level
difference. This requirement is even more problematic, when
an error-prone transmission channel is present. Each error
within a transmitted inter-channel level difference will
result in an error in the reconstructed multi-channel sig-
nal, since each inter-channel level difference is required
to calculate each one of the multi-channel output signal.
Additionally, no reconstruction is possible, when an inter-
channel level difference has been lost during transmission,
although this inter-channel level difference was only nec-
essary for e.g. the left surround channel or the right sur-
round channel, which channels are not so important to
multi-channel reconstruction, since most of the information
is included in the front left channel, which is subse-
quently called the left channel, the front right channel,
which is subsequently called the right channel, or the cen-
ter channel. This situation becomes even worse, when the
inter-channel level difference of the low frequency en-
hancement channel has been lost during transmission. In
this situation, no or only an erroneous multi-channel re-
construction is possible, although the low frequency en-
hancement channel is not so decisive for the listeners'
listening comfort. Thus, errors in a single inter-channel
level difference are propagated to errors within each of
the reconstructed output channels.
Additionally, the existing BCC scheme, which is also de-
scribed in AES convention paper 5574, "Binaural Cue Coding
applied to Stereo and Multi-channel Audio Compression",
C. Faller, F. Baumgarte, May 10 to 13, 2002, Munich, Ger-
many, is not so well-suited, when an intuitive listening
scenario is considered because of the single reference
channel. It is not natural for a human being, which is, of

course, the ultimate
goal of the whole audio processing, that everything is related to a single
reference channel. Instead, a human being has two ears, which are positioned at
different sides of the human being's head. Thus, a human being's natural
listening impression is, whether a signal is balanced more to the left or more to
the right, or is balanced between the front and back. Contrary thereto, it is
unnatural for a human being to feel whether a certain sound source in the
auditory field is in a certain balance between each speaker with respect to a
single reference speaker. This disvergence between the natural listening
impression on the one hand and the mathematical/physical model of BCC on the
other hand may lead to negative consequences of the encoding scheme, when
bit rate requirements, scalability requirements, flexibility requirements,
reconstruction artefact requirements, or error-robustness requirements are
considered.
SUMMARY OF THE INVENTION
It is an object of the present invention to provide an improved concept for
presenting multi-channel audio signals.
This object is achieved by apparatuses and methods for generating parameter representation and a reconstructed multi-channel representation of a multi-
channel input signal.

The present invention is based on the finding that, for a multi-channel
representation, one has to rely on balance parameters between channel pairs.
Additionally, it has been found out that a multi-channel signal parameter
representation is possible by providing at least two different balance parameters,
which indicate a balance between two different channel pairs. In particular,
flexibility, scalability, error-robustness, and even bit rate efficiency are the result
of the fact that the first channel pair, which is the basis for the first balance
parameter is different from the second channel pair, which is the basis for the
second balance parameters, wherein the four channels forming these channel
pairs are all different from each other.
Thus, the inventive concept departs from the single reference channel concept
and uses a multi-balance or super-balance concept, which is more intuitive and
more natural for a human being's sound impression. In particular, the channel
pairs underlying the first and second balance parameters can include original
channels, down-mix channels, or preferably, certain combinations between input
channels.
It has been found out that a balance parameter derived from the center channel
as the first channel and a sum of the left original channel and the right original
channel as the second channel of the channel pair is especially useful for
providing an exact energy distribution between the center channel and the left
and right channels. It is to be noted in this context that these three channels
normally include most information of the audio scene, wherein particularly the
left-right stereo localization is not only influenced by the balance between left
-and right but also by the balance between center and the sum of left and right.
This

flexible, error-robust, and to a large extent artefact-
free.
On the receiver-side, in contrast to BCC synthesis in which
each channel is calculated by the transmitted information
alone, the inventive multi-balance representation addition-
ally makes use of information on the down-mixing scheme
used for generating the down-mix channel(s). Thus, in ac-
cordance with the present invention, information on the
down-mixing scheme, which is not used in prior art systems,
is also used for up-mixing in addition to the balance pa-
rameter. The up-mixing operation is, therefore, performed
such that the balance between the channels within a recon-
structed multi-channel signal forming a channel pair for a
balance parameter is determined by the balance parameter.
This concept, i.e., having different channel pairs for dif-
ferent balance parameters, makes it possible to generate
some channels without knowledge of each and every transmit-
ted balance parameter. In particular, in accordance with
the present invention, the left, right and center channels
can be reconstructed without any knowledge on any rear-
left/rear-right balance or without any knowledge on a
front/back balance. This effect allows the very fine-tuned
scalability, since extracting an additional parameter from
a bit stream or transmitting an additional balance parame-
ter to a receiver consequently allows the reconstruction of
one or more additional channels. This is in contrast to the
prior art single-reference system, in which one needed each
and every inter-channel level difference for reconstructing
all or only a subgroup of all reconstructed output chan-
nels.
The inventive concept is also flexible in that the choice
of the balance parameters can be adapted to a certain re-
construction environment. When, for example, a five-channel
set-up forms the original multi-channel signal set-up, and
when a four-channel set-up forms a reconstruction multi-

channel set-up, which has only a single surround speaker,
which is e.g. positioned behind the listener, a front-back
balance parameter allows calculating the combined surround
channel without any knowledge on the left surround channel,
and the right surround channel. This is in contrast to a
single-reference channel system, in which one has to ex-
tract an inter-channel level difference for the left sur-
round channel and an inter-channel level difference for the
right surround channel from the data stream. Then, one has
to calculate the left surround channel and the right sur-
round channel. Finally, one has to add both channels to ob-
tain the single surround speaker channel for a four-channel
reproduction set-up. All these steps do not have to be per-
formed in the more-intuitive and more user-directed balance
parameter representation, since this representation auto-
matically delivers the combined surround channel because of
the balance parameter representation, which is not tied to
a single reference channel, but which also allows to use a
combination of original channels as a channel of a balance
parameter channel pair.
The present invention relates to the problem of a param-
eterized multi-channel representation of audio signals. It
provides an efficient manner to define the proper parame-
ters for the multi-channel representation and also the
ability to extract the parameters representing the desired
channel configuration without having to decode all chan-
nels. The invention further solves the problem of choosing
the optimal parameter configuration for a given signal seg-
ment in order to minimize the bit rate required to code the
spatial parameters for the given signal segment. The pre-
sent invention also outlines how to apply the decorrelation
methods previously only applicable for the two channel case
in a general multi-channel environment.
In preferred embodiments, the present invention comprises
the following features:

Down-mix the multi-channel signal to a one or two chan-
nel representation on the encoders side;
Given the multi-channel signal, define the parameters
representing the multi-channel signals, either in a
flexible on a per-frame basis in order to minimize bit
rate or in order to enable the decoder to extract the
channel configuration on a bitstream level;
At the decoder side extract the relevant parameter set
given the channel configuration currently supported by
the decoder;
Create the required number of mutually decorrelated sig-
nals given the present channel configuration;
Recreate the output signals given the parameter set de-
coded from the bitstream data, and the decorrelated sig-
nals .
Definition of a parameterization of the multi-channel
audio signal, such that the same parameters or a subset
of the parameters can be used irrespective of the chan-
nel configuration.
Definition of a parameterization of the multi-channel
audio signal, such that the parameters can be used in a
scalable coding scheme, where subsets of the parameter
set are transmitted in different layers of the scalable
stream.
Definition of a parameterization of the multi-channel
audio signal, such that the energy reconstruction of the
output signals from the decoder is not impaired by the
underlying audio codec used to code the downmixed sig-
nal .

- Switching between different parameterizations of the
multi-channel audio signal, such that the bit rate over-
head for coding the parameterization is minimized.
- Definition of a parameterization of the multi-channel
audio signal, in which a parameter is included repre-
senting the energy correction factor for the downmixed
signal.
- Usage of several mutually decorrelated decorrelators to
re-create the multi-channel signal.
- Re-create the multi-channel signal from an upmix matrix
H that is calculated based on the transmitted parameter
set.
BRIEF DESCRIPTION OF THE ACCOMPANYING DRAWINGS

The present invention will now be described by way of il-
lustrative examples, not limiting the scope or spirit of
the invention, with reference to the accompanying draw-
ings, in which:
Fig. 1 illustrates a nomenclature used for a 5.1. chan-
nel configuration as used in the present inven-
tion;
Fig. 2 illustrates a possible encoder implementation of
the present invention;
Fig. 3 illustrates a possible decoder implementation of
the present invention;
Fig. 4 illustrates one preferred parameterization of
the multi-channel signal according to the pre-
sent invention;

Fig. 5 illustrates one preferred parameterization of
the multi-channel signal according to the pre-
sent invention;
Fig. 6 illustrates one preferred parameterization of
the multi-channel signal according to the pre-
sent invention;
Fig. 7 illustrates a schematic set-up for a down-mixing
scheme generating a single base channel or two
base channels;
Fig. 8 illustrates a schematic representation of an up-
mixing scheme, which is based on the inventive
balance parameters and information on the down-
mixing scheme;
Fig. 9a illustrates a determination of a level parameter
on an encoder-side;
Fig. 9b illustrates the usage of the level parameter on
the decoder-side;
Fig. 10a illustrates a scalable bit stream having differ-
ent parts of the multi-channel parameterization
in different layers of the bit stream;
Fig. 10b illustrates a scalability table indicating which
channels can be constructed using which balance
parameters, and which balance parameters and
channels are not used or calculated; and
Fig. 11 illustrates the application of the up-mix matrix
according to the present invention.
DESCRIPTION OF PREFERRED EMBODIMENTS

The below-described embodiments are merely illustrative
for the principles of the present invention on multi-
channel representation of audio signals. It is understood
that modifications and variations of the arrangements and
the details described herein will be apparent to others
skilled in the art. It is the intent, therefore, to be
limited only by the scope of the impending patent claims
and not by the specific details presented by way of de-
scription and explanation of the embodiments herein.
In the following description of the present invention out-
lining how to parameterize IID and ICC parameters, and how
to apply them in order to re-create a multi-channel repre-
sentation of audio signals, it is assumed that all referred
signals are subband signals in a filterbank, or some other
frequency selective representation of a part of the whole
frequency range for the corresponding channel. It is there-
fore understood, that the present invention is not limited
to a specific filterbank, and that the present invention is
outlined below for one frequency band of the subband repre-
sentation of the signal, and that the same operations apply
to all of the subband signals.
Although a balance parameter is also termed to be a "inter-
channel intensity difference (IID)" parameter, it is to be
emphasized that a balance parameter between a channel pair
does not necessarily has to be the ratio between the energy
or intensity in the first channel of the channel pair and
the energy or intensity of the second channel in the chan-
nel pair. Generally, the balance parameter indicates the
localization of a sound source between the two channels of
the channel pair. Although this localization is usually
given by energy/level/intensity differences, other charac-
teristics of a signal can be used such as a power measure
for both channels or time or frequency envelopes of the
channels, etc.

In Fig. 1 the different channels for a 5.1 channel configu-
ration are visualized, where a(t) 101 represents the left
surround channel, b(t) 102 represents the left front chan-
nel, c(t) 103 represents the center channel, d(t) 104
represents the right front channel, e(t) 105 represents the
right surround channel, and f(t) 106 represents the LFE
(low frequency effects) channel.
Assuming that we define the expectancy operator as

and thus the energies for the channels outlined above can
be defined according to (here exemplified by the left sur-
round channel):

The five channels are on the encoder side down-mixed to a
two channel representation or a one channel representation.
This can be done in several ways, and one commonly used is
the ITU down-mix defined according to:
The 5.1 to two channel down-mix:

And the 5.1 to one channel down-mix:

Commonly used values for the constants α,β,γ and δ are

The IID parameters are defined as energy ratios of two ar-
bitrarily chosen channels or weighted groups of channels.
Given the energies of the channels outlined above for the
5.1 channel configuration several sets of IID parameters
can be defined.

Fig. 7 indicates a general down-mixer 700 using the above-
referenced equations for calculating a single-based chan-
nel m or two preferably stereo-based channels ld and rd.
Generally, the down-mixer uses certain down-mixing informa-
tion. In the preferred embodiment of a linear down-mix,
this down-mixing information includes weighting factors α,
β,γ, and δ. It is known in the art that more or less con-
stant or non-constant weighting factors can be used.
In an ITU recommended down-mix, a is set to 1, β and γ are
set to be equal, and equal to the square root of 0.5, and δ
is set to 0. Generally, the factor α can vary between 1.5
and 0.5. Additionally, the factors β, and γ can be differ-
ent from each other, and vary between 0 and 1. The same is
true for the low frequency enhancement channel f(t). The
factor δ for this channel can vary between 0 and 1. Addi-
tionally, the factors for the left-down mix and the right-
down mix do not have to be equal to each other. This be-
comes clear, when a non-automatic down-mix is considered,
which is, for example, performed by a sound engineer. The
sound engineer is more directed to perform a creative down-
mix rather than a down-mix, which is guided by any math-
ematic laws. Instead, the sound engineer is guided by his
own creative feeling. When this "creative" down-mixing is
recorded by a certain parameter set, it will be used in ac-
cordance with the present invention by an inventive up-
mixer as shown in Fig. 8, which is not only guided by the
parameters, but also by additional information on the down-
mixing scheme.
When a linear down-mix has been performed as in Fig. 7, the
weighting parameters are the preferred information on the
down-mixing scheme to be used by the up-mixer. When, how-
ever, other information is present, which are used in the
down-mixing scheme, this other information can also be used
by an up-mixer as the information on the down-mixing
scheme. Such other information can, for example, be certain
matrix elements or certain factors or functions within ma-

trix elements of an upmix-matrix as, for example, indicated
in Fig. 11.
Given the 5.1 channel configuration outlined in Figure 1
and observing how other channel configurations relate to
the 5.1 channel configuration: For a three channel case
where no surround channels are available, i.e. B, C, and D
are available according to the notation above. For a four
channel configuration B, C and D are available but also a
combination of A and E representing the single surround
channel, or more commonly denoted in this context, the back
channel.
The present invention defines IID parameters that apply to
all these channels, i.e. the four channel subset of the
5.1. channel configuration has a corresponding subset
within the IID parameter set describing the 5.1 channels.
The following IID parameter set solves this problem:

It is evident that the r1 parameter corresponds to the en-
ergy ratio between the left down-mix channel and the right
channel down-mix. The r2 parameter corresponds to the en-
ergy ratio between the center channel and the left and
right front channels. The r3 parameter corresponds to the
energy ratio between the three front channels and the two
surround channels. The r4 parameter corresponds to the en-

ergy ratio between the two surround channels. The r5 pa-
rameter corresponds to the energy ratio between the LFE
channel and all other channels.
In Fig. 4 the energy ratios as explained above are illus-
trated. The different output channels are indicated by 101
to 105 and are the same as in Fig.l and are hence not
elaborated on further here. The speaker set-up is divided
into a left and a right half, where the center channel 103
are part of both halves. The energy ratio between the left
half plane and the right half plane is exactly the parame-
ter referred to as r1 according to the present invention.
This is indicated by the solid line below r1 in Fig. 4.
Furthermore, the energy distribution between the center
channel 103 and the left front 102 and right front 103
channels are indicated by r2 according to the present in-
vention. Finally, the energy distribution between the en-
tire front channel set-up (102, 103 and 104) and the back
channels (101 and 105) are illustrated by the arrow in Fig.
5 by the r3 parameter.
Given the parameterization above and the energy of the
transmitted single down-mixed channel:
M = -(a2 (B + D) + J32 (A + E) + 2y2C + 2S2F) ,
the energies of the reconstructed channels can be expressed
as:



Hence the energy of the M signal can be distributed to the
re-constructed channels resulting in re- constructed chan-
nels having the same energies as the original channels.
The above-preferred up-mixing scheme is illustrated in
Fig. 8. It becomes clear from the equations for F, A, E, C,
B, and D that the information on the down-mixing scheme to
be used by the up-mixer are the weighting factors α,β,γ,
and δ, which are used for weighting the original channels
before such weighted or unweighted channels are added to-
gether or subtracted from each other in order to arrive at
a number of down-mix channels, which is smaller than the
number of original channels. Thus, it is clear from Fig. 8
that in accordance with the present invention, the energies
of the reconstructed channels are not only determined by
the balance parameters transmitted from an encoder-side to
a decoder-side, but are also determined by the down-mixing
factor α,β,γ, and δ.
When Fig. 8 is considered, it becomes clear that, for cal-
culating the left and right energies B and D the already
calculated channel energies F, A, E, C, are used within the
equation. This, however, does not necessarily imply a se-
quential up-mixing scheme. Instead, for obtaining a fully
parallel up-mixing scheme, which is, for example, performed
using a certain up-mixing matrix having certain up-mixing
matrix elements, the equations for A, C, E, and F are in-
serted into the equations for B and D. Thus, it becomes
clear that reconstructed channel energy is only determined
by balance parameters, the down-mix channel(s), and the in-
formation on the down-mixing scheme such as the down-mixing
factors.

When Fig. 8 is considered, it becomes clear that, for
calculating the left and right energies B and D the already
calculated channel energies F, A, E, C, are used within the
equation. This, however, does not necessarily imply a
sequential up-mixing scheme. Instead, for obtaining a fully
parallel up-mixing scheme, which is, for example, performed
using a certain up-mixing matrix having certain up-mixing
matrix elements, the equations for A, C, E, and F are
inserted into the equations for B and D. Thus, it becomes
clear that reconstructed channel energy is only determined
by Dalance parameters, the down-mix channel (s), and the
information on the down-mixing scheme such as the down-
mixing factors.
Given the above IID parameters it is evident that the
problem of defi.ning a parameter set of IID parameters that
can be used for several channel configurations has been
solved as will be obvious from the below. As an example,
observing the three channel configuration (i.e. recreating
three front channels from one available channel), it is
evident that the r3, r4 and r5 parameters are obsolete since
the A, E and F channels do not exist. It is also evident
that the parameters r1 and r2 are sufficient to recreate
the three channels from a downmixed single channel since r1
describes the energy ratio between the left and right front
channels, and r2 describes the energy ratio between the
center channel and the left and right front channels.
In the more general case it is easily seen that the IID
parameters (r1 . . . r5) as defined above apply to all subsets
of recreating n channels from m channels where m Observing Fig. 4 it can be said:
- For a system recreating 2 channels from 1 channel,
sufficient information to retain the correct energy
ratio between the channels is obtained from the r1
parameter;

- For a system recreating 5.1 channels from 1 channel,
sufficient information to retain the correct energy ra-
tio between the channels is obtained from the r1, r2, r3,
r4 and r5 parameters;
- For a system recreating 5.1 channels from 2 channels,
sufficient information to retain the correct energy ra-
tio between the channels is obtained from the r2, r3, r4
and r5 parameters.
The above described scalability feature is illustrated by
the table in Fig. 10b. The scalable bit stream illustrated
in Fig. 10a and explained later on can also be adapted to
the table in Fig. 10b for obtaining a much finer scalabil-
ity than shown in Fig. 10a.
The inventive concept is especially advantageous in that
the left and right channels can be easily reconstructed
from a single balance parameter r1 without knowledge or ex-
traction of any other balance parameter. To this end, in
the equations for B, D in Fig. 8, the channels A, C, F, and
E are simply set to zero.
Alternatively, when only the balance parameter r2 is con-
sidered, the reconstructed channels are the sum between the
center channel and the low frequency channel (when this
channel is not set to zero) on the one hand and the sum be-
tween the left and right channels on the other hand. Thus,
the center channel on the one hand and the mono signal on
the other hand can be reconstructed using only a single pa-
rameter. This feature can already be useful for a simple 3-
channel representation, where the left and right signals
are derived from the sum of left and right such as by halv-
ing, and where the energy between the center and the sum of
left and right is exactly determined by the balance parame-
ter r2.

- For a system recreating 5.1 channels from 1 channel,
sufficient information to retain the correct energy ra-
tio between the channels is obtained from the rl r2, r3,
r4 and r5 parameters;
- For a system recreating 5.1 channels from 2 channels,
sufficient information to retain the correct energy ra-
tio between the channels is obtained from the r2, r3, r4
and r5 parameters.
The above described scalability feature is illustrated by
the table in Fig. 10b. The scalable bit stream illustrated
in Fig. 10a and explained later on can also be adapted to
the table in Fig. 10b for obtaining a much finer scalabil-
ity than shown in Fig. 10a.
The inventive concept is especially advantageous in that
the left and right channels can be easily reconstructed
from a single balance parameter r1 without knowledge or ex-
traction of any other balance parameter. To this end, in
the equations for B, D in Fig. 8, the channels A, C, F, and
E are simply set to zero.
Alternatively, when only the balance parameter r2 is con-
sidered, the reconstructed channels are the sum between the
center channel and the low frequency channel (when this
channel is not set to zero) on the one hand and the sum be-
tween the left and right channels on the other hand. Thus,
the center channel on the one hand and the mono signal on
the other hand can be reconstructed using only a single pa-
rameter. This feature can already be useful for a simple 3-
channel representation, where the left and right signals
are derived from the sum of left and right such as by halv-
ing, and where the energy between the center and the sum of
left and right is exactly determined by the balance parame-
ter r2.

In this context, the balance parameters r1 or r2 are situ-
ated in a lower scaling layer.
As to the second entry in the Fig. 10b table, which indi-
cates how 3 channels B, D, and the sum between C and F can
be generated using only two balance parameters instead of
all 5 balance parameters, one of those parameters r1 and r2
can already be in a higher scaling layer than the parameter
r1 or r2, which is situated in the lower scaling layer.
When the equations in Fig. 8 are considered, it becomes
clear that, for calculating C, the non-extracted parameter
r5 and the other non-extracted parameter r3 are set to 0.
Additionally, the non-used channels A, E, F are also set to
0, so that the 3 channels B, D, and the combination between
the center channel C and the low frequency enhancement
channel F can be calculated.
When a 4-channel representation is to be up-mixed, it is
sufficient to only extract parameters r1, r2, and r3 from
the parameter data stream. In this context, r3 could be in
a next-higher scaling layer than the other parameter r1 or
r2. The 4-channel configuration is specially suitable in
connection with the super-balance parameter representation
of the present invention, since, as it will be described
later on in connection with Fig. 6, the third balance pa-
rameter r3 already is derived from a combination of the
front channels on the one hand and the back channels on the
other hand. This is due to the fact that the parameter r3
is a front-back balance parameter, which is derived from
the channel pair having, as a first channel, a combination
of the back channels A and E, and having, as the front
channels, a combination of left channel B, right channel E,
and center channel C.
Thus, the combined channel energy of both surround channels
is automatically obtained without any further separate cal-

culation and subsequent combination, as would be the case
in a single reference channel set-up.
When 5 channels have to be recreated from a single channel,
the further balance parameter r4 is necessary. This parame-
ter r4 can again be in a next-higher scaling layer.
When a 5.1 reconstruction has to be performed, each balance
parameter is required. Thus, a next-higher scaling layer
including the next balance parameter r5 will have to be
transmitted to a receiver and evaluated by the receiver.
However, using the same approach of extending the IID pa-
rameters in accordance to the extended number of channels,
the above IID parameters can be extended to cover channel
configuration s with a larger number of channels than the
5.1 configuration. Hence the present invention is not lim-
ited to the examples outlined above.
Now observing the case were the channel configuration is a
5.1 channel configuration this being one of the most com-
monly used cases. Furthermore, assume that the 5.1. chan-
nels are recreated from two channels. A different set of
parameters can for this case be defined by replacing the
parameters r3 and r4 by:

The parameters q3 and g4 represent the energy ratio between
the front and back left channels, and the energy ratio be-
tween the front and back right channels. Several other
parameterizations can be envisioned.

In Fig. 5 the modified parameterization is visualized. In-
stead of having one parameter outlining the energy distri-
bution between the front and back channels (as was outlined
by r3 in Fig. 4) and a parameter describing the energy dis-
tribution between the left surround channel and the right
surround channel (as was outlined by r4 in Fig. 4) the pa-
rameters q3 and q4 are used describing the energy ratio be-
tween the left front 102 and left surround 101 channel, and
the energy ratio between the right front channel 104 and
right surround channel 105.
The present invention teaches that several parameter sets
can be used to represent the multi-channel signals. An ad-
ditional feature of the present invention is that different
parameterizations can be chosen dependent on the type of
quantization of the parameters that is used.
As an example, a system using coarse quantization of the
parameterization, due to high bit rate constraints, a
parameterization should be used that does not amplify er-
rors during the upmixing process.
Observing two of the expressions above for the recon-
structed energies in a system that re-creates 5.1 channels
from one channel:

It is evident that the subtractions can yield large varia-
tions of the B and D energies due to quite small quantiza-
tion effects of the M, A, C, and F parameters.
According to the present invention a different parameteri-
zation should be used that is less sensitive to quantiza-
tion of the parameters. Hence, if coarse quantization is
used, the r1 parameter as defined above:


can be replaced by the alternative definition according to:

This yields equations for the reconstructed energies ac-
cording to:

and the equations for the reconstructed energies of A, E, C
and F stay the same as above. It is evident that this
parameterization represents a more well conditioned system
from a quantization point of view.
In Fig. 6 the energy ratios as explained above are illus-
trated. The different output channels are indicated by 101
to 105 and are the same as in Fig.1 and are hence not
elaborated on further here. The speaker set-up is divided
into a front part and a back part. The energy distribution
between the entire front channel set-up (102, 103 and 104)
and the back channels (101 and 105) are illustrated by the
arrow in Fig. 6 indicated by the r3 parameter.
Another important noteworthy feature of the present inven-
tion is that when observing the parameterization

it is not only a more well conditioned system from a quan-
tization point of view. The above parameterization also has
the advantage that the parameters used to reconstruct the
three front channels are derived without any influence of


as taught by the present invention, since the back channels
are not included in the estimation of the parameters used
on the decoder side to re-create the front channels.
The energy distribution between the center channel 103 and
the left front 102 and right front 103 channels are indi-
cated by r2 according to the present invention. The energy
distribution between the left surround channel 101 and the
right surround channel 105 is illustrated by r4. Finally,
the energy distribution between the left front channel 102
and the right front channel 104 is given by r1. As is evi-
dent all parameters are the same as outlined in Fig.4 apart
from rl that here corresponds to the energy distribution
between the left front speaker and the right front speaker,
as opposed to the entire left side and the entire right
side. For completeness the parameter r5 is also given out-
lining the energy distribution between the center channel
103 and the lfe channel 106.
Fig. 6 shows an overview of the preferred parameterization
embodiment of the present invention. The first balance pa-
rameter r1 (indicated by the solid line) constitutes a
front-left/front-right balance parameter. The second bal-
ance parameter r2 is a center left-right balance parameter.
The third balance parameter r3 constitutes a front/back
balance parameter. The forth balance parameter r4 consti-
tutes a rear-left/rear-right balance parameter. Finally,
the fifth balance parameter r5 constitutes a center/lfe
balance parameter.
Fig. 4 shows a related situation. The first balance parame-
ter r1, which is illustrated in Fig. 4 by solid lines in
case of a down-mix-left/right balance can be replaced by an
original front-left/front-right balance parameter defined

the surround channels. One could envision a parameter r2
that describes the relation between the center channel and
all other channels. However, this would have the drawback
that the surround channels would be included in the estima-
tion of the parameters describing the front channels.
Remembering that the, in the present invention, described
parameterization also can be applied to measurements of
correlation or coherence between channels, it is evident
that including the back channels in the calculation of r2
can have significant negative influence of the success of
re-creating the front channels accurately.
As an example, one could imagine a situation with the same
signal in all the front channels, and completely uncorre-
lated signals in the back channels. This is not uncommon,
given that the back channels are frequently used to re-
create ambience information of the original sound.
If the center channel is described in relation to all other
channels, the correlation measure between the center and
the sum of all other channels will be rather low, since the
back channels are completely uncorrelated. The same will be
true for a parameter estimating the correlation between the
front left/right channels, and the back left/right chan-
nels .
Hence, we arrive with a parameterization that can recon-
struct the energies correctly, but that does not include
the information that all front channels were identical,
i.e. strongly correlated. It does include the information
that the left and right front channels are decorrelated to
the back channels, and that the center channel is also
decorrelated to the back channels. However, the fact that
all front channels are the same is not derivable from such
a parameterization.
This is overcome by using the parameterization


between the channels B and D as the underlying channel
pair. This is illustrated by the dashed line r1 in Fig. 4
and corresponds to the solid line r1 in Fig. 5 and Fig. 6.
In a two-base channel situation, the parameters r3 and r4,
i.e. the front/back balance parameter and the rear-
left/right balance parameter are replaced by two single-
sided front/rear parameters. The first single-sided
front/rear parameter q3 can also be regarded as the first
balance parameter, which is derived from the channel pair
consisting of the left surround channel A and the left
channel B. The second single-sided front/left balance pa-
rameter is the parameter q4, which can be regarded as the
second parameter, which is based on the second channel pair
consisting of the right channel D and the right surround
channel E. Again, both channel pairs are independent from
each other. The same is true for the center/left-right bal-
ance parameter r2, which have, as a first channel, a center
channel C, and as a second channel, the sum of the left and
right channels B, and D.
Another parameterization that lends itself well to coarse
quantization for a system re-creating 5.1 channels from one
or two channel is defined according to the present inven-
tion below.
For the one to 5.1 channels:

And for the two to 5.1 channels case:

It is evident that the above parameterizations include more
parameters than is required from the strictly theoretical
point of view to correctly re-distribute the energy of the
transmitted signals to the re-created signals. However, the
parameterization is very insensitive to quantization er-
rors.

The above-referenced parameter set for a two-base channel
set-up, makes use of several reference channels. In con-
trast to the parameter configuration in Fig. 6, however,
the parameter set in Fig. 7 solely relies on down-mix chan-
nels rather than original channels as reference channels.
The balance parameters q1, q3, and q4 are derived from com-
pletely different channel pairs.
Although several inventive embodiments have been described,
in which the channel pairs for deriving balance parameters
include only original channels (Fig. 4, Fig. 5, Fig. 6) or
include original channels as well as down-mix channels
(Fig. 4, Fig. 5) or solely rely on the down-mix channels as
the reference channels as indicated at the bottom of
Fig. 7, it is preferred that the parameter generator in-
cluded within the surround data encoder 206 of Fig. 2 is
operative to only use original channels or combinations of
original channels rather than a base channel or a combina-
tion of base channels for the channels in the channel
pairs, on which the balance parameters are based. This is
due to the fact that one cannot completely guarantee that
there does not occur an energy change to the single base
channel or the two stereo base channels during their trans-
mission from a surround encoder to a surround decoder. Such
energy variations to the down-mix channels or the single
down-mix channel can be caused by an audio encoder 205
(Fig. 2) or an audio decoder 302 (Fig. 3) operating under a
low-bit rate condition. Such situations can result in ma-
nipulation of the energy of the mono down-mix channel or
the stereo down-mix channels, which manipulation can be
different between the left and right stereo down-mix chan-
nels, or can even be frequency-selective and time-
selective .
In order to be completely safe against such energy varia-
tions, an additional level parameter is transmitted for
each block and frequency band for every downmix channel in

accordance with the present invention. When the balance pa-
rameters are based on the original signal rather than the
down-mix signal, a single correction factor is sufficient
for each band, since any energy correction will not influ-
ence a balance situation between the original channels.
Even when no additional level parameter is transmitted, any
down-mix channel energy variations will not result in a
distorted localization of sound sources in the audio image
but will only result in a general loudness variation, which
is not as annoying as a migration of a sound source caused
by varying balance conditions.
It is important to note that care needs to be taken so that
the energy M (of the down-mixed channels), is the sum of
the energies B, D, A, E, C and F as outlined above. This is
not always the case due to phase dependencies between the
different channels being down-mixed in to one channel. The
energy correction factor can be transmitted as an addi-
tional parameter rM , and the energy of the downmixed sig-
nal received on the decoder side is thus defined as:

In Fig. 9 the application of the additional parameter rM is
outlined. The downmixed input signal is modified by the rM
parameter in 901 prior to sending it into the upmix modules
of 701 - 705. These are the same as in Fig. 7 and will
therefore not be elaborated on further. It is obvious for
those skilled in the art that the parameter rM for the sin-
gle channel downmix example above, can be extended to be
one parameter per downmix channel, and is hence not limited
to a single downmix channel.
Fig. 9a illustrates an inventive level parameter calcula-
tor 900, while Fig. 9b indicates an inventive level correc-
tor 902. Fig. 9a indicates the situation on the encoder-
side, and Fig. 9b illustrates the corresponding situation
on the decoder-side. The level parameter or "addiricnal"

parameter rM is a correction factor giving a certain energy
ratio. To explain this, the following exemplary scenario is
assumed. For a certain original multi-channel signal, there
exists a "master down-mix" on the one hand and a "parameter
down-mix" on the other hand. The master down-mix has been
generated by a sound engineer in a sound studio based on,
for example, subjective quality impressions. Additionally,
a certain audio storage medium also includes the parameter
down-mix, which has been performed by for example the sur-
round encoder 203 of Fig. 2. The parameter down-mix in-
cludes one base channel or two base channels, which base
channels form the basis for the multi-channel reconstruc-
tion using the set of balance parameters or any other para-
metric representation of the original multi-channel signal.
There can be the case, for example, that a broadcaster
wishes to not transmit the parameter down-mix but the mas-
ter down-mix from a transmitter to a receiver. Addition-
ally, for upgrading the master down-mix to multi-channel
representation, the broadcaster also transmits a parametric
representation of the original multi-channel signal. Since
the energy (in one band and in one block) can (and typi-
cally will) vary between the master down-mix and the pa-
rameter down-mix, a relative level parameter rM is gener-
ated in block 900 and transmitted to the receiver as an ad-
ditional parameter. The level parameter is derived from the
master down-mix and the parameter down-mix and is prefera-
bly, a ratio between the energies within one block and one
band of the master down-mix and the parameter down-mix.
Generally, the level parameter is calculated as the ratio
of the sum of the energies (Eorig) of the original channels
and the energy of the downmix channel(s), wherein this
downmix channel (s) can be the parameter downmix (EPD) or
the master downmix (EMD) or any other downmix signal. Typi-
cally, the energy of the specific downmix signal is used,
which is transmitted from an encoder to a decoder.

Fig. 9b illustrates a decoder-side implementation of the
level parameter usage. The level parameter as well as the
down-mix signal are input into the level corrector
block 902. The level corrector corrects the single-base
channel or the several-base channels depending on the level
parameter. Since the additional parameter rM is a relative
value, this relative value is multiplied by the energy of
the corresponding base channel.
Although Figs. 9a and 9b indicate a situation, in which the
level correction is applied to the down-mix channel or the
down-mix channels, the level parameter can also be inte-
grated into the up-mixing matrix. To this end, each occur-
rence of M in the equations in Fig. 8 is replaced by the
term "rM M".
Studying the case when re-creating 5.1 channels from 2
channels, the following observation is made.
If the present invention is used with an underlying audio
codec as outlined in Fig 2 and Fig 3 205 and 302. some more
consideration needs to be made. Observing the IID parame-
ters as defined earlier where r1 was defined according to

this parameter is implicitly available on the decoder side
since the system is re-creating 5.1 channels from 2 chan-
nels, provided that the two transmitted channels is the
stereo downmix of the surround channels.
However, the audio codec operating under a bit rate con-
straint may modify the spectral distribution so that the L
and R energies as measured on the decoder differ from their
values on the encoder side. According to the present inven-
tion such influence on the energy distribution of the re-
created channels vanishes by transmitting the parameter


also for the case when reconstruction 5.1 channels from two
channels.
If signaling means are provided the encoder can code the
present signal segment using different parameter sets and
choose the set of IID parameters that give the lowest over-
head for the particular signal segment being processed. It
is possible that the energy levels between the right front
and back channels are similar, and that the energy levels
between the front and back left channel are similar but
significantly different to the levels in the right front
and back channel. Given delta coding of parameters and sub-
sequent entropy coding it can be more efficient to use pa-
rameters q3 and q4 instead of r3 and r4. For another signal
segment with different characteristics a different parame-
ter set may give a lower bit rate overhead. The present in-
vention allows to freely switching between different pa-
rameter representations in order to minimize the bit rate
overhead for the presently encoded signal segment given the
characteristics of the signal segment. The ability to
switch between different parameterizations of the IID pa-
rameters in order to obtain the lowest possible bit rate
overhead, and provide signaling means to indicate what
parameterization is presently used, is an essential feature
of the present invention.
Furthermore, the delta coding of the parameters can be done
in either the frequency direction or in the time direction,
as well as delta coding between different parameters. Ac-
cording to the present invention, a parameter can be delta
coded with respect to any other parameter, given that sig-
naling means are provided indicating the particular delta
coding used.
An interesting feature for any coding scheme is the ability
to do scalable coding. This means that the coded bitstream
can be divided into several different layers. The core

layer is decodable by itself, and the higher layers can be
decoded to enhance the decoded core layer signal. For dif-
ferent circumstances the number of available layers may
vary, but as long as the core layer is available the de-
coder can produce output samples. The parameterization for
the multi-channel coding as outlined above using the r1 to
r5 parameters lend them selves very well to scalable cod-
ing. Hence, it is possible to store the data for e.g. the
two surround channels (A and E) in an enhancement layer,
i.e. the parameters r3 and r4, and the parameters corre-
sponding to the front channels in a core layer, represented
by parameters r1 and r2.
In Fig. 10 a scalable bitstream implementation according to
the present invention is outlined. The bitstream layers are
illustrated by 1001 and 1002, where 1001 is the core layer
holding the wave-form coded downmix signals and the parame-
ters r1 and r2 required to re-create the front channels
(102, 103 and 104). The enhancement layer illustrated by
1002 holds the parameters for re-creating the back channels
(101 and 105).
Another important aspect of the present invention is the
usage of decorrelators in a multi-channel configuration.
The concept of using a decorrelator was elaborated on for
the one to two channel case in the PCT/SE02/01372 document.
However when extending this theory to more than two chan-
nels several problems arise that the present invention
solves.
Elementary mathematics show that in order to achieve M
mutually decorrelated signals from N signals, M-N
decorrelators are required, where all the different decor-
relators are functions that create mutually orthogonal out-
put signals from a common input signal. A decorrelator is
typically an allpass or near allpass filter that given an

input x(t)produces an output y(t)with E \y\ =E \x\ and almost
vanishing cross-correlation E[YX.] Further perceptual cri-
teria come in to the design of a good decorrelator, some
examples of design methods can be to also minimize the
comb-filter character when adding the original signal to
the decorrelated signal and to minimize the effect of a
sometimes too long impulse response at transient signals.
Some prior art decorrelators utilizes an artificial rever-
berator to decorrelate. Prior art also includes fractional
delays by e.g. modifying the phase of the complex subband
samples, to achieve higher echo density and hence more time
diffusion.
The present invention suggests methods of modifying a re-
verberation based decorrelator in order to achieve multiple
decorrelators creating mutually decorrelated output signals
from a common input signal. Two decorrelators are mutually
decorrelated if their outputs y1{t) and y2(t) have vanishing
or almost vanishing cross-correlation given the same input.
Assuming the input is stationary white noise it follows
that the impulse responses h1 and h2 must be orthogonal in
the sense that E[h1h2.]is vanishing or almost vanishing. Sets
of pair wise mutually decorrelated decorrelators can be
constructed in several ways. An efficient way of doing such
modifications is to alter the phase rotation factor q that
is part of the fractional delay.
The present invention stipulates that the phase rotation
factors can be part of the delay lines in the all-pass fil-
ters or just an overall fractional delay. In the latter
case this method is not limited to all-pass or reverbera-
tion like filters, but can also be applied to e.g. simple
delays including a fractional delay part. An all-pass fil-

ter link in the decorrelator can be described in the Z-
domain as:

where q is the complex valued phase rotation factor
(|q| = l), m is the delay line length in samples and a is the
filter coefficient. For stability reasons, the magnitude of
the filter coefficient has to be limited to |α| by using the alternative filter coefficient a=-a, a new
reverberator is defined having the same reverberation decay
properties but with an output significantly uncorrelated
with the output from the non-modified reverberator. Fur-
thermore, a modification of the phase rotation factor q ,
can be done by e.g. adding a constant phase offset, q' = qejC.
The constant C, can be used as a constant phase offset or
could be scaled in a way that it would correspond to a con-
stant time offset for all frequency bands it is applied on.
The phase offset constant C , can also be a random value
that is different for all frequency bands.
According to the present invention, the generation of n
channels from m channels is performed by applying an upmix
matrix H of size n×(m + p) to a column vector of size
(m+p)×1 of signals

wherein mare the m downmixed and coded signals, and
the p signals in s are both mutually decorrelated and
decorrelated from all signals in m . These decorrelated
signals are produced from the signals in m by decorrela-
tors. The n reconstructed signals a',b',...are then contained
in the column vector


The above is illustrated by Fig. 11, where the decorre-
lated signals are created by the decorrelators 1102, 1103
and 1104. The upmix matrix H is given by 1101 operating on
the vector y giving the output signal x' .
LetR = E[xx.] be the correlation matrix of the original sig-
nal vector let R' = E[x'x'.] be the correlation matrix of the
reconstructed signal. Here and in the following, for a ma-
trix or a vector X with complex entries, X. denotes the
adjoint matrix, the complex conjugate transpose of X.
The diagonal of R contains the energy values A,B,C,... and
can be decoded up to a total energy level from the energy
quotas defined above. Since R.=R, there are only n(n-1)/2
different off diagonal cross-correlation values containing
information that is to be reconstructed fully or partly by
adjusting the upmix matrix H. A reconstruction of the
full correlation structure corresponds to the case R'=R.
Reconstruction of correct energy levels only correspond to
the case where R'and Rare equal on their diagonals.
In the case of n channels from m=1channel, a reconstruc-
tion of the full correlation structure is achieved by us-
ing p=n-\ mutually decorrelated decorrelators an upmix
matrixHwhich satisfies the condition

where M is the energy of the single transmitted signal .
Since R is positive semidefinite it is well known that
such a solution exists. Moreover, n(n-1)/2 degrees of free-
dom are left over for the design of H, which are used in
the present invention to obtain further desirable proper-
ties of the upmix matrix. A central design criterion is
that the dependence of H on the transmitted correlation
data shall be smooth.

One convenient way of parametrizing the upmix matrix is
H = UDV where U and V are orthogonal matrices and D is a
diagonal matrix. The squares of the absolute values of
D can be chosen equal to the eigenvalues of R/M . Omit-
ting V and sorting the eigenvalues so that the largest
value is applied to the first coordinate will minimize the
overall energy of decorrelated signals in the output. The
orthogonal matrix U is in the real case parameterized by
n(n-1)/2 rotation angles. Transmitting correlation data in
the form of those angles and the n diagonal values of
D would immediately give the desired smooth dependence of
H. However since energy data has to be transformed into
eigenvalues, scalability is sacrificed by this approach.
A second method taught by the present invention, consists
of separating the energy part from the correlation part
in R by defining a normalized correlation matrix R0 by
R = GR0G where G is a diagonal matrix with the diagonal
values equal to the square roots of the diagonal entries
of R, that is, √A,√B..., and R0 has ones on the diagonal.
Let H0 be is an orthogonal upmix matrix defining the pre-
ferred normalized upmix in the case of totally uncorre-
lated signals of equal energy. Examples of such preferred
upmix matrices are

The upmix is then defined by H = GSH0/√M, where the ma-
trix S solves SS.=R0 . The dependence of this solution on

the normalized cross-correlation values in R0 is chosen to
be continuous and such that Sis equal to the identity ma-
trix I in the case R0=I.
Dividing the n channels into groups of fewer channels is a
convenient way to reconstruct partial cross-correlation
structure. According to the present invention, a particu-
lar advantageous grouping for the case of 5.1 channels
from 1 channel is {a,e},{c},{b,d},{f} , where no decorrelation
is applied for the groups {c},{f}, and the groups {a,e},{b,d}
are produced by upmix of the same downmixed/decorrelated
pair. For these two subsystems, the preferred normalized
upmixes in the totally uncorrelated case are to be chosen
as

respectively. Thus only two of the totality of 15 cross-
correlations will be transmitted and reconstructed, namely
those between channels {a,e) and {b,d} . In the terminology
used above, this is an example of a design for the case
n = 6, m = 1, and p = 1. The upmix matrix His of size
6 x 2 with zeros at the two entries in the second column at
rows 3 and 6 corresponding to outputs c'and f' .
A third approach taught by the present invention for in-
corporating decorrelated signals is the simpler point of

view that each output channel has a different decorrelator
giving rise to decorrelated signals sa,sb,... . The recon-
structed signals are then formed as

The parameters φa,,φb,... control the amount of decorrelated
signal present in output channels a',b',.... The correlation
data is transmitted in form of these angles. It is easy to
compute that the resulting normalized cross-correlation be-
tween, for instance, channel a' and V is equal to the
product coφacosφb. As the number of pairwise cross-
correlations is n(n-1)/2 and there are n decorrelators it
will not be possible in general with this approach to match
a given correlation structure if n>3, but the advantages
are a very simple and stable decoding method, and the di-
rect control on the produced amount of decorrelated signal
present in each output channel. This enables for the mixing
of decorrelated signals to be based on perceptual criteria
incorporating for instance energy level differences of
pairs of channels.
For the case of n channels from m>\ channels, the correla-
tion matrix Ry = E[yy*]can no longer be assumed diagonal,
and this has to be taken into account in the matching of
R' = HRyH. to the targetR . A simplification occurs, since
Ry has the block matrix structure


where Rm = E[mm.] and Rs = E[ss.]. Furthermore, assuming mutu-
ally decorrelated decorrelators, the matrix Rs is diago-
nal. Note that this also affects the upmix design with re-
spect to the reconstruction of correct energies. The solu-
tion is to compute in the decoder, or to transmit from the
encoder, information about the correlation structure Rm of
the downmixed signals.
For the case of 5.1 channels from 2 channels a preferred
method for upmix is

where s1 is obtained from decorrelation of m1=ld and s2 is ob-
tained from decorrelation of m2=rd .
Here the groups [a,b] and {d,e} are treated as separate
1—>2 channels systems taking into account the pairwise
cross-correlations. For channels cand f, the weights are
to be adjusted such that


The present invention can be implemented in both hardware
chips and DSPs, for various kinds of systems, for storage
or transmission of signals, analogue or digital, using ar-
bitrary codecs. Fig. 2 and Fig. 3 show a possible implemen-
tation of the present invention. In this example a system
operating on six input signals (a 5.1 channel configura-
tion) is displayed. In Fig.2 the encoder side is displayed
the analogue input signals for the separate channels are
converted to a digital signal 201 and analyzed using a fil-
terbank for every channel 202. The output from the filter-
banks is fed to the surround encoder 203 including a pa-
rameter generator that performs a downmix creating the one
or two channels encoded by the audio encoder 205. Further-
more, the surround parameters such as the IID and ICC pa-
rameters are extracted according to the present invention,
and control data outlining the time frequency grid of the
data as well as which parameterization is used is extracted
204 according to the present invention. The extracted pa-
rameters are encoded 206 as taught by the present inven-
tion, either switching between different parameterizations
or arranging the parameters in a scalable fashion. The sur-
round parameters 207, control signals and the encoded down
mixed signals 208 are multiplexed 209 into a serial bit-
stream.
In Fig. 3 a typical decoder implementation, i.e. an appara-
tus for generating multi-channel reconstruction is dis-
played. Here it is assumed that the Audio decoder outputs a
signal in a frequency domain representation, e.g. the out-
put from the MPEG-4 High efficiency AAC decoder prior to
the QMF synthesis filterbank. The serial bitstream is de-
multiplexed 301 and the encoded surround data is fed to the
surround data decoder 303 and the down mixed encoded chan-
nels are fed to the audio decoder 302, in this example an
MPEG-4 High Efficiency AAC decoder. The surround data de-
coder decodes the surround data and feeds it to the sur-
round decoder 305, which includes an upmixer, that recre-

ates six channels based on the decoded down-mixed channels
and the surround data and the control signals. The fre-
quency domain output from the surround decoder is synthe-
sized 306 to time domain signals that are subsequently con-
verted to analogue signals by the DAC 307.
Although the present invention has mainly been described
with reference to the generation and usage of balance pa-
rameters, it is to be emphasized here that preferably the
same grouping of channel pairs for deriving balance parame-
ters is also used for calculating inter-channel coherence
parameters or "width" parameters between these two channel
pairs. Additionally, inter-channel time differences or a
kind of "phase cues" can also be derived using the same
channel pairs as used for the balance parameter calcula-
tion. On the receiver-side, these parameters can be used in
addition or as an alternative to the balance parameters to
generate a multi-channel reconstruction. Alternatively, the
inter-channel coherence parameters or even the inter-
channel time differences can also be used in addition to
other inter-channel level differences determined by other
reference channels. In view of the scalability feature of
the present invention as discussed in connection with
Fig. 10a and Fig. 10b, it is, however, preferred to use the
same channel pairs for all parameters so that, in a scal-
able bit stream, each scaling layer includes all parameters
for reconstructing the sub-group of output channels, which
can be generated by the respective scaling layer as out-
lined in the penultimate column of the Fig. 10b table. The
present invention is useful, when only the coherence pa-
rameters or the time difference parameters between the re-
spective channel pairs are calculated and transmitted to a
decoder. In this case, the level parameters already exist
at the decoder for usage when a multichannel reconstruction
is performed.

Depending on certain implementation requirements of the in-
ventive methods, the inventive methods can be implemented
in hardware or in software. The implementation can be per-
formed using a digital storage medium, in particular a disk
or a CD having electronically readable control signals
stored thereon, which cooperate with a programmable com-
puter system such that the inventive methods are performed.
Generally, the present invention is, therefore, a computer
program product with a program code stored on a machine
readable carrier, the program code being operative for per-
forming the inventive methods when the. computer program
product runs on a computer. In other words, the inventive
methods are, therefore, a computer program having a program
code for performing at least one of the inventive methods
when the computer program runs on a computer.

WE CLAIM
1. Apparatus for generating a parameter representation of a multi-channel
input signal having original channels, the original channels including a left
channel (B), a right channel (D), a center channel (C), a rear left channel
(A), and a rear right channel (E), comprising:
a parameter generator (203) for generating a first balance parameter (r1),
a first coherence parameter or a first time difference parameter between a
first channel pair, and for generating a second balance parameter (r2),
between a second channel pair, and for generating a third balance
parameter (r3) between a third channel pair, the balance parameters,
coherence parameters or time parameters forming the parameter
representation,
wherein each channel of the two channel pair is one of the original
channels or a weight or unweighted combination of the original channels,
and
wherein the first balance parameter (r1) is a left/right balance parameter,
and wherein the first channel pair includes, as a first channel, a left-
channel or a left down-mix channel and, as a second channel, a right
channel, or a right down-mix channel,
wherein the second balance parameter (r2) is a center balance parameter
and the second channel pair includes, as a first channel, the center

channel or a channel combination of original channels including the center
channel, and, as a second channel, a channel combination including the
left channel land the right channel, and
wherein the third balance parameter (r3) is a front/back balance
parameter and the third channel pair has, as a first channel, a channel
combination including the rear-left channel and the rear-right channel
and, as a second channel, a channel combination including a left channel
and a right channel.
2. Apparatus as claimed in claim 1, wherein the original channels additional
comprise a low frequency enhancement channel and wherein the channel
combination of original channels having the center channel comprises a
combination of the center channel and the low frequency enhancement
channel.
3. Apparatus as claimed in claim 1, wherein the parameter generator is
operative to calculate the center balance parameter in accordance with
the following equation:

wherein r2 is the center balance parameter, wherein C represents the
centre channel, wherein B represents a left-channel, wherein D represents
a right channel, and wherein γ and a represents down-mixing factors.
4. Apparatus in as claimed in claim 1, wherein the parameter generator is

operative to calculate the first balance parameter in accordance with the
following equation:

wherein r1 is the first balance parameter, wherein L is a first down-mix
channel, wherein R is a second down-mix channel, wherein B represents
the left-channel, wherein D represents the right-channel, wherein A
represents the rear-left channel, wherein E represents the rear-right
channel, wherein C represents the center channel, wherein F represents a
low-frequency enhancement channel, and wherein a, p, 7 and δ are down-
mixing factors.
5. Apparatus as claimed in claim 1, wherein the parameter generator is
operative to calculate the front/back parameter (r3) on the following
equation:

wherein r3 is the front/back balance parameter, wherein A is a rear-left
channel, wherein E is a rear-right channel, wherein B represents a left-
channel, wherein D represents a right channel, wherein C represents a
center channel, and wherein α,β, and γ represent down-mixing
parameters.
6. Apparatus as claimed in one of the preceding claims, wherein the
parameter generator a back/right balance parameter (r4) between a back

left/right channel pair having, as the first channel, the rear-left channel
and, as a second channel, the rear-right channel.
7. Apparatus as claimed in one of the preceding claim, wherein the original
multi-channel signal additionally comprises a low-frequency enhancement
channel,
wherein the parameter generator is operative to generate, as an
additional balance parameter a low-frequency enhancement balance
parameter (r5) between a low-frequency enhancement channel pair
having, as a first channel, the low-frequency enhancement channel, and
as a second channel, the center channel or a channel combination
comprising the center channel and a left and a right channel of the
original channels.
8. Apparatus as claimed in claim 7, wherein the parameter generator is
operative to calculate the low-frequency enhancement balance parameter,
in accordance with the following equation:

wherein A corresponds to the rear-left channel, wherein E corresponds to
the rear-right channel, wherein B corresponds to the left channel, wherein
D corresponds to the right channel, wherein C corresponds to the center
channel, wherein F corresponds to the low-frequency enhancement
channel, wherein α,β,γ and δ are down-mixing factors, and wherein r5 is
the low-frequency enhancement balance parameter.

9. Apparatus as claimed in one of the preceding claims, comprising a data
stream generator for generating a scalable data stream (1001,1002), the
data stream generator being operative to enter the first or the second
balance parameters into a lower scaling layer and any further parameter
in a higher scaling layer.
10.Apparatus as claimed in claim 9, wherein the parameter generator is
operative to generate one or more balance parameters in addition to the
first or the second balance parameters, and wherein the data stream
generator is operative for entering the one or more additional balance
parameters into a single or a plurality of higher scaling layers.
11.Apparatus as claimed in claim 10, wherein the data stream generator is
operative to introduce each additional parameter into a dedicated scaling
layer.
12.Apparatus as claimed in one of the preceding claims, wherein the
parameter generator is operative to generate as a forth balance
parameter, a rear-left/right balance parameter, and as a fifth balance
parameter, a low-frequency enhancement balance parameter, and
wherein the data stream generator is operative to enter the first and
second balance parameters into a lower scaling layer and to enter the
third to forth balance parameters or corresponding coherence parameters
or corresponding time difference in one or more higher scaling layers.
13. Apparatus as claimed in one of the preceding clams, wherein the

parameter generator is operative to generate, as an additional balance
parameter, at least one single-sided front/back balance parameter (q3 q4)
between a single-sided front/back channel pair having, as a first channel,
a rear-left channel and, as a second channel, a left channel or, as a first
channel, a rear-right channel and, as a second channel, a right channel.
14. Apparatus as claimed in one of the preceding claims, comprising:
a parameter encoder for generating an encoded version of the balance
parameters, the coherence parameters, or the inter-channel time
differences, the parameter encoder including the quantizer.
15. Apparatus as claimed in one of the preceding claims, wherein
the parameter generator is operative to generate difference sets of
parameters, each set including at least two parameters, wherein channel
pairs used for calculating the parameters in the different sets are different
from each other, and
wherein the parameter generator is operative to select one set of the
different sets for output, which results in a lower bit rate given a certain
parameter-coding scheme,
the apparatus comprising a parameter encoder for encoding the selected
set using a certain parameter coding scheme; and
a parameter control information generator for generating control

information indicating a characteristic of the selected parameter scheme.
16. Apparatus for generating a reconstructed multi-channel representation of
an original multi-channel signal having original channels the original
channels including a left channel (B), a right channel (D), a center channel
(C), a rear left channel (A), and a rear right channel (E), using one or
more base channels generating by converting the original multi-channel
signal using a down-mix scheme, and using a first balance parameter,
between a first channel pair, a second balance parameter between a
second channel pair, and a third balance parameter between a third
channel pair, wherein the first balance parameter (r1) is a left/right
balance parameter, and wherein the first channel pair comprises, as a first
channel, a left-channel or a left down-mix channel and, as a second
channel, a right channel, or a right down-mix channel, wherein the second
balance parameter (r2) is a center balance parameter and the second
channel pair includes, as a first channel, the center channel or a channel
combination of original channels having the center channel, and, as a
second channel, a channel combination comprising the left channel and
the right channel, and wherein the third balance parameter (r3) is a
front/back balance parameter and the third channel pair has, as a first
channel, a channel combination comprising the rear-left channel and the
rear-right channel and, as a second channel, a channel combination
including a left channel and a right channel, the apparatus comprising:
an up-mixer (305) for generating a number of up-mix channels, the
number of up-mix channels being greater than the number of base
channels and smaller than or equal to a number of original channels,

wherein the up-mixer is operative to generate reconstructed channels
based on information on the down-mixing scheme and using the first,
second, and third balance parameters,
wherein the up-mixer is operative to generate a reconstructed center
channel based on the second balance parameter (r2),
wherein the up-mixer is operative to generate a reconstructed left channel
and a reconstructed right channel based on the first parameter (r1), and
wherein the up-mixer is operative to reconstruct rear channels using the
front/back balance parameter (r3).
17.Apparatus as claimed in claim 16, wherein the parameter representation
comprises as an additional balance parameter a back left/right balance
parameter (r4) between a back left/right channel pair having, as the first
channel, the rear-left channel and, as a second channel, the rear-right
channel, and
wherein the up-mixer is operative to generate a reconstructed rear-left
channel and a constructed rear-right channel based on the back left/right
balance parameter.
18. Apparatus as claimed in claim 16 or 17,
wherein the parameter information comprises as a fifth balance
parameter, a low-frequency enhancement balance parameter, and in

which a data stream comprises the first and second balance parameters in
a lower scaling layer and the third and fourth balance parameters or
corresponding coherence parameters or corresponding time differences in
one or more higher scaling layers, and
wherein the up-mixer is operative to use the first balance parameter and
the second balance parameter for generating a left output channel, a right
output channel, and an output channel including the center channel, or
wherein the up-mixer is operative to additionally use the front/back
balance parameter for additionally reconstructing a sum between the rear-
left channel and the rear-right channel; or
wherein the up-mixer is operative to use, in addition, the rear left/right
balance parameter for reconstructing a rear left channel and a rear right
channel.
19.Apparatus as claimed in claim 18, wherein the up-mixer is operative to
generate the reconstructed multi-channel signal such that the following
equations are fulfilled:



wherein F corresponds to a low-frequency enhancement channel, wherein
A corresponds to a left surround channel, wherein E corresponds to a
right surround channel, wherein C corresponds to a center channel,
wherein B corresponds to a left channel, wherein D corresponds to a right
channel, wherein r1 is a left/right balance parameter, wherein r2 is a
center/left-right balance parameter, wherein r3 is a front/right balance
parameter, wherein r4 is a rear left/right balance parameter, wherein r5 is
a center/low frequency enhancement balance parameter, and wherein a,
β,γ and δ are down-mixing factors.
20. Apparatus as claimed in one of claims 16 to 19,
wherein the number of base channels is greater than or equal to two, and
in which the parameter representation includes, as an additional balance
parameter at least one single-side front/back balance parameter (q3, q4)
between a single-sided front/back channel pair having, as a first channel,
a rear-left channel and, as a second channel, a left channel or, as a first
channel, a rear-right channel and, as a second channel, a right channel,
and

wherein the up-mixer is operative to generate a reconstructed rear left or
a reconstructed rear right channel based on a left channel or a right
channel and the corresponding single-sided front/back balance parameter.
21. Apparatus as claimed in one of claims 16 to 20,
wherein the balance parameters are part of a scalable bit stream having,
in the lower scaling layer, the first and the second balance parameters
and, in at least one higher scaling layer, at least one additional balance
parameter, and
which comprises a data stream extractor for extracting the lower scaling
layer and a number of higher scaling layers, the number of higher scaling
layers being between 0 and a number smaller than a full number of
scaling layers, and
wherein the data stream extractor is operative to extract the number of
higher scaling layers depending on an output channel configuration
associated with the apparatus, the channel configuration having fewer
channels than a channel configuration of the original multi-channel signal.
22. Apparatus as claimed in one of claims 16 to 21, comprising:
a parameter scheme selector for controlling the up-mixer such that the
up-mixer applies a parameter scheme indicated by a parameter scheme
control information.

23. Method of generating a parameter representation of a multi-channel input
signal having original channels, the original channels including a left
channel (B), a right channel (D), a center channel (C), a rear left channel
(A), and rear right channel (E), comprising:
generating (203) a first balance parameter, wherein the first balance
parameter (r1) is a left/right balance parameter, and wherein the first
channel pair includes, as a first channel, a left-channel or a left down-mix
channel and, as a second channel, a right channel, or a right down-mix
channel,
generating a second balance parameter, wherein the second balance
parameter (r2) as a first channel, the center channel or a channel
combination of original channels including the center channel, and, as a
second channel, a channel combination including the left channel and the
right channel,
generating a third balance parameter, wherein the third balance
parameter (r3) is a front/back balance parameter and the third channel
pair has, as a first channel, a channel combination including the rear-left
channel and the rear-right channel and, as a second channel, a channel
combination including a left channel and a right channel, and
wherein each channel of the two channel pair is one of the original
channels, a weighted or unweighted combination of the original channels,
a downmix channel, or a weighted or unweighted combination of at least
two downmix channels.

24. Method of generating a reconstructed multi-channel representation of an
original multi-channel signal having original channels, the original
channels including a left channel (B), a right channel (D), a center channel
(C), a rear left channel (A), and a rear right channel (E), using one or
more base channels generating by converting the original multi-channel
signal using a down-mix scheme, and using a fist balance parameter,
between a first channel pair, a second balance parameter between a
second channel pair, and a third balance parameter between a third
channel pair, wherein the first balance parameter (r1) is a left/right
balance parameter, and wherein the first channel pair includes, as a first
channel, a left-channel or a left down-mix channel and, as a second
channel, a right channel, or a right down-mix channel, wherein the second
balance parameter (r2) is a center balance parameter and the second
channel pair comprises, as a first channel, the center channel or a channel
combination of original channels having the center channel, and, as a
second channel, a channel combination comprising the left channel and
the right channel, and wherein the third balance parameter (r3) is a
front/back balance parameter and the third channel pair having, as a first
channel, a channel combination comprising the rear-left channel and the
rear-right channel and, as a second channel, a channel combination
comprising a left channel and a right channel, the method comprising:
generating (305) a number of up-mix channels, the number of up-mix
channels being greater than the number of base channels and smaller
than or equal to a number of original channels,
wherein the step of generating comprises generating reconstructed

channels based on information on the down-mixing scheme and using
first, second and third balance parameter, by generating a reconstructed
center channel based on the second balance parameter (r2), by
generating a reconstructed left channel and a reconstructed right channel
based on the first parameter (r1), and by reconstructing rear channels
using the front/back balance parameter (r3).
25. Apparatus for generating a parameter representation of a multi-channel
input signal having at least three original channels, comprising:
a parameter generator (203) for generating a first balance parameter, a
first coherence parameter or a first time difference parameter between a
first channel pair, and for generating a second balance parameter, a
second coherence parameter or a second time parameter between a
second channel pair, the balance parameters, coherence parameters or
time parameters forming the parameter representation,
wherein the first channel pair has two channels, which are different from
two channels of the second channel pair, and
wherein each channel of the two channel pair is one of the original
channels, a weighted or unweighted combination of the original channels,
a downmix channel, or a weighted or unweighted combination of at least
two downmix channels, and
wherein the first channel pair and the second channel pair include
information on the three original channels,

wherein the parameter generator is operative to generate different sets of
parameters, each set including at least two parameters, wherein channel
pairs used for calculating the parameters in the different sets are different
from each other,
wherein the parameter generator is operative to select one set of the
different sets for output for a presently encoded signal segment, which
results in a lower bit rate given a certain parameter-coding scheme,
wherein the apparatus further comprises:
a parameter encoder for encoding the selected set using a certain
parameter coding scheme; and
a parameter control information generator for generating control
information indicating a characteristic of the selected parameter scheme,
and
wherein the control information (204) signaling the selected parameter
scheme is indicated into an output bit stream.
26. Apparatus as claimed in claim 25, wherein the original channels comprise
a left channel (B), a right channel (D) and a center channel (C), and
Wherein the second balance parameter (r2) is a center balance parameter
and the second channel pair includes, as a first channel, the center

channel and, as a second channel, a channel combination including the
left channel and the right channel.
27.Apparatus as claimed in claim 26, wherein the parameter generator is
operative to calculate the center balance parameter in accordance with
the following equation:

wherein r2 is the center balance parameter, wherein C represents the
centre channel, wherein B represents a left-channel, wherein D represents
a right channel, and wherein γ and a represent down-mixing factors.
28. Apparatus as claimed in claim 25, wherein the first balance parameter (r1)
is a left/right balance parameter, and wherein the first channel pair
includes, as a first channel, a left-channel or a left down-mix channel and,
as a second channel, a right channel, or a right down-mix channel.
29.Apparatus as claimed in claim 28, wherein the parameter generator is
operative to calculate the first balance parameter in accordance with the
following equation:

wherein r1 is the first balance parameter, wherein L is a first down-mix
channel, wherein R is a second down-mix channel, wherein B represents a

left-channel, wherein D represents a right-channel, wherein A represents
a rear-left channel, wherein E represents a rear-right channel, wherein C
represents a center channel, wherein F represents a low-frequency
enhancement channel, and wherein α,β,γ and δ are down - mixing
factors.
30. Apparatus as claimed in claim 25, wherein the original channels comprise
a rear-left channel (A) and a rear-right channel (E), and
wherein the parameter generator is operative to generate, as a third
balance parameter (r3) or as one of the first and second balance
parameters a front/back parameter between a front/back channel pair
having, as a first channel, a channel combination comprising the rear-left
channel and the rear-right channel and, as a second channel, a channel
combination comprising a left channel and a right channel.
31.Apparatus as claimed in claim 30, wherein the parameter generator is
operative to calculate the front/back parameter (r3) based on the
following equation:

wherein r3 is the front/back balance parameter, wherein A is a rear-left
channel, wherein E is a rear-right channel, wherein B represents a left-
channel, wherein D represents a right channel, wherein C represents a
center channel, and wherein α, β, and γ represent down-mixing
parameters.

32.Apparatus as claimed in claim 25, wherein the original multi-channel
signal comprises a rear-left channel and a rear-right channel, and
wherein the parameter generator is operative to generate, as an
additional balance parameter or as the first or the second balance
parameter a back left/right balance parameter (r4) between a back
left/right channel pair having, as the first channel, the rear-left channel
and, as a second channel, the rear-right channel.
33.Apparatus as claimed in claim 25, wherein the original multi-channel
signal comprises a low-frequency enhancement channel and a center
channel, and
the parameter generator is operative to generate, as an additional balance
parameter or as the first or the second balance parameters as low-
frequency enhancement balance parameter between a low-frequency
enhancement channel pair having, as a first channel, the low-frequency
enhancement channel, and as a second channel, the center channel or a
channel combination comprising the center channel and a left and a right
channel of the original channels.
34.Apparatus as claimed in claim 33, wherein the parameter generator is
operative to calculate the low-frequency enhancement balance parameter,
in accordance with the following equation:


wherein A corresponds to a rear-left channel, wherein E corresponds to a
rear-right channel, wherein B corresponds to a left channel, wherein D
corresponds to a right channel, wherein C corresponds to a center
channel, wherein F corresponds to the low-frequency enhancement
channel, wherein α,β,γ and δ are down-mixing factors, and wherein r5 is
the low-frequency enhancement balance parameter.
35.Apparatus as claimed in claim 25, wherein the parameter generator is
operative to generate, as one of the first and second balance parameters
or as an additional balance parameter at least one single-sided front/back
balance parameter (q3, q4) between a single-sided front/back channel pair
having, as a first channel, a rear-left channel and, as a second channel, a
left channel or, as a first channel, a rear-right channel and, as a second
channel, a right channel.
36. Apparatus as claimed in claim 25, wherein one of the first and second
balance parameters is a first left or right balance parameter and the
channel pair includes, as a first channel, a left down-mix channel and, as
a second channel, a left original channel, or a rear-left original channel,
or,
wherein one of the first and second balance parameter is a right balance
parameter and the channel pair includes as a firs channel, a right down-
mix channel and, as a second channel, a right original channel or a rear-
right original channel, or
wherein one of the first or second balance parameters or an additional

balance parameter is a center balance parameter and the channel pair
comprises as a first channel, a sum of the left and right down-mix
channels and, as a second channel, an original center channel.
37.Apparatus as claimed in claim 36, wherein the parameter generator is
operative to generate, as a first balance parameter, a left balance
parameter, as a second balance parameter, a right balance parameter, as
a third balance parameter, a center balance parameter.
38.Apparatus as claimed in claim 36, wherein the parameter generator is
operative to generate, as a forth balance parameter, a left/left surround
balance parameter and, as a fifth balance parameter, a right/right
surround balance parameter.
39.Apparatus as claimed in claim 25, wherein the parameter encoder
comprises a quantizer.
40.Apparatus as claimed in claim 25, wherein the parameter generator is
operative to only use original channels or combinations of original
channels rather than a base channel or a combination of base channels as
channels within the channel pairs.
41.Apparatus as claimed in claim 25, wherein the parameter encoder is
configured for performing delta coding of the selected set and subsequent
entropy coding.
42. Apparatus as claimed in claim 41, wherein the delta coding is done in

either a frequency direction or a time direction or between different
parameters, and
wherein the apparatus is configured to provide signaling means indicating
a particular delta coding used.
43.Apparatus for generating a reconstructed multi-channel representation of
an original multi-channel signal having at least three original channels, the
apparatus using a number of base channels generated by converting the
original multi-channel signal using a down-mix scheme, the apparatus
using a first balance parameter, a first coherence parameter or a first time
difference parameter between a first channel pair, and for generating a
second balance parameter, a second coherence parameter or a second
time parameter between a second channel pair, the balance parameters,
coherence parameters or time parameters forming the parameter
representation, wherein the first channel pair has two channels, which are
different from two channels of the second channel pair, and wherein each
channel of the two channel pair is one of the original channels, a weighted
or unweighted combination of the original channels, a downmix channel,
or a weighted or unweighted combination of at least two downmix
channels, and wherein the first channel pair and the second channel pair
comprise information on the three original channels, and the apparatus
furthermore using control information (304) signaling the selected
parameter scheme, the apparatus comprising:
an up-mixer (305) for generating a number of up-mix channels, the

number of up-mix channels being greater than the number of base
channels and smaller than or equal to a number of original channels,
wherein the up-mixer is operative to generate reconstructed channels
based on information on the down-mixing scheme and using the balance
parameters, the coherence parameters, or the inter-channel time
differences such that a balance or coherence or inter-channel time
difference between a first channel pair is determined based on the first
balance parameter, the first inter-channel coherence parameter, or the
first inter-channel time difference, and a balance, an inter-channel
coherence, or an inter-channel level difference between a second channel
pair is determined based on the second balance parameter, the second
inter-channel coherence parameter, or the second inter-channel time
difference parameter,
wherein the apparatus comprises a parameter scheme selector for
controlling the up-mixer (305) such that the up-mixer applies a parameter
scheme indicated by a parameter scheme control information.
44. Apparatus as claimed in claim 43, wherein the original channels comprise
a left channel (B), a right control channel (D) and a centre channel (C),
and wherein the second balance parameter (r2) is a centre balance
parameter and the second channel pair comprises, as a first channel, the
centre channel and, as a second channel, a channel combination including
the left channel and the right channel, and
wherein the up-mixer is operative to generate a reconstructed center

channel based on the second balance parameter (r2).
45. Apparatus as claimed in claim 43, wherein the first balance parameter (r1)
is a left/right balance parameter, and wherein the first channel pair
comprises, as a first channel, a left-channel or a left down-mix channel
and, as a second channel, a right channel, or a right down-mix channel,
and
wherein the up-mixer is operative to generate a reconstructed left channel
and a reconstructed right channel based on the first parameter (r1).
46.Apparatus as claimed in claim 43, wherein the original channels comprise
a rear-left channel (A) and a rear-right channel (E), in which the
parameter representation comprises as a third balance parameter (r3) or
as one of the first and second balance parameters a front/back parameter
between a front/back channel pair having, as a first channel, a channel
combination comprising the rear-left channel and the rear-right channel
and, as a second channel, a channel combination comprising a left
channel and a right channel, and
wherein the up-mixer is operative to generate a reconstructed combined
rear channel using the front/back balance parameter (r3).
47.Apparatus as claimed in claim 43, wherein the original multi-channel
signal comprises a rear-left channel and a rear-right channel, the
parameter representation comprising as an additional balance parameter

or as the first or the second balance parameter a back left/right balance
parameter (r4) between a back left/right channel pair having, as the first
channel, the rear-left channel and, as a second channel, the rear-right
channel, and
wherein the up-mixer is operative to generate a reconstructed rear-left
and a reconstructed rear-right channel based on the back left/right
balance parameter.
48.Apparatus as claimed in claim 43, wherein a parameter information
provided to the apparatus comprises, as the first balance parameter, a
left/right balance parameter, as the second balance parameter, a centre
balance parameter, as a third balance parameter, a front/back balance
parameter, as a forth balance parameter, a rear-left/right balance
parameter, and as a fifth balance parameter, a low-frequency
enhancement balance parameter, and in which a data stream comprises
the first and second balance parameters in a lower scaling layer and the
third and fourth balance parameters or corresponding coherence
parameters or corresponding time differences in one or more higher
scaling layers, and
wherein the up-mixer is operative to use the first balance parameter and
the second balance parameter for generating a left output channel, a right
output channel, and an output channel comprising the center channel, or
wherein the up-mixer is operative to additionally use the front/back

balance parameter for additionally reconstructing a sum between the rear-
left channel and the rear-right channel; or
wherein the up-mixer is operative to use, in addition, the rear left/right
balance parameter for reconstructing a rear left channel and a rear right
channel.
49.Apparatus as claimed in claim 48, wherein the up-mixer is operative to
generate the reconstructed multi-channel signal such that the following
equations are fulfilled:

wherein F corresponds to a low-frequency enhancement channel, wherein
A corresponds to a left surround channel, wherein E corresponds to a
right surround channel, wherein C corresponds to a center channel,

wherein B corresponds to a left channel, and wherein D corresponds to a
right channel, wherein r1 is a left/right balance parameter, wherein r2 is a
center/left-right balance parameter, wherein r3 is a front/right balance
parameter, wherein r4 is a rear left/right balance parameter, wherein r5 is
a center/low frequency enhancement balance parameter, and wherein α,
β,γ and δ are down-mixing factors.
50. Apparatus as claimed in claim 43,
wherein the number of base channels is greater than or equal to two,
wherein the parameter representation comprises, as one of the first and
second balance parameters or as an additional balance parameter at least
one single-sided front/back balance parameter (q3,q4) between a single-
sided front/back channel pair having, as a first channel, a rear-left channel
and, as a second channel, a left channel or, as a first channel, a rear-right
channel and, as a second channel, a right channel, and
wherein the up-mixer is operative to generate a reconstructed rear left or
a reconstructed rear right channel based on a left channel or a right
channel and the corresponding single-sided front/back balance parameter.
51.Apparatus as claimed in claim 43, wherein one of the first and second
balance parameters is a first left or right balance parameter and the
channel pair comprises, as a first channel, a left down-mix channel and,
as a second channel, a left original channel, or a rear-left original channel,
or,

wherein one of the first and second balance parameter is a right balance
parameter and the channel pair includes as a first channel, a right down-
mix channel and, as a second channel, a right original channel or a rear-
right original channel, or
wherein one of the first or second balance parameters or an additional
balance parameter is a center balance parameter and the channel pair
comprising, as a first channel, a sum of the left and right down-mix
channels and, as a second channel, an original center channel, and
wherein the up-mixer is operative to generate the reconstructed channels
using the parameters and the first base channel, the second base channel,
or a combination of the first and second base channels.
52. Apparatus as claimed in claim 43, wherein the parameter scheme either
comprises q3 and q4 or, instead, r3 and r4,
wherein q3 is a first single-sided front/back balance parameter (q3, q4)
between a single-sided front/back channel pair having, as a first channel,
a rear-left channel and, as a second channel, a left channel,
wherein q4 is a second single-sided front/back balance parameter between
a single-sided front/back channel pair having or, as a first channel, a rear-
right channel and, as a second channel, a right channel,
wherein r3 is a front/right balance parameter, and
wherein r4 is a rear left/right balance parameter.

53. Method of generating a parameter representation of a multi-channel input
signal having at least three original channels, comprising:
generating (203) a first balance parameter, a first coherence parameter or
a first time difference parameter between a first channel pair, and
generating a second balance parameter, a second coherence parameter or
a second time parameter between a second channel pair, the balance
parameters, coherence parameters or time parameters forming the
parameter representation,
wherein the first channel pair has two channels, which are different from
two channels of the second channel pair, and
wherein each channel of the two channel pair is one of the original
channels, a weighted or unweighted combination of the original channels,
a donwmix channel, or a weighted or unweighted combination of at least
two downmix channels, and
wherein the first channel pair and the second channel pair comprise
information on the three original channels,
wherein different sets of parameters are generated, each set comprising
at least two parameters, wherein channel pairs used for calculating the
parameters in the different sets are different from each other,
wherein one set of the different sets is selected for output for a presently
encoded signal segment, which results in a lower bit rate given a certain
parameter scheme,

wherein the method comprising:
encoding the selected set using a certain parameter coding scheme, and
generating control information indicating a characteristic of the selected
parameter scheme, and
wherein the control information signaling the selected parameter scheme
is comprised into an output bit stream.
54. Method of generating a reconstructed multi-channel representation of an
original multi-channel signal having at least three original channels, the
method using a number of base channels generating by converting the
original multi-channel signal using a down-mix scheme, the method
furthermore using a first balance parameter, a first coherence parameter
or a first time difference parameter between a first channel pair, and for
generating a second balance parameter, a second coherence parameter or
a second time parameter between a second channel pair, the balance
parameter, coherence parameter or time parameters forming the
parameter representation, wherein the first channel pair has two
channels, which are different from two channels of the second channel
pair, and wherein each channel of two channel pair is one of the original
channels, a weighted or unweighted combination of the original channels,
a downmix channel, or a weighted or unweighted combination of at least
two downmix channels, and wherein the first channel pair and the second
channel pair comprise information on the three original channels, and the
method furthermore using parameter scheme control information

signaling the parameter scheme selected for a signal segment, the
method comprising:
generating (305) a number of up-mix channels, the number of up-mix
channels being greater than the number of base channels and smaller
than or equal to a number of original channels,
wherein the step of generating (305) is controlled such that the selected
parameter scheme indicated by a parameter scheme control is applied for
the signal segment,
wherein the step of generating comprises generating reconstructed
channels based on information on the down-mixing scheme and using the
balance parameters, the coherence parameters, or the inter-channels time
differences such that a balance or coherence or inter-channel time
difference between a first channel pair is determined based on the first
balance parameter, the first inter-channel coherence parameter, or the
first interOchannel time difference, and a balance, an inter-channel
coherence, or an inter-channel level difference between a second channel
pair is determined based on the second balance parameter, the second
inter-channel coherence parameter, or the second inter-channel time
difference parameter.



ABSTRACT


"APPARATUSES AND METHODS FOR GENERATING A PARAMETER
REPRESENTATION OF A MULTI-CHANNEL INPUT SIGNAL"
The invention relates to an apparatus for generating a parameter representation of a
multi-channel input signal having original channels, the original channels including a left
channel (B), a right channel (D), a center channel (C), a rear left channel (A), and a rear
right channel (E), comprising a parameter generator (203) for generating a first balance
parameter (r1), a first coherence parameter or a first time difference parameter between
a first channel pair, and for generating a second balance parameter (r2), between a
second channel pair, and for generating a third balance parameter (r3) between a third
channel pair, the balance parameters, coherence parameters or time parameters
forming the parameter representation, wherein each channel of the two channel pair is
one of the original channels or a weight or unweighted combination of the original
channels, and wherein the first balance parameter (r1) is a left/right balance parameter,
and wherein the first channel pair includes, as a first channel, a left-channel or a left
down-mix channel and, as a second channel, a right channel, or a right down-mix
channel, wherein the second balance parameter (r2) is a center balance parameter and
the second channel pair includes, as a first channel, the center channel or a channel
combination of original channels including the center channel, and, as a second channel,
a channel combination including the left channel land the right channel, and wherein the
third balance parameter (r3) is a front/back balance parameter and the third channel
pair has, as a first channel, a channel combination including the rear-left channel and
the rear-right channel and, as a second channel, a channel combination including a left
channel and a right channel.

Documents:

03004-kolnp-2006 abstract.pdf

03004-kolnp-2006 claims.pdf

03004-kolnp-2006 correspondence others.pdf

03004-kolnp-2006 description(complete).pdf

03004-kolnp-2006 drawings.pdf

03004-kolnp-2006 form-1.pdf

03004-kolnp-2006 form-2.pdf

03004-kolnp-2006 form-3.pdf

03004-kolnp-2006 form-5.pdf

03004-kolnp-2006 international publication.pdf

03004-kolnp-2006 international search authority report.pdf

03004-kolnp-2006 others.pdf

03004-kolnp-2006 pct request form.pdf

03004-kolnp-2006 priority document.pdf

03004-kolnp-2006-correspondence-1.1.pdf

03004-kolnp-2006-correspondence-1.2.pdf

03004-kolnp-2006-form-26.pdf

3004-KOLNP-2006-(01-11-2011)-CORRESPONDENCE.pdf

3004-KOLNP-2006-(01-11-2011)-EXAMINATION REPORT REPLY RECIEVED.PDF

3004-KOLNP-2006-(01-11-2011)-PA.pdf

3004-KOLNP-2006-(03-10-2012)-AMANDED PAGES OF SPECIFICATION.pdf

3004-KOLNP-2006-(03-10-2012)-CORRESPONDENCE.pdf

3004-KOLNP-2006-(03-10-2012)-DRAWINGS.pdf

3004-KOLNP-2006-(03-10-2012)-FORM-1.pdf

3004-KOLNP-2006-(03-10-2012)-FORM-2.pdf

3004-KOLNP-2006-(03-10-2012)-FORM-3.pdf

3004-KOLNP-2006-(03-10-2012)-FORM-5.pdf

3004-KOLNP-2006-(12-01-2012)-ABSTRACT.pdf

3004-KOLNP-2006-(12-01-2012)-AMANDED CLAIMS.pdf

3004-KOLNP-2006-(12-01-2012)-CORRESPONDENCE.pdf

3004-KOLNP-2006-(12-01-2012)-DESCRIPTION (COMPLETE).pdf

3004-KOLNP-2006-(12-01-2012)-DRAWINGS.pdf

3004-KOLNP-2006-(12-01-2012)-FORM 1.pdf

3004-KOLNP-2006-(12-01-2012)-FORM 2.pdf

3004-KOLNP-2006-(12-01-2012)-OTHERS.pdf

3004-KOLNP-2006-(12-01-2012)-PETITION UNDER RULE 137.pdf

3004-KOLNP-2006-(15-11-2012)-CORRESPONDENCE.pdf

3004-KOLNP-2006-(15-11-2012)-OTHERS.pdf

3004-KOLNP-2006-(30-03-2012)-CERTIFIED COPIES(OTHER COUNTRIES).pdf

3004-KOLNP-2006-(30-03-2012)-CORRESPONDENCE.pdf

3004-KOLNP-2006-(30-03-2012)-FORM-13-1.pdf

3004-KOLNP-2006-(30-03-2012)-FORM-13.pdf

3004-KOLNP-2006-(30-03-2012)-PA-CERTIFIED COPIES.pdf

3004-KOLNP-2006-ABSTRACT 1.1.pdf

3004-KOLNP-2006-AMANDED CLAIMS.pdf

3004-KOLNP-2006-CANCELLED PAGES.pdf

3004-KOLNP-2006-CORRESPONDENCE 1.1.pdf

3004-KOLNP-2006-CORRESPONDENCE 1.2.pdf

3004-KOLNP-2006-CORRESPONDENCE 1.3.pdf

3004-KOLNP-2006-EXAMINATION REPORT.pdf

3004-KOLNP-2006-FORM 13.pdf

3004-kolnp-2006-form 18.pdf

3004-KOLNP-2006-FORM 26.pdf

3004-KOLNP-2006-GPA.pdf

3004-KOLNP-2006-GRANTED-ABSTRACT.pdf

3004-KOLNP-2006-GRANTED-CLAIMS.pdf

3004-KOLNP-2006-GRANTED-DESCRIPTION (COMPLETE).pdf

3004-KOLNP-2006-GRANTED-DRAWINGS.pdf

3004-KOLNP-2006-GRANTED-FORM 1.pdf

3004-KOLNP-2006-GRANTED-FORM 2.pdf

3004-KOLNP-2006-GRANTED-FORM 3.pdf

3004-KOLNP-2006-GRANTED-FORM 5.pdf

3004-KOLNP-2006-GRANTED-SPECIFICATION-COMPLETE.pdf

3004-KOLNP-2006-INTERNATIONAL PUBLICATION.pdf

3004-KOLNP-2006-INTERNATIONAL SEARCH REPORT & OTHERS.pdf

3004-KOLNP-2006-MARKED COPY.pdf

3004-KOLNP-2006-MISCLLENIOUS.pdf

3004-KOLNP-2006-PETITION UNDER RULE 137.pdf

3004-KOLNP-2006-REPLY TO EXAMINATION REPORT.pdf

abstract-03004-kolnp-2006.jpg


Patent Number 255436
Indian Patent Application Number 3004/KOLNP/2006
PG Journal Number 08/2013
Publication Date 22-Feb-2013
Grant Date 21-Feb-2013
Date of Filing 18-Oct-2006
Name of Patentee DOLBY INTERNATIONAL AB
Applicant Address APOLLO BUILDING, 3E HERIKERBERGWEG 1-35, 1101 CN, AMSTERDAM ZUID-OOST, NETHERLANDS
Inventors:
# Inventor's Name Inventor's Address
1 HEIKO PURNHAGEN GJUTERIBACKEN 17, SWEDEN-17265, SUNDBYBERG, SWEDEN
2 JONAS ENGDEGARD WENSTROEMSVAEGEN 6 SE-115 43 STOCKHOLM,SE
3 JONAS ROEDEN OESTERVAEGEN 40 SE-169 55 SOLNA,SE
4 KRISTOFER KJOERLING LOSTIGEN 10 SE-170 75 SOLNA,SE
5 LARS VILLEMOES MANDOLINVAEGEN 22 SE-175 56 JAEFAELLA,SE
PCT International Classification Number G10L19/00
PCT International Application Number PCT/EP05/003849
PCT International Filing date 2005-04-12
PCT Conventions:
# PCT Application Number Date of Convention Priority Country
1 0400998-1 2004-04-16 Sweden