Patent application number | Description | Published |
20090319281 | CUE-BASED AUDIO CODING/DECODING - Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “.1” indicates a single low-frequency effects (LFE) channel and “.2” indicates two LFE channels. | 12-24-2009 |
20100070287 | ADAPTING MASKING THRESHOLDS FOR ENCODING A LOW FREQUENCY TRANSIENT SIGNAL IN AUDIO DATA - An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds is also calculated for the 8 short blocks corresponding to the long block. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal. | 03-18-2010 |
20110054911 | Enhanced Audio Decoder - Methods, systems, and apparatus are presented for decoding an audio signal that includes bandwidth extension data. An audio signal that includes core audio data and bandwidth extension data can be received in a decoder. The core audio data can be associated with a core portion of an audio signal, such as the frequency range below a cutoff frequency, and the bandwidth extension data can be associated with an extended portion of the audio signal, such as a frequency range above the cutoff frequency. The core audio data can be decoded to generate a decoded core audio signal in a time domain representation. Further, an extended portion of the audio signal can be reconstructed in accordance with the extension data and the decoded core audio signal. Additionally, the decoded core audio signal can be lowpass filtered and the extended portion can be highpass filtered before being combined to generate a decoded output signal. | 03-03-2011 |
20110106544 | ADAPTING MASKING THRESHOLDS FOR ENCODING A LOW FREQUENCY TRANSIENT SIGNAL IN AUDIO DATA - An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds is also calculated for the 8 short blocks corresponding to the long block. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal. | 05-05-2011 |
20110164756 | Cue-Based Audio Coding/Decoding - Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “.1” indicates a single low-frequency effects (LFE) channel and “.2” indicates two LFE channels. | 07-07-2011 |
20120016679 | ADAPTING MASKING THRESHOLDS FOR ENCODING AUDIO DATA - According to one embodiment, an improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds is also calculated for the 8 short blocks corresponding to the long block. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal. | 01-19-2012 |
20140294200 | METADATA FOR LOUDNESS AND DYNAMIC RANGE CONTROL - An audio normalization gain value is applied to an audio signal to produce a normalized signal. The normalized signal is processed to compute dynamic range control (DRC) gain values in accordance with a selected one of several pre-defined DRC characteristics. The audio signal is encoded, and the DRC gain values are provided as metadata associated with the encoded audio signal. Several other embodiments are also described and claimed. | 10-02-2014 |
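
The last abstract above (20140294200) describes a three-step flow: apply a normalization gain, compute DRC gain values from the normalized signal using a selected pre-defined characteristic, and carry those gains as metadata. The following is a minimal sketch of that flow, not the method claimed in the application. The characteristic names, the threshold/ratio parameterization, the per-frame RMS level measure, and every function name are assumptions made for illustration.

```python
import math

# Hypothetical pre-defined DRC characteristics as (threshold_db, ratio).
# These values are illustrative only; they are not taken from the application.
DRC_CHARACTERISTICS = {
    "light": (-20.0, 2.0),
    "heavy": (-30.0, 4.0),
}


def apply_gain_db(samples, gain_db):
    """Scale samples by a gain given in decibels."""
    scale = 10.0 ** (gain_db / 20.0)
    return [s * scale for s in samples]


def frame_level_db(frame):
    """Mean-square level of one frame in dB (floored to avoid log of zero)."""
    mean_square = sum(s * s for s in frame) / len(frame)
    return 10.0 * math.log10(max(mean_square, 1e-12))


def drc_gain_db(level_db, threshold_db, ratio):
    """Static compressor curve: attenuate only the part above the threshold."""
    if level_db <= threshold_db:
        return 0.0
    return (threshold_db - level_db) * (1.0 - 1.0 / ratio)


def compute_drc_metadata(samples, normalization_gain_db, characteristic, frame_size=4):
    """Return one DRC gain (dB) per frame, computed on the normalized signal."""
    threshold_db, ratio = DRC_CHARACTERISTICS[characteristic]
    normalized = apply_gain_db(samples, normalization_gain_db)
    gains = []
    for start in range(0, len(normalized), frame_size):
        frame = normalized[start:start + frame_size]
        gains.append(drc_gain_db(frame_level_db(frame), threshold_db, ratio))
    return gains
```

In this sketch the encoder would store the returned list next to the encoded audio, so a decoder can apply the gains or ignore them without re-analyzing the signal.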
Patent application number | Description | Published |
20080249769 | Method and Apparatus for Determining Audio Spatial Quality - Techniques for evaluating the audio quality of an audio test signal are disclosed. These techniques provide a quality analysis that takes into account spatial audio distortions between the audio test signal and a reference audio signal. These techniques involve, for example, determining a plurality of audio spatial cues for an audio test signal, determining a corresponding plurality of audio spatial cues for an audio reference signal, comparing the determined audio spatial cues of the audio test signal to the audio spatial cues of the audio reference signal, and determining the audio quality of the audio test signal. | 10-09-2008 |
20140067404 | INTENSITY STEREO CODING IN ADVANCED AUDIO CODING - A system and method for selectively applying Intensity Stereo coding to an audio signal is described. The system and method make decisions on whether to apply Intensity Stereo coding to each scale factor band of the audio signal based on (1) the number of bits necessary to encode each scale factor band using Intensity Stereo coding, (2) spatial distortions generated by using Intensity Stereo coding with each scale factor band, and (3) switching distortions for each scale factor band resulting from switching Intensity Stereo coding on or off in relation to a previous scale factor band. | 03-06-2014 |
20140297291 | METADATA DRIVEN DYNAMIC RANGE CONTROL - A system for encoding and applying Dynamic Range Control/Compression (DRC) gain values to a piece of sound program content is described. In particular, a set of DRC gain values representing a DRC gain curve for the piece of content may be divided into frames corresponding to frames of the piece of content. A set of fields may be included with an audio signal representing the piece of content. The additional fields may represent the DRC gain values using linear or spline interpolation. The additional fields may include 1) an initial gain value for each DRC frame, 2) a set of slope values at particular points in the DRC curve, 3) a set of time delta values for each consecutive pair of slope values, and/or 4) one or more gain delta values representing changes of DRC gain values in the DRC gain curve between points of the slope values. | 10-02-2014 |
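
The last abstract above (20140297291) lists fields from which a decoder can rebuild a DRC gain curve for each frame. The sketch below covers only the linear-interpolation case: it builds curve nodes from an initial gain plus paired time and gain deltas, then interpolates one gain per sample. The slope fields used for spline interpolation are ignored here, and the function name, argument layout, and units (samples, dB) are assumptions rather than the application's actual bitstream syntax.

```python
def decode_drc_frame(initial_gain_db, time_deltas, gain_deltas, frame_length):
    """Rebuild a per-sample gain curve (dB) for one DRC frame.

    Hypothetical layout: node k+1 lies time_deltas[k] samples after node k
    and its gain differs from node k by gain_deltas[k] dB.
    """
    if len(time_deltas) != len(gain_deltas):
        raise ValueError("time_deltas and gain_deltas must have equal length")

    # Accumulate the deltas into absolute (sample_index, gain_db) nodes.
    nodes = [(0, initial_gain_db)]
    for dt, dg in zip(time_deltas, gain_deltas):
        if dt <= 0:
            raise ValueError("time deltas must be positive")
        t_prev, g_prev = nodes[-1]
        nodes.append((t_prev + dt, g_prev + dg))

    gains = []
    seg = 0
    for n in range(frame_length):
        # Advance to the segment whose start node is at or before sample n.
        while seg + 1 < len(nodes) and nodes[seg + 1][0] <= n:
            seg += 1
        t0, g0 = nodes[seg]
        if seg + 1 == len(nodes):
            gains.append(g0)  # past the last node: hold its gain
        else:
            t1, g1 = nodes[seg + 1]
            gains.append(g0 + (g1 - g0) * (n - t0) / (t1 - t0))
    return gains


# Example: start at 0 dB, fall 4 dB over 2 samples, then recover 2 dB over 2 samples.
curve = decode_drc_frame(0.0, [2, 2], [-4.0, 2.0], 6)
print(curve)  # [0.0, -2.0, -4.0, -3.0, -2.0, -2.0]
```

Sending deltas instead of a gain for every sample keeps the metadata small. The decoder pays for that with a little arithmetic per sample.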