Patent application number | Description | Published |
20080198933 | ADAPTIVE TRUNCATION OF TRANSFORM COEFFICIENT DATA IN A TRANSFORM-BASED DITIGAL MEDIA CODEC - A block transform-based digital media codec has a signaling scheme and bitstream syntax to flexibly signal that truncation of less significant information bits of transform coefficients coded as an optional layer of the bitstream has been performed adaptively per region or tile of the image. | 08-21-2008 |
20080198935 | COMPUTATIONAL COMPLEXITY AND PRECISION CONTROL IN TRANSFORM-BASED DIGITAL MEDIA CODEC - A digital media encoder/decoder includes signaling of various modes relating to computation complexity and precision at decoding. The encoder may send a syntax element indicating arithmetic precision (e.g., using 16 or 32-bit operations) of the transform operations performed at decoding. The encoder also may signal whether to apply scaling at the decoder output, which permits a wider dynamic range of intermediate data at decoding, but adds to computational complexity due to the scaling operation. | 08-21-2008 |
20080198936 | SIGNALING AND USE OF CHROMA SAMPLE POSITIONING INFORMATION - Rules for the signaling and interpretation of chroma position are described. One rule, called the short rule, defines fifteen discrete chroma centering positions and corresponding four-bit syntax element. Another rule, called the extended rule, defines 81 discrete chroma centering positions and corresponding seven-bit syntax elements. A described method includes receiving digital media data at a digital media encoder, determining chroma position information for the received digital media data, and representing the chroma position information with one or more syntax elements in an encoded bitstream. The one or more syntax elements are operable to communicate the chroma position information to a digital media decoder. The chroma position information facilitates an image rotation or flip. | 08-21-2008 |
20080199091 | SIGNALING AND USES OF WINDOWING INFORMATION FOR IMAGES - Techniques and tools for signaling and using image tiling information (such as syntax elements relating index tables and header size), signaling and using windowing information (such as techniques for using windowing parameters when rotating, cropping or flipping images), and signaling and using alpha channel information are described. | 08-21-2008 |
20080232452 | Parameterized filters and signaling techniques - Filter taps for filters are specified by filter coefficient parameters. The filter taps are greater in number than the coefficient parameters from which the filter taps are calculated. For example, two coefficient parameters are used to specify a four-tap filter. Filter information can be signaled in a bitstream, such as by signaling one or more family parameters for a filter family and, for each filter in a family, signaling one or more filter tap parameters from which filter taps can be derived. Family parameters can include a number of filters parameter, a resolution parameter, a scaling bits parameter, and/or a full integer position filter present parameter that indicates whether or not the filters include an integer position filter. Filter parameters can be signaled and used to determine coefficient parameters from which filter taps are calculated. | 09-25-2008 |
20090196338 | ENTROPY CODING EFFICIENCY ENHANCEMENT UTILIZING ENERGY DISTRIBUTION REMAPPING - Architecture for enhancing the compression (e.g., luma, chroma) of a video signal and improving the perceptual quality of the video compression schemes. The architecture operates to reshape the normal multimodal energy distribution of the input video signal to a new energy distribution. In the context of luma, the algorithm maps the black and white (or contrast) information of a picture to a new energy distribution. For example, the contrast can be enhanced in the middle range of the luma spectrum, thereby improving the contrast between a light foreground object and a dark background. At the same time, the algorithm reduces the bit-rate requirements at a particular quantization step size. The algorithm can be utilized also in post-processing to improve the quality of decoded video. | 08-06-2009 |
20090219994 | Scalable video coding and decoding with sample bit depth and chroma high-pass residual layers - Techniques and tools are described for scalable video encoding and decoding. In some embodiments, an input frame is downsampled in terms of sample depth and chroma sampling rate, encoded, and output from the encoder as a base layer bitstream. The base layer bitstream is also reconstructed and upsampled to produce a reconstructed bitstream which is subtracted from the original input frame to produce a residual layer. The residual layer is split and encoded as a sample depth residual layer bitstream and a chroma high-pass residual layer bitstream. To recover the encoded input frame, a decoder receives one or more of these bitstreams, decodes them, and combines them to form a reconstructed image. The use of separate codecs is allowed for the base layer and the enhancement layers, without inter-layer dependencies. | 09-03-2009 |
20090238279 | Motion-compensated prediction of inter-layer residuals - Techniques and tools are described for scalable video encoding and decoding. In some embodiments, an encoding tool encodes base layer video and outputs encoded base layer video in a base layer bit stream. The encoding tool encodes inter-layer residual video (representing differences between input video and reconstructed base layer video) using motion compensation relative to previously reconstructed inter-layer residual video. For the inter-layer residual video, the encoding tool outputs motion information and motion-compensated prediction residuals in an enhancement layer bit stream. A decoding tool receives the base layer bit stream and enhancement layer bit stream, reconstructs base layer video, reconstructs inter-layer residual video, and combines the reconstructed base layer video and reconstructed inter-layer residual video. Using motion compensation for the inter-layer residual video facilitates the use of separate motion vectors and separate codecs for the base layer video and inter-layer residual video. | 09-24-2009 |
20090296808 | Adaptive quantization for enhancement layer video coding - Techniques and tools for encoding enhancement layer video with quantization that varies spatially and/or between color channels are presented, along with corresponding decoding techniques and tools. For example, an encoding tool determines whether quantization varies spatially over a picture, and the tool also determines whether quantization varies between color channels in the picture. The tool signals quantization parameters for macroblocks in the picture in an encoded bit stream. In some implementations, to signal the quantization parameters, the tool predicts the quantization parameters, and the quantization parameters are signaled with reference to the predicted quantization parameters. A decoding tool receives the encoded bit stream, predicts the quantization parameters, and uses the signaled information to determine the quantization parameters for the macroblocks of the enhancement layer video. The decoding tool performs inverse quantization that can vary spatially and/or between color channels. | 12-03-2009 |
20090297054 | REDUCING DC LEAKAGE IN HD PHOTO TRANSFORM - In certain embodiments, to eliminate DC leakage into surrounding AC values, scaling stage within a photo overlap transform operator is modified such that the off-diagonal elements of the associated scaling matrix have the values of 0. In certain embodiments, the on-diagonal scaling matrix are given the values (0.5, 2). In some embodiments, the scaling is performed using a combination of reversible modulo arithmetic and lifting steps. In yet other embodiments, amount of DC leakage is estimated at the encoder, and preprocessing occurs to mitigate amount of leakage, with the bitstream signaling that preprocessing has occurred. A decoder may then read the signal and use the information to mitigate DC leakage. | 12-03-2009 |
20100046612 | CONVERSION OPERATIONS IN SCALABLE VIDEO ENCODING AND DECODING - Techniques and tools for conversion operations between modules in a scalable video encoding tool or scalable video decoding tool are described. For example, given reconstructed base layer video in a low resolution format (e.g., 4:2:0 video with 8 bits per sample) an encoding tool and decoding tool adaptively filter the reconstructed base layer video and upsample its sample values to a higher sample depth (e.g., 10 bits per sample). The tools also adaptively scale chroma samples to a higher chroma sampling rate (e.g., 4:2:2). The adaptive filtering and chroma scaling help reduce energy in inter-layer residual video by making the reconstructed base layer video closer to input video, which typically makes compression of the inter-layer residual video more efficient. The encoding tool also remaps sample values of the inter-layer residual video to adjust dynamic range before encoding, and the decoding tool performs inverse remapping after decoding. | 02-25-2010 |
20100046626 | ENTROPY CODING/DECODING OF HIERARCHICALLY ORGANIZED DATA - Techniques and tools for encoding and decoding data values that are hierarchically organized are presented. For example, an encoder encodes data as a set that has a hierarchy of subsets with set symbols. In the encoding, the encoder evaluates the data values of the set and selectively encodes a symbol combination code that indicates the set symbols of multiple subsets of the set. Then, for each of the multiple subsets considered as a new set, the encoder selectively repeats the evaluating, selective encoding and selective repetition for the new set. In corresponding decoding, a decoder decodes data encoded as a set that has a hierarchy of subsets with set symbols. In some implementations, the encoding and decoding are adaptive and use a symbol alphabet with nested elements. | 02-25-2010 |
20100061447 | SKIP MODES FOR INTER-LAYER RESIDUAL VIDEO CODING AND DECODING - Techniques and tools for skip modes in encoding and decoding of inter-layer residual video are described. For example, an encoder encodes multiple macroblocks of a picture of inter-layer residual video. For a current macroblock that is skipped, the encoder selects a skip mode from among multiple available skip modes and uses the selected skip mode when encoding the current macroblock. The skip modes can include intra skip mode and predicted-motion skip mode. A corresponding decoder, for the current macroblock, selects and uses the skip mode for the current macroblock during decoding. As another example, an encoder encodes multiple channels of a picture of inter-layer residual video. For each channel, the encoder determines whether to skip the channel. The encoder signals channel skip information to indicate which channels are skipped. A corresponding decoder parses the channel skip information and determines on a channel-by-channel basis whether to skip the respective channels. | 03-11-2010 |
20100092098 | REDUCED DC GAIN MISMATCH AND DC LEAKAGE IN OVERLAP TRANSFORM PROCESSING - In certain embodiments, overlap operators are applied during encoding and/or decoding of digital media, where the overlap operators have reduced DC gain mismatch and/or DC leakage between interior overlap operators and overlap operators at the edge and/or corner. In other embodiments, information indicating a selected tile boundary option for overlap processing can be encoded and/or decoded. The selected tile boundary option indicates one of a hard tile boundary option and a soft tile boundary option for processing with overlap operators. Overlap transform processing can then be applied based at least in part on the selected tile boundary option. | 04-15-2010 |
20100104025 | TECHNIQUE TO ENABLE EFFICIENT ADAPTIVE STREAMING AND TRANSCODING OF VIDEO AND OTHER SIGNALS - A method is described for efficiently determining total end-to-end distortion of a pre-compressed data stream, such as video streams or other media streams, at the time of delivery over a lossy-network, and for providing adaptive error-resilient delivery schemes based on distortion estimates. The methods can be utilized with single or multilayer packet streams and are particularly well suited for video streams. By way of example, distortion estimates are performed by generating side-information at the time of data stream compression, wherein the side-information is used in conjunction with information about the network status to determine an estimated distortion for the group of packets when the data stream is transported over the network to a destination end. This estimation may be utilized within described resiliency techniques in which the error correction mechanism is selected in response to the estimated distortion, which may be additionally refined in reference to cost factors. | 04-29-2010 |
20100246671 | DC COEFFICIENT SIGNALING AT SMALL QUANTIZATION STEP SIZES - Described tools and techniques relate to signaling for DC coefficients at small quantization step sizes. The techniques and tools can be used in combination or independently. For example, a tool such as a video encoder or decoder processes a VLC that indicates a DC differential for a DC coefficient, a FLC that indicates a value refinement for the DC differential, and a third code that indicates the sign for the DC differential. Even with the small quantization step sizes, the tool uses a VLC table with DC differentials for DC coefficients above the small quantization step sizes. The FLCs for DC differentials have lengths that vary depending on quantization step size. | 09-30-2010 |
20100280827 | NOISE ROBUST SPEECH CLASSIFIER ENSEMBLE - Embodiments for implementing a speech recognition system that includes a speech classifier ensemble are disclosed. In accordance with one embodiment, the speech recognition system includes a classifier ensemble to convert feature vectors that represent a speech vector into log probability sets. The classifier ensemble includes a plurality of classifiers. The speech recognition system includes a decoder ensemble to transform the log probability sets into output symbol sequences. The speech recognition system further includes a query component to retrieve one or more speech utterances from a speech database using the output symbol sequences. | 11-04-2010 |
20110280303 | FLEXIBLE RANGE REDUCTION - Techniques and tools are described for flexible range reduction of samples of video. For example, an encoder signals a first set of one or more syntax elements for range reduction of luma samples and signals a second set of one or more syntax elements for range reduction of chroma samples. The encoder selectively scales down the luma samples and chroma samples in a manner consistent with the first syntax element(s) and second syntax element(s), respectively. Or, an encoder signals range reduction syntax element(s) in an entry point header for an entry point segment, where the syntax element(s) apply to pictures in the entry point segment. If range reduction is used for the pictures, the encoder scales down samples of the pictures. Otherwise, the encoder skips the scaling down. A decoder performs corresponding parsing and scaling up operations. | 11-17-2011 |
20120020409 | ESTIMATING SAMPLE-DOMAIN DISTORTION IN THE TRANSFORM DOMAIN WITH ROUNDING COMPENSATION - Techniques and tools are described for compensating for rounding when estimating sample-domain distortion in the transform domain. For example, a video encoder estimates pixel-domain distortion in the transform domain for a block of transform coefficients after compensating for rounding in the DC coefficient of the block. In this way, the video encoder improves the accuracy of pixel-domain distortion estimation but retains the computational advantages of performing the estimation in the transform domain. Rounding compensation includes, for example, looking up an index (from a de-quantized transform coefficient) in a rounding offset table to determine a rounding offset, then adjusting the coefficient by the offset. Other techniques and tools described herein are directed to creating rounding offset tables and encoders that make encoding decisions after considering rounding effects that occur after an inverse frequency transform on de-quantized transform coefficient values. | 01-26-2012 |
20120243615 | MULTI-LEVEL REPRESENTATION OF REORDERED TRANSFORM COEFFICIENTS - Techniques and tools for encoding and decoding a block of frequency coefficients are presented. An encoder selects a scan order from multiple available scan orders and then applies the selected scan order to a two-dimensional matrix of transform coefficients, grouping non-zero values of the frequency coefficients together in a one-dimensional string. The encoder entropy encodes the one-dimensional string of coefficient values according to a multi-level nested set representation. In decoding, a decoder entropy decodes the one-dimensional string of coefficient values from the multi-level nested set representation. The decoder selects the scan order from among multiple available scan orders and then reorders the coefficients back into a two-dimensional matrix using the selected scan order. | 09-27-2012 |
20120307910 | PARAMETERIZATION FOR FADING COMPENSATION - Techniques and tools for performing fading compensation in video processing applications are described. For example, during encoding, a video encoder performs fading compensation using fading parameters comprising a scaling parameter and a shifting parameter on one or more reference images. During decoding, a video decoder performs corresponding fading compensation on the one or more reference images. | 12-06-2012 |
20130010861 | USE OF FRAME CACHING TO IMPROVE PACKET LOSS RECOVERY - Various new and non-obvious apparatus and methods for using frame caching to improve packet loss recovery are disclosed. One of the disclosed embodiments is a method for using periodical and synchronized frame caching within an encoder and its corresponding decoder. When the decoder discovers packet loss, it informs the encoder which then generates a frame based on one of the shared frames stored at both the encoder and the decoder. When the decoder receives this generated frame it can decode it using its locally cached frame. | 01-10-2013 |
20130215971 | REDUCING DC LEAKAGE IN HD PHOTO TRANSFORM - In certain embodiments, to eliminate DC leakage into surrounding AC values, scaling stage within a photo overlap transform operator is modified such that the off-diagonal elements of the associated scaling matrix have the values of 0. In certain embodiments, the on-diagonal scaling matrix are given the values (0.5, 2). In some embodiments, the scaling is performed using a combination of reversible modulo arithmetic and lifting steps. In yet other embodiments, amount of DC leakage is estimated at the encoder, and preprocessing occurs to mitigate amount of leakage, with the bitstream signaling that preprocessing has occurred. A decoder may then read the signal and use the information to mitigate DC leakage. | 08-22-2013 |
20140056531 | SIGNALING AND USES OF WINDOWING INFORMATION FOR IMAGES - Techniques and tools for signaling and using image tiling information (such as syntax elements relating index tables and header size), signaling and using windowing information (such as techniques for using windowing parameters when rotating, cropping or flipping images), and signaling and using alpha channel information are described. | 02-27-2014 |
20140133583 | USE OF FRAME CACHING TO IMPROVE PACKET LOSS RECOVERY - Various new and non-obvious apparatus and methods for using frame caching to improve packet loss recovery are disclosed. One of the disclosed embodiments is a method for using periodical and synchronized frame caching within an encoder and its corresponding decoder. When the decoder discovers packet loss, it informs the encoder which then generates a frame based on one of the shared frames stored at both the encoder and the decoder. When the decoder receives this generated frame it can decode it using its locally cached frame. | 05-15-2014 |
20140294070 | ADAPTIVE QUANTIZATION FOR ENHANCEMENT LAYER VIDEO CODING - Techniques and tools for encoding enhancement layer video with quantization that varies spatially and/or between color channels are presented, along with corresponding decoding techniques and tools. For example, an encoding tool determines whether quantization varies spatially over a picture, and the tool also determines whether quantization varies between color channels in the picture. The tool signals quantization parameters for macroblocks in the picture in an encoded bit stream. In some implementations, to signal the quantization parameters, the tool predicts the quantization parameters, and the quantization parameters are signaled with reference to the predicted quantization parameters. A decoding tool receives the encoded bit stream, predicts the quantization parameters, and uses the signaled information to determine the quantization parameters for the macroblocks of the enhancement layer video. The decoding tool performs inverse quantization that can vary spatially and/or between color channels. | 10-02-2014 |
20150043633 | ADAPTIVE QUANTIZATION FOR ENHANCEMENT LAYER VIDEO CODING - Techniques and tools for encoding enhancement layer video with quantization that varies spatially and/or between color channels are presented, along with corresponding decoding techniques and tools. For example, an encoding tool determines whether quantization varies spatially over a picture, and the tool also determines whether quantization varies between color channels in the picture. The tool signals quantization parameters for macroblocks in the picture in an encoded bit stream. In some implementations, to signal the quantization parameters, the tool predicts the quantization parameters, and the quantization parameters are signaled with reference to the predicted quantization parameters. A decoding tool receives the encoded bit stream, predicts the quantization parameters, and uses the signaled information to determine the quantization parameters for the macroblocks of the enhancement layer video. The decoding tool performs inverse quantization that can vary spatially and/or between color channels. | 02-12-2015 |