| Patent application number | Description | Published |
| 20080212599 | METHODS AND SYSTEMS FOR ENCODING DATA IN A COMMUNICATION NETWORK - Methods and systems for encoding data in a communication network are presented. In an aspect, a method is provided for processing multimedia data. The method includes detecting a smoothness factor associated with one or more portions of the multimedia data, and determining that smoothing is required based on the smoothness factor. The method also includes moving selected multimedia data from a first selected portion of the multimedia data to a second selected portion of the multimedia data, wherein the smoothness factor is adjusted. In an aspect, an apparatus is provided that includes a detector configured to detect a smoothness factor associated with one or more portions of the multimedia data, and to determine that smoothing is required based on the smoothness factor. The apparatus also includes an encoder configured to move selected multimedia data from a first selected portion of the multimedia data to a second selected portion of the multimedia data. | 09-04-2008 |
| 20080212673 | Systems and Methods for Adaptively Determining I Frames for Acquisition and Base and Enhancement Layer Balancing - The invention includes apparatus, systems and methods for processing multimedia data. A method of processing multimedia data may include encoding a frame of the multimedia data as an I frame, a channel switch frame, and a P frame and selecting the encoded I frame if a size of the encoded I frame and a size of the encoded channel switch frame and the encoded P frame meet a first condition. An apparatus for processing multimedia data may include an encoder for encoding a frame of the multimedia data as an I frame, a channel switch frame, and a P frame and selecting the encoded I frame if a size of the encoded I frame and a size of the encoded channel switch frame and the encoded P frame meet a first condition. | 09-04-2008 |
| 20080212677 | Efficient Video Block Mode Changes in Second Pass Video Coding - This disclosure describes techniques for second pass video coding in a multi-pass video coding scenario. The coding modes for some video blocks encoded during a second pass may be changed relative to the coding modes used for such video blocks in the first pass. However, motion information does not change for those video blocks that have the changed modes. In particular, mode changes can be made in the second coding pass relative to the modes used in the first coding pass without changing the manner in which motion information will be derived at the decoder, e.g., due to similarities between the original modes of the first pass and changed modes used in the second pass. The second pass coding techniques may also include quantization parameter adjustments, and the mode changes can cause such quantization parameter adjustments to have more profound refinements effects on the second pass coding. | 09-04-2008 |
| 20080273810 | VIDEO CODING MODE SELECTION USING ESTIMATED CODING COSTS - This disclosure describes techniques for coding mode selection using estimated coding costs. To provide high compression efficiency, for example, an encoding device may attempt to select a coding mode for coding blocks of pixels that codes the data of the blocks with high efficiency. To this end, the encoding device may perform coding mode selection based on estimates of coding cost for at least a portion of the possible modes. In accordance with the techniques described herein, the encoding device estimates the coding cost for the different modes without actually coding the blocks. In fact, in some aspects, the encoding module device may estimate the coding cost for the modes without quantizing the data of the block for each mode. In this manner, the coding cost estimation techniques of this disclosure reduce the amount of computationally intensive calculations needed to perform effective mode selection. | 11-06-2008 |
| 20090016445 | EARLY RENDERING FOR FAST CHANNEL SWITCHING - The disclosure relates to techniques for switching between channels of digital multimedia content. In particular, a decoding device decodes and renders to a display at least one frame of a segment of data prior to receiving the entire segment. In certain aspects, the decoding device may render one of the frames of the segment and freeze the rendered frame until the decoding device receives all of the frames of the segment. In other aspects, the decoding device may render frames of one or more segments at a reduced rendering rate until the receiving and rendering operations of decoding device are synchronized such that the rendering of the current segment occurs at substantially the same time as the receiving of the next segment. By rendering at least frame prior to receiving the entire segment the decoding device more quickly displays content to a user during a channel switching event. | 01-15-2009 |
| 20090175331 | TWO PASS QUANTIZATION FOR CABAC CODERS - This disclosure describes techniques for quantizing coefficients of a video block for a video coding process that supports context-based adaptive binary arithmetic coding (CABAC). A method may comprise estimating a last non-zero coefficient of the video block, and generating a set of quantized coefficients for the video block based on an assumption that the estimated last non-zero coefficient is actually the last non-zero coefficient of the video block. | 07-09-2009 |
| 20090213930 | FAST MACROBLOCK DELTA QP DECISION - A system and method for encoding multimedia video is described. As video is encoded a quantization parameter is selected for each macroblock. As described herein, the quantization parameter for each macroblock may be selected by limiting the universe of all possible quantization parameters to a particular range of possible quantization parameter values. This increases the speed of video encoding by reducing the number of quantization parameters that are tested for each video macroblock. | 08-27-2009 |
| 20090257489 | RATE-DISTORTION QUANTIZATION FOR CONTEXT-ADAPTIVE VARIABLE LENGTH CODING (CAVLC) - In general, this disclosure provides techniques for quantization of the coefficients of video blocks in a manner that can achieve a desirable balance of rate and distortion. The described techniques may analyze a plurality of quantization levels associated with each individual coefficient to select the quantization level for the individual coefficients that results in a lowest coding cost. Since CAVLC does not encode each coefficient independently, the techniques may compute the coding costs for each of the candidate quantization levels associated with the individual coefficients based on quantization levels selected for previously quantized coefficients and estimated (or predicted) quantization levels for subsequent coefficients of a coefficient vector. The quantization levels for each of the coefficients are selected based on computed coding costs to obtain a set of quantized coefficients that minimize a rate-distortion model. | 10-15-2009 |
| 20090257499 | ADVANCED INTERPOLATION TECHNIQUES FOR MOTION COMPENSATION IN VIDEO CODING - This disclosure describes various interpolation techniques performed by an encoder and a decoder during the motion compensation process of video coding. In one example, an encoder interpolates pixel values of reference video data based on a plurality of different pre-defined interpolation filters. In this example, the decoder receives a syntax element that identifies an interpolation filter, and interpolates pixel values of reference video data based on the interpolation filter identified by the syntax element. In another example, a method of interpolating predictive video data includes generating half-pixel values based on integer pixel values, rounding the half-pixel values to generate half-pixel interpolated values, storing the half-pixel values as non-rounded versions of the half-pixel values, and generating quarter-pixel values based on the non-rounded versions of the half-pixel values and the integer pixel values. | 10-15-2009 |
| 20090257503 | ADVANCED INTERPOLATION TECHNIQUES FOR MOTION COMPENSATION IN VIDEO CODING - This disclosure describes various interpolation techniques performed by an encoder and a decoder during the motion compensation process of video coding. In one example, an encoder interpolates pixel values of reference video data based on a plurality of different pre-defined interpolation filters. In this example, the decoder receives a syntax element that identifies an interpolation filter, and interpolates pixel values of reference video data based on the interpolation filter identified by the syntax element. In another example, a method of interpolating predictive video data includes generating half-pixel values based on integer pixel values, rounding the half-pixel values to generate half-pixel interpolated values, storing the half-pixel values as non-rounded versions of the half-pixel values, and generating quarter-pixel values based on the non-rounded versions of the half-pixel values and the integer pixel values. | 10-15-2009 |
| 20090262801 | DEAD ZONE PARAMETER SELECTIONS FOR RATE CONTROL IN VIDEO CODING - Quantization techniques are used in video coding to quantize residual coefficients. So-called “dead zone parameters” are selected in the quantization process of residual coefficients of residual video blocks. The dead zone refers to a region of magnitude for coefficients below which any coefficient will be quantized to zero. A method and apparatus of quantizing coefficient values of video blocks in a video coding scheme is provided. A quantization parameter is selected for a set of video blocks. Dead zone parameters are then selected for different video blocks in the set of video blocks. Next, the quantization parameter and the dead zone parameters are applied to quantize the coefficient values of each of the video blocks. | 10-22-2009 |
| 20100020886 | SCALABILITY TECHNIQUES BASED ON CONTENT INFORMATION - Apparatus and methods of using content information for encoding multimedia data are described. A method of processing multimedia data includes classifying content of multimedia data, and encoding the multimedia data in a first data group and in a second data group based on the content classification. The first and second groups are associated with quality levels. A user can request a target quality level. | 01-28-2010 |
| 20100074332 | OFFSET CALCULATION IN SWITCHED INTERPOLATION FILTERS - This disclosure describes techniques for adding offset to predictive video blocks during video coding. In one example, a method of encoding a video block includes interpolating a first block of predictive values based on a first reference video unit within a first list of reference data, and a second block of predictive values based on a second reference video unit within a second list of reference data, calculating, for sub-integer pixel positions, a first offset value based on the first block and the current video block, and a second offset value based on the first offset value and the second block, determining a final block of offset values based on the first block of predictive values, the second block of predictive values, the first offset values, and the second offset values, and encoding the current video block based on the final block of offset values. | 03-25-2010 |
| 20100086025 | QUANTIZATION PARAMETER SELECTIONS FOR ENCODING OF CHROMA AND LUMA VIDEO BLOCKS - This disclosure describes rules that may be applied during block-based video coding to ensure that quantization parameter selections for luma blocks will not adversely affect the quality of chroma blocks. In accordance with this disclosure, rate-controlled video encoding occurs in which quantization parameter changes in luma blocks are pre-evaluated to determine whether such quantization parameter changes in luma blocks will also cause quantization changes for chroma blocks. If quantization parameter changes in the luma blocks will also cause quantization changes for chroma blocks, then that quantization parameter change for luma blocks may be skipped and not evaluated. In this way, secondary effects of quantization parameter changes in the luma blocks (with respect to the chroma blocks) can be avoided. | 04-08-2010 |
| 20100086029 | VIDEO CODING WITH LARGE MACROBLOCKS - Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels, for example, 64×64 pixels. In one example, an apparatus includes a video encoder configured to encode a video block having a size of more than 16×16 pixels, generate block-type syntax information that indicates the size of the block, and generate a coded block pattern value for the encoded block, wherein the coded block pattern value indicates whether the encoded block includes at least one non-zero coefficient. The encoder may set the coded block pattern value to zero when the encoded block does not include at least one non-zero coefficient or set the coded block pattern value to one when the encoded block includes a non-zero coefficient. | 04-08-2010 |
| 20100086030 | VIDEO CODING WITH LARGE MACROBLOCKS - Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels, for example, 64×64 pixels. Each macroblock may be partitioned into two or more partitions, and two or more of the partitions may be encoded using different modes. In one example, an apparatus includes a video encoder configured to receive a video block having a size of more than 16×16 pixels, partition the block into partitions, encode one of the partitions using a first encoding mode, encode another of the partitions using a second encoding mode different from the first encoding mode, and generate block-type syntax information that indicates the size of the block and identifies the partitions and the encoding modes used to encode the partitions. | 04-08-2010 |
| 20100086031 | VIDEO CODING WITH LARGE MACROBLOCKS - Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels, for example, 64×64 pixels. In one example, an apparatus includes a video encoder configured to receive a video coding unit, determine a first rate-distortion metric for encoding the video coding unit using first video blocks with sizes of 16×16 pixels, determine a second rate-distortion metric for encoding the video coding unit using second video blocks with sizes of more than 16×16 pixels, encode the video coding unit using the first video blocks when the first rate-distortion metric is less than second rate-distortion metric, and encode the video coding unit using the second video blocks when the second rate-distortion metric is less than the first rate-distortion metric. | 04-08-2010 |
| 20100086032 | VIDEO CODING WITH LARGE MACROBLOCKS - Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels. In one example, an apparatus includes a video encoder configured to encode a coded unit comprising a plurality of video blocks, wherein at least one of the plurality of video blocks comprises a size of more than 16×16 pixels and to generate syntax information for the coded unit that includes a maximum size value, wherein the maximum size value indicates a size of a largest one of the plurality of video blocks in the coded unit. The syntax information may also include a minimum size value. In this manner, the encoder may indicate to a decoder the proper syntax decoder to apply to the coded unit. | 04-08-2010 |
| 20100086049 | VIDEO CODING USING TRANSFORMS BIGGER THAN 4X4 AND 8X8 - In a video processing system, a method and system for applying transforms larger than 8×8 and non-rectangular transforms, and generating transform size syntax elements indicative of the transforms for video decoding are provided. The transform size syntax element may be generated by an encoder based on a prediction block size of a video block and the contents of the video block. Further, the transform size syntax element may be generated according to a set of rules to select from 4×4, 8×8, and larger transform sizes during an encoding process. A decoder may perform an inverse transform based on the transform size syntax element and the rules used by the encoder. The transform size syntax element may be transmitted to the decoder as part of the encoded video bitstream. | 04-08-2010 |
| 20100111182 | DIGITAL VIDEO CODING WITH INTERPOLATION FILTERS AND OFFSETS - This disclosure describes techniques for encoding digital video data using interpolation filters and offsets. An encoder may be configured to select interpolation filters for sub-pixel precision motion estimation based on historical interpolation results obtained for previously encoded video units, such as frames or slices. The encoder also may be configured to compute and assign offsets to the sub-pixel positions after interpolation based on differences between a reference unit and the unit to be coded. The computation and assignment of offsets may be performed before motion estimation. Motion estimation may be refined so that the motion search considers sub-pixel positions to which offsets have been previously added and evaluates sub-pixel positions that have a non-zero offset. In some cases, interpolation filter selection, offset computation, and/or refined motion estimation for a given unit may be performed in a single encoding pass. | 05-06-2010 |
| 20100177822 | FILTER PREDICTION BASED ON ACTIVITY METRICS IN VIDEO CODING - This disclosure describes techniques associated with filtering of video data in a video encoding and/or decoding process. In accordance with this disclosure, filtering is applied at an encoder, and filter information is encoded in the bitstream to identify the filtering that was applied at the encoder. Different types of filtering may be applied based on an activity metric determined for the video data. Moreover, in accordance with this disclosure, the manner in which the filter information is encoded into the bitstream may be dependent on the activity metric. In particular, for a first range of the activity metric, one or more filters are encoded directly, and for a second range of the activity metric, one or more filters are predictively encoded. | 07-15-2010 |
| 20100316134 | ASSEMBLING MULTIVIEW VIDEO CODING SUB-BISTREAMS IN MPEG-2 SYSTEMS - A demultiplexer may assemble view components of sub-bitstreams. In one example, an apparatus comprises a demultiplexer that produces a multiview video coding (MVC) standard compliant bitstream from a received bitstream comprising a primary sub-bitstream and an embedded sub-bitstream. To produce the MVC standard compliant bitstream, the demultiplexer determines whether a view component of the primary sub-bitstream has a view order index that is greater than a view order index of a view component of the embedded sub-bitstream, and to add the view component from the sub-bitstream for which the view order index is lower to the produced bitstream. The received bitstream may comprise delimiter network abstraction layer (NAL) units between each view component to differentiate the view components. The apparatus may further comprise a video decoder to decode the bitstream produced by the demultiplexer. | 12-16-2010 |
| 20100329342 | VIDEO CODING BASED ON FIRST ORDER PREDICTION AND PRE-DEFINED SECOND ORDER PREDICTION MODE - This disclosure describes video encoding and decoding techniques in which a first order prediction process and a second order prediction process are used in combination to generate predictive video blocks for video coding. First order prediction may be similar to conventional motion estimation and motion compensation that generates residual video blocks. The second order prediction may involve a process similar to conventional intra-prediction, but is performed on the residual video blocks. The techniques of this disclosure may pre-define the second order prediction to a specific mode, such as a mode similar to the intra-DC mode used in intra coding. In addition, the techniques of this disclosure may combine aspects of the first order and second order prediction into a single process so that the effects of second order prediction on the residuals are taken into account during the first order prediction process, which may improve compression. | 12-30-2010 |
| 20110007799 | NON-ZERO ROUNDING AND PREDICTION MODE SELECTION TECHNIQUES IN VIDEO ENCODING - In one aspect of this disclosure, techniques are described for selecting among default weighted prediction, implicit weighted prediction, and explicit weighted prediction. In this context, techniques are also described for adding offset to prediction data, e.g., using the format of explicit weighted prediction to allow for offsets to predictive data that is otherwise determined by implicit or default weighted prediction. | 01-13-2011 |
| 20110007802 | NON-ZERO ROUNDING AND PREDICTION MODE SELECTION TECHNIQUES IN VIDEO ENCODING - In one aspect of this disclosure, rounding adjustments to bi-directional predictive data may be purposely eliminated to provide predictive data that lacks any rounding bias. In this case, rounded and unrounded predictive data may both be considered in a rate-distortion analysis to identify the best data for prediction of a given video block. In another aspect of this disclosure, techniques are described for selecting among default weighted prediction, implicit weighted prediction, and explicit weighted prediction. In this context, techniques are also described for adding offset to prediction data, e.g., using the format of explicit weighted prediction to allow for offsets to predictive data that is otherwise determined by implicit or default weighted prediction. | 01-13-2011 |
| 20110032999 | SIGNALING CHARACTERISTICS OF AN MVC OPERATION POINT - Source and destination video devices may use data structures that signal details of an operation point for an MPEG-2 (Motion Picture Experts Group) System bitstream. In one example, an apparatus includes a multiplexer that constructs a data structure corresponding to a multiview video coding (MVC) operation point of an MPEG-2 (Motion Picture Experts Group) System standard bitstream, wherein the data structure signals a rendering capability value that describes a rendering capability to be satisfied by a receiving device to use the MVC operation point, a decoding capability value that describes a decoding capability to be satisfied by the receiving device to use the MVC operation point, and a bitrate value that describes a bitrate of the MVC operation point, and that includes the data structure as part of the bitstream, and an output interface that outputs the bitstream comprising the data structure. | 02-10-2011 |