Patent application number | Description | Published |
20080212599 | METHODS AND SYSTEMS FOR ENCODING DATA IN A COMMUNICATION NETWORK - Methods and systems for encoding data in a communication network are presented. In an aspect, a method is provided for processing multimedia data. The method includes detecting a smoothness factor associated with one or more portions of the multimedia data, and determining that smoothing is required based on the smoothness factor. The method also includes moving selected multimedia data from a first selected portion of the multimedia data to a second selected portion of the multimedia data, wherein the smoothness factor is adjusted. In an aspect, an apparatus is provided that includes a detector configured to detect a smoothness factor associated with one or more portions of the multimedia data, and to determine that smoothing is required based on the smoothness factor. The apparatus also includes an encoder configured to move selected multimedia data from a first selected portion of the multimedia data to a second selected portion of the multimedia data. | 09-04-2008 |
20080212673 | Systems and Methods for Adaptively Determining I Frames for Acquisition and Base and Enhancement Layer Balancing - The invention includes apparatus, systems and methods for processing multimedia data. A method of processing multimedia data may include encoding a frame of the multimedia data as an I frame, a channel switch frame, and a P frame and selecting the encoded I frame if a size of the encoded I frame and a size of the encoded channel switch frame and the encoded P frame meet a first condition. An apparatus for processing multimedia data may include an encoder for encoding a frame of the multimedia data as an I frame, a channel switch frame, and a P frame and selecting the encoded I frame if a size of the encoded I frame and a size of the encoded channel switch frame and the encoded P frame meet a first condition. | 09-04-2008 |
20080212677 | Efficient Video Block Mode Changes in Second Pass Video Coding - This disclosure describes techniques for second pass video coding in a multi-pass video coding scenario. The coding modes for some video blocks encoded during a second pass may be changed relative to the coding modes used for such video blocks in the first pass. However, motion information does not change for those video blocks that have the changed modes. In particular, mode changes can be made in the second coding pass relative to the modes used in the first coding pass without changing the manner in which motion information will be derived at the decoder, e.g., due to similarities between the original modes of the first pass and changed modes used in the second pass. The second pass coding techniques may also include quantization parameter adjustments, and the mode changes can cause such quantization parameter adjustments to have more profound refinements effects on the second pass coding. | 09-04-2008 |
20080273810 | VIDEO CODING MODE SELECTION USING ESTIMATED CODING COSTS - This disclosure describes techniques for coding mode selection using estimated coding costs. To provide high compression efficiency, for example, an encoding device may attempt to select a coding mode for coding blocks of pixels that codes the data of the blocks with high efficiency. To this end, the encoding device may perform coding mode selection based on estimates of coding cost for at least a portion of the possible modes. In accordance with the techniques described herein, the encoding device estimates the coding cost for the different modes without actually coding the blocks. In fact, in some aspects, the encoding module device may estimate the coding cost for the modes without quantizing the data of the block for each mode. In this manner, the coding cost estimation techniques of this disclosure reduce the amount of computationally intensive calculations needed to perform effective mode selection. | 11-06-2008 |
20090016445 | EARLY RENDERING FOR FAST CHANNEL SWITCHING - The disclosure relates to techniques for switching between channels of digital multimedia content. In particular, a decoding device decodes and renders to a display at least one frame of a segment of data prior to receiving the entire segment. In certain aspects, the decoding device may render one of the frames of the segment and freeze the rendered frame until the decoding device receives all of the frames of the segment. In other aspects, the decoding device may render frames of one or more segments at a reduced rendering rate until the receiving and rendering operations of decoding device are synchronized such that the rendering of the current segment occurs at substantially the same time as the receiving of the next segment. By rendering at least frame prior to receiving the entire segment the decoding device more quickly displays content to a user during a channel switching event. | 01-15-2009 |
20090175331 | TWO PASS QUANTIZATION FOR CABAC CODERS - This disclosure describes techniques for quantizing coefficients of a video block for a video coding process that supports context-based adaptive binary arithmetic coding (CABAC). A method may comprise estimating a last non-zero coefficient of the video block, and generating a set of quantized coefficients for the video block based on an assumption that the estimated last non-zero coefficient is actually the last non-zero coefficient of the video block. | 07-09-2009 |
20090213930 | FAST MACROBLOCK DELTA QP DECISION - A system and method for encoding multimedia video is described. As video is encoded a quantization parameter is selected for each macroblock. As described herein, the quantization parameter for each macroblock may be selected by limiting the universe of all possible quantization parameters to a particular range of possible quantization parameter values. This increases the speed of video encoding by reducing the number of quantization parameters that are tested for each video macroblock. | 08-27-2009 |
20090257489 | RATE-DISTORTION QUANTIZATION FOR CONTEXT-ADAPTIVE VARIABLE LENGTH CODING (CAVLC) - In general, this disclosure provides techniques for quantization of the coefficients of video blocks in a manner that can achieve a desirable balance of rate and distortion. The described techniques may analyze a plurality of quantization levels associated with each individual coefficient to select the quantization level for the individual coefficients that results in a lowest coding cost. Since CAVLC does not encode each coefficient independently, the techniques may compute the coding costs for each of the candidate quantization levels associated with the individual coefficients based on quantization levels selected for previously quantized coefficients and estimated (or predicted) quantization levels for subsequent coefficients of a coefficient vector. The quantization levels for each of the coefficients are selected based on computed coding costs to obtain a set of quantized coefficients that minimize a rate-distortion model. | 10-15-2009 |
20090257499 | ADVANCED INTERPOLATION TECHNIQUES FOR MOTION COMPENSATION IN VIDEO CODING - This disclosure describes various interpolation techniques performed by an encoder and a decoder during the motion compensation process of video coding. In one example, an encoder interpolates pixel values of reference video data based on a plurality of different pre-defined interpolation filters. In this example, the decoder receives a syntax element that identifies an interpolation filter, and interpolates pixel values of reference video data based on the interpolation filter identified by the syntax element. In another example, a method of interpolating predictive video data includes generating half-pixel values based on integer pixel values, rounding the half-pixel values to generate half-pixel interpolated values, storing the half-pixel values as non-rounded versions of the half-pixel values, and generating quarter-pixel values based on the non-rounded versions of the half-pixel values and the integer pixel values. | 10-15-2009 |
20090257503 | ADVANCED INTERPOLATION TECHNIQUES FOR MOTION COMPENSATION IN VIDEO CODING - This disclosure describes various interpolation techniques performed by an encoder and a decoder during the motion compensation process of video coding. In one example, an encoder interpolates pixel values of reference video data based on a plurality of different pre-defined interpolation filters. In this example, the decoder receives a syntax element that identifies an interpolation filter, and interpolates pixel values of reference video data based on the interpolation filter identified by the syntax element. In another example, a method of interpolating predictive video data includes generating half-pixel values based on integer pixel values, rounding the half-pixel values to generate half-pixel interpolated values, storing the half-pixel values as non-rounded versions of the half-pixel values, and generating quarter-pixel values based on the non-rounded versions of the half-pixel values and the integer pixel values. | 10-15-2009 |
20090262801 | DEAD ZONE PARAMETER SELECTIONS FOR RATE CONTROL IN VIDEO CODING - Quantization techniques are used in video coding to quantize residual coefficients. So-called “dead zone parameters” are selected in the quantization process of residual coefficients of residual video blocks. The dead zone refers to a region of magnitude for coefficients below which any coefficient will be quantized to zero. A method and apparatus of quantizing coefficient values of video blocks in a video coding scheme is provided. A quantization parameter is selected for a set of video blocks. Dead zone parameters are then selected for different video blocks in the set of video blocks. Next, the quantization parameter and the dead zone parameters are applied to quantize the coefficient values of each of the video blocks. | 10-22-2009 |
20100020886 | SCALABILITY TECHNIQUES BASED ON CONTENT INFORMATION - Apparatus and methods of using content information for encoding multimedia data are described. A method of processing multimedia data includes classifying content of multimedia data, and encoding the multimedia data in a first data group and in a second data group based on the content classification. The first and second groups are associated with quality levels. A user can request a target quality level. | 01-28-2010 |
20100074332 | OFFSET CALCULATION IN SWITCHED INTERPOLATION FILTERS - This disclosure describes techniques for adding offset to predictive video blocks during video coding. In one example, a method of encoding a video block includes interpolating a first block of predictive values based on a first reference video unit within a first list of reference data, and a second block of predictive values based on a second reference video unit within a second list of reference data, calculating, for sub-integer pixel positions, a first offset value based on the first block and the current video block, and a second offset value based on the first offset value and the second block, determining a final block of offset values based on the first block of predictive values, the second block of predictive values, the first offset values, and the second offset values, and encoding the current video block based on the final block of offset values. | 03-25-2010 |
20100086025 | QUANTIZATION PARAMETER SELECTIONS FOR ENCODING OF CHROMA AND LUMA VIDEO BLOCKS - This disclosure describes rules that may be applied during block-based video coding to ensure that quantization parameter selections for luma blocks will not adversely affect the quality of chroma blocks. In accordance with this disclosure, rate-controlled video encoding occurs in which quantization parameter changes in luma blocks are pre-evaluated to determine whether such quantization parameter changes in luma blocks will also cause quantization changes for chroma blocks. If quantization parameter changes in the luma blocks will also cause quantization changes for chroma blocks, then that quantization parameter change for luma blocks may be skipped and not evaluated. In this way, secondary effects of quantization parameter changes in the luma blocks (with respect to the chroma blocks) can be avoided. | 04-08-2010 |
20100086029 | VIDEO CODING WITH LARGE MACROBLOCKS - Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels, for example, 64×64 pixels. In one example, an apparatus includes a video encoder configured to encode a video block having a size of more than 16×16 pixels, generate block-type syntax information that indicates the size of the block, and generate a coded block pattern value for the encoded block, wherein the coded block pattern value indicates whether the encoded block includes at least one non-zero coefficient. The encoder may set the coded block pattern value to zero when the encoded block does not include at least one non-zero coefficient or set the coded block pattern value to one when the encoded block includes a non-zero coefficient. | 04-08-2010 |
20100086030 | VIDEO CODING WITH LARGE MACROBLOCKS - Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels, for example, 64×64 pixels. Each macroblock may be partitioned into two or more partitions, and two or more of the partitions may be encoded using different modes. In one example, an apparatus includes a video encoder configured to receive a video block having a size of more than 16×16 pixels, partition the block into partitions, encode one of the partitions using a first encoding mode, encode another of the partitions using a second encoding mode different from the first encoding mode, and generate block-type syntax information that indicates the size of the block and identifies the partitions and the encoding modes used to encode the partitions. | 04-08-2010 |
20100086031 | VIDEO CODING WITH LARGE MACROBLOCKS - Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels, for example, 64×64 pixels. In one example, an apparatus includes a video encoder configured to receive a video coding unit, determine a first rate-distortion metric for encoding the video coding unit using first video blocks with sizes of 16×16 pixels, determine a second rate-distortion metric for encoding the video coding unit using second video blocks with sizes of more than 16×16 pixels, encode the video coding unit using the first video blocks when the first rate-distortion metric is less than second rate-distortion metric, and encode the video coding unit using the second video blocks when the second rate-distortion metric is less than the first rate-distortion metric. | 04-08-2010 |
20100086032 | VIDEO CODING WITH LARGE MACROBLOCKS - Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels. In one example, an apparatus includes a video encoder configured to encode a coded unit comprising a plurality of video blocks, wherein at least one of the plurality of video blocks comprises a size of more than 16×16 pixels and to generate syntax information for the coded unit that includes a maximum size value, wherein the maximum size value indicates a size of a largest one of the plurality of video blocks in the coded unit. The syntax information may also include a minimum size value. In this manner, the encoder may indicate to a decoder the proper syntax decoder to apply to the coded unit. | 04-08-2010 |
20100086049 | VIDEO CODING USING TRANSFORMS BIGGER THAN 4X4 AND 8X8 - In a video processing system, a method and system for applying transforms larger than 8×8 and non-rectangular transforms, and generating transform size syntax elements indicative of the transforms for video decoding are provided. The transform size syntax element may be generated by an encoder based on a prediction block size of a video block and the contents of the video block. Further, the transform size syntax element may be generated according to a set of rules to select from 4×4, 8×8, and larger transform sizes during an encoding process. A decoder may perform an inverse transform based on the transform size syntax element and the rules used by the encoder. The transform size syntax element may be transmitted to the decoder as part of the encoded video bitstream. | 04-08-2010 |
20100111182 | DIGITAL VIDEO CODING WITH INTERPOLATION FILTERS AND OFFSETS - This disclosure describes techniques for encoding digital video data using interpolation filters and offsets. An encoder may be configured to select interpolation filters for sub-pixel precision motion estimation based on historical interpolation results obtained for previously encoded video units, such as frames or slices. The encoder also may be configured to compute and assign offsets to the sub-pixel positions after interpolation based on differences between a reference unit and the unit to be coded. The computation and assignment of offsets may be performed before motion estimation. Motion estimation may be refined so that the motion search considers sub-pixel positions to which offsets have been previously added and evaluates sub-pixel positions that have a non-zero offset. In some cases, interpolation filter selection, offset computation, and/or refined motion estimation for a given unit may be performed in a single encoding pass. | 05-06-2010 |
20100177822 | FILTER PREDICTION BASED ON ACTIVITY METRICS IN VIDEO CODING - This disclosure describes techniques associated with filtering of video data in a video encoding and/or decoding process. In accordance with this disclosure, filtering is applied at an encoder, and filter information is encoded in the bitstream to identify the filtering that was applied at the encoder. Different types of filtering may be applied based on an activity metric determined for the video data. Moreover, in accordance with this disclosure, the manner in which the filter information is encoded into the bitstream may be dependent on the activity metric. In particular, for a first range of the activity metric, one or more filters are encoded directly, and for a second range of the activity metric, one or more filters are predictively encoded. | 07-15-2010 |
20100316134 | ASSEMBLING MULTIVIEW VIDEO CODING SUB-BISTREAMS IN MPEG-2 SYSTEMS - A demultiplexer may assemble view components of sub-bitstreams. In one example, an apparatus comprises a demultiplexer that produces a multiview video coding (MVC) standard compliant bitstream from a received bitstream comprising a primary sub-bitstream and an embedded sub-bitstream. To produce the MVC standard compliant bitstream, the demultiplexer determines whether a view component of the primary sub-bitstream has a view order index that is greater than a view order index of a view component of the embedded sub-bitstream, and to add the view component from the sub-bitstream for which the view order index is lower to the produced bitstream. The received bitstream may comprise delimiter network abstraction layer (NAL) units between each view component to differentiate the view components. The apparatus may further comprise a video decoder to decode the bitstream produced by the demultiplexer. | 12-16-2010 |
20100329342 | VIDEO CODING BASED ON FIRST ORDER PREDICTION AND PRE-DEFINED SECOND ORDER PREDICTION MODE - This disclosure describes video encoding and decoding techniques in which a first order prediction process and a second order prediction process are used in combination to generate predictive video blocks for video coding. First order prediction may be similar to conventional motion estimation and motion compensation that generates residual video blocks. The second order prediction may involve a process similar to conventional intra-prediction, but is performed on the residual video blocks. The techniques of this disclosure may pre-define the second order prediction to a specific mode, such as a mode similar to the intra-DC mode used in intra coding. In addition, the techniques of this disclosure may combine aspects of the first order and second order prediction into a single process so that the effects of second order prediction on the residuals are taken into account during the first order prediction process, which may improve compression. | 12-30-2010 |
20110007799 | NON-ZERO ROUNDING AND PREDICTION MODE SELECTION TECHNIQUES IN VIDEO ENCODING - In one aspect of this disclosure, techniques are described for selecting among default weighted prediction, implicit weighted prediction, and explicit weighted prediction. In this context, techniques are also described for adding offset to prediction data, e.g., using the format of explicit weighted prediction to allow for offsets to predictive data that is otherwise determined by implicit or default weighted prediction. | 01-13-2011 |
20110007802 | NON-ZERO ROUNDING AND PREDICTION MODE SELECTION TECHNIQUES IN VIDEO ENCODING - In one aspect of this disclosure, rounding adjustments to bi-directional predictive data may be purposely eliminated to provide predictive data that lacks any rounding bias. In this case, rounded and unrounded predictive data may both be considered in a rate-distortion analysis to identify the best data for prediction of a given video block. In another aspect of this disclosure, techniques are described for selecting among default weighted prediction, implicit weighted prediction, and explicit weighted prediction. In this context, techniques are also described for adding offset to prediction data, e.g., using the format of explicit weighted prediction to allow for offsets to predictive data that is otherwise determined by implicit or default weighted prediction. | 01-13-2011 |
20110032999 | SIGNALING CHARACTERISTICS OF AN MVC OPERATION POINT - Source and destination video devices may use data structures that signal details of an operation point for an MPEG-2 (Motion Picture Experts Group) System bitstream. In one example, an apparatus includes a multiplexer that constructs a data structure corresponding to a multiview video coding (MVC) operation point of an MPEG-2 (Motion Picture Experts Group) System standard bitstream, wherein the data structure signals a rendering capability value that describes a rendering capability to be satisfied by a receiving device to use the MVC operation point, a decoding capability value that describes a decoding capability to be satisfied by the receiving device to use the MVC operation point, and a bitrate value that describes a bitrate of the MVC operation point, and that includes the data structure as part of the bitstream, and an output interface that outputs the bitstream comprising the data structure. | 02-10-2011 |
20110194613 | VIDEO CODING WITH LARGE MACROBLOCKS - A video coder may utilize large macroblocks having more than 16×16 pixels. Syntax for the large macroblocks may define whether a bitstream includes large macroblocks, such as superblocks having 64×64 pixels or bigblocks having 32×32 pixels. The syntax may be included in a slice header or a sequence parameter set. The large macroblocks may also be encoded according to a large macroblock syntax. The bitstream may further include syntax data that indicates a level value based on whether the bitstream includes any of the large macroblocks, for example, as a smallest-sized luminance prediction block. A decoder may use the level value to determine whether the decoder is capable of decoding the bitstream. | 08-11-2011 |
20110200097 | ADAPTIVE TRANSFORM SIZE SELECTION FOR GEOMETRIC MOTION PARTITIONING - In one example, an apparatus includes a video encoder configured to partition a block of video data into a first geometric partition and a second geometric partition using a geometric motion partition line, wherein the block comprises N×N pixels, divide the block of video data into four equally-sized, non-overlapping (N/2)×(N/2) sub-blocks, and encode at least one of the sub-blocks through which the geometric motion partition line passes using a transform size smaller than (N/2)×(N/2). The video encoder may determine transform sizes for the sub-blocks based on whether the geometric motion partition line passes through the sub-blocks. In one example, a video decoder may inverse transform the sub-blocks, and may determine transform sizes for the sub-blocks based on whether the geometric motion partition line passes through the sub-blocks. | 08-18-2011 |
20110200108 | CHROMINANCE HIGH PRECISION MOTION FILTERING FOR MOTION INTERPOLATION - A video coding unit may be configured to encode or decode chrominance blocks of video data by reusing motion vectors for corresponding luminance blocks. A motion vector may have greater precision for chrominance blocks than luminance blocks, due to downsampling of chrominance blocks relative to corresponding luminance blocks. The video coding unit may interpolate values for a reference chrominance block by selecting interpolation filters based on the position of the pixel position pointed to by the motion vector. For example, a luminance motion vector may have one-quarter-pixel precision and a chrominance motion vector may have one-eighth-pixel precision. There may be interpolation filters associated with the quarter-pixel precisions. The video coding unit may use interpolation filters either corresponding to the pixel position or neighboring pixel positions to interpolate a value for the pixel position pointed to by the motion vector. | 08-18-2011 |
20110200109 | FIXED POINT IMPLEMENTATION FOR GEOMETRIC MOTION PARTITIONING - In one example, an apparatus includes a video encoder configured to partition a block of video data into a first partition and a second partition using a geometric motion partition line, calculate a slope value and a y-intercept value of the geometric motion partition line, wherein the slope value and the y-intercept value comprise integer values, calculate a mask indicative of pixels of the block in the first partition and pixels of the block in the second partition, encode the first partition and the second partition based on the mask, and output the encoded first partition, the encoded second partition, the slope value, and the y-intercept value. This may allow for a fixed point implementation. A video decoder may receive the slope and y-intercept values to calculate the mask and decode the block based on the mask. | 08-18-2011 |
20110200110 | SMOOTHING OVERLAPPED REGIONS RESULTING FROM GEOMETRIC MOTION PARTITIONING - In one example, an apparatus includes a video encoder configured to partition a block of video data into a first partition and a second partition using a geometric motion partition line, calculate a prediction value of a pixel in a transition region of the block using a filter that applies a value for at least one neighboring pixel from the first partition and a value for at least one neighboring pixel from the second partition, calculate a residual value of the pixel in the transition region of the block based on the prediction value of the pixel in the transition region, and output the residual value of the pixel. In one example, a video decoder may use a similar filter to decode an the encoded block after receiving the residual value for the encoded block, and using a definition of the geometric motion partition line. | 08-18-2011 |
20110200111 | ENCODING MOTION VECTORS FOR GEOMETRIC MOTION PARTITIONING - In one example, an apparatus includes a video encoder configured to partition a block of video data into a first partition and a second partition using a geometric motion partition line, determine a first motion vector for the first partition and a second motion vector for the second partition, encode the first motion vector based on a first motion predictor selected from motion vectors for blocks neighboring the first partition, encode the second motion vector based on a second motion predictor selected from motion vectors for blocks neighboring the second partition, wherein the blocks neighboring the second partition are determined independently of the blocks neighboring the first partition, and output the encoded first and second motion vectors. A video decoder may similarly decode the motion vectors based on determining the first and second motion predictors for the first and second partitions. | 08-18-2011 |
20110206125 | ADAPTIVE MOTION RESOLUTION FOR VIDEO CODING - A video encoder may encode video data by adaptively selecting between one-eighth-pixel and one-quarter-pixel precision motion vectors, and signal the selected precision. In one example, an apparatus includes a video encoder to encode a block of video data using a one-eighth-pixel precision motion vector when use of the one-eighth-pixel precision motion vector is determined to be preferable for the block over a one-quarter-pixel precision motion vector, and to generate a signal value indicative of the use of the one-eighth-pixel precision motion vector for the block, and an output interface to output the encoded block and the signal value. A video decoder may be configured to receive the signal value and the encoded block, analyze the signal value to determine whether the block was encoded using one-eighth-pixel precision or one-quarter-pixel precision, and decode the block based on the determination. | 08-25-2011 |
20110249745 | BLOCK AND PARTITION SIGNALING TECHNIQUES FOR VIDEO CODING - A video block syntax element indicates whether all of the partitions of a video block are predicted based on a same reference list and no greater than quarter-pixel accuracy is used. If the video block syntax element is set, partition-level signaling of the reference lists is avoided. If the video block syntax element is not set, partition-level signaling of the reference lists occurs. If the video block syntax element is set, partition-level syntax elements may be used for each of the partitions of the video block, wherein the partition-level syntax elements each identify one of the reference lists and motion vector accuracy for a given one of the partitions. | 10-13-2011 |
20110280311 | ONE-STREAM CODING FOR ASYMMETRIC STEREO VIDEO - An asymmetric frame of a coded video bitstream may include a first resolution picture of a left view and a reduced resolution picture of a right view, where the left and right views form a stereo view pair for three-dimensional video playback. In addition, the reduced resolution frame may be predicted relative to a picture of the left view. In one example, an apparatus includes a video encoder configured to encode a first picture of a first view of a scene to produce an encoded picture with a first resolution, encode at least a portion of a second picture of a second view of the scene relative to a reference picture of the first view to produce an encoded picture with a reduced resolution relative to the first resolution, and output the encoded first resolution picture and the encoded reduced resolution picture in a common bitstream. | 11-17-2011 |
20120008675 | CODING SYNTAX ELEMENTS FOR ADAPTIVE SCANS OF TRANSFORM COEFFICIENTS FOR VIDEO CODING - In one example, an apparatus for encoding video data includes a video encoder configured to scan a two-dimensional block of transform coefficients to produce a one-dimensional vector of the transform coefficients, determine values indicative of whether the transform coefficients in the one-dimensional vector are significant; and entropy encode at least one of the values using a context model selected based on at least a percentage of significant coefficients in a predetermined number of the values encoded before the at least one of the values. | 01-12-2012 |
20120008682 | VIDEO CODING USING DIRECTIONAL TRANSFORMS - In one example, an apparatus for encoding video data includes a video encoder configured to calculate a residual block for a block of video data based on a predicted block formed using an intra-prediction mode, and transform the residual block using a transform mapped from the intra-prediction mode. In another example, an apparatus includes video encoder configured to receive an indication of a first intra-prediction mode in a first set of intra-prediction modes for a block of video data, determine a second intra-prediction mode from a second set of intra-prediction modes, smaller than the first set of intra-prediction modes, to which the first intra-prediction mode is mapped, determine a directional transform to which the second intra-prediction mode is mapped, and apply the directional transform to residual data of the block. | 01-12-2012 |
20120008683 | SIGNALING SELECTED DIRECTIONAL TRANSFORM FOR VIDEO CODING - In one example, an apparatus for encoding video data includes a video encoder configured to select an intra-prediction mode to use to encode a block of video data, determine whether the block includes a sub-block of a size for which multiple transforms are possible based on the size of the sub-block and the selected intra-prediction mode, when the block includes the sub-block of the size for which multiple transforms are possible based on the size of the sub-block and the selected intra-prediction mode, select one of the multiple possible transforms, transform the sub-block using the selected one of the multiple possible transforms, and provide an indication of the selected one of the multiple possible transforms for the size of the block. | 01-12-2012 |
20120014455 | Variable Localized Bit-Depth Increase for Fixed-Point Transforms in Video Coding - This disclosure describes techniques for mitigating rounding errors in a fixed-point transform associated with video coding by applying a variable localized bit-depth increase at the transform. More specifically, the techniques include selecting a constant value based on a size of a fixed-point transform in a video coding device and applying a variable localized bit-depth increase at the transform with a value equal to the constant value. Applying the variable localized bit-depth increase includes left-shifting a transform input signal by a number of bits equal to the constant value before the fixed-point transform, and right-shifting a transform output signal by a number of bits equal to the constant value after the fixed-point transform. The constant value is selected from a plurality of constant values stored on the video coding device. Each of the constant values is pre-calculated for one of a plurality of different transform sizes supported by the video coding. | 01-19-2012 |
20120027088 | CODING MOTION PREDICTION DIRECTION IN VIDEO CODING - This disclosure relates to techniques for reducing a cost of coding prediction information in video coding. Video blocks in a generalized P/B (GPB) frame are encoded using up to two motion vectors calculated from reference pictures in two separate reference picture lists that are identical. When one of the reference picture lists is preferred over the other reference picture list, the preferred reference picture list may be used for unidirectional prediction, by default. When a GPB frame is enabled such that the first and second reference picture lists are identical, either of the first and second reference picture lists may be used for unidirectional prediction. The techniques include coding one or more syntax elements indicating that a video block is coded using one of the unidirectional prediction mode with respect to a reference picture in a reference picture list and the bidirectional prediction mode using less than two bits. | 02-02-2012 |
20120027089 | CODING MOTION VECTORS IN VIDEO CODING - This disclosure relates to techniques for reducing a cost of coding prediction information in video coding. Video blocks in a generalized P/B (GPB) frame are encoded using up to two motion vectors calculated from reference pictures in two separate reference picture lists that are identical. Video blocks of a GPB frame may, therefore, be encoded using a bidirectional prediction mode with a first motion vector from a reference picture in a first reference picture list and a second motion vector from the same or substantially similar reference picture in a second reference picture list. The techniques include jointly coding the first and second motion vectors for a video block of a GPB frame. The techniques include coding the first motion vector relative to a first motion predictor generated from a motion vector of a neighboring block, and coding the second motion vector relative to the first motion vector. | 02-02-2012 |
20120033037 | Signaling Attributes for Network-Streamed Video Data - In one example, an apparatus includes a processor configured to receive video data for two or more views of a scene, determine horizontal locations of camera perspectives for each of the two or more views, assign view identifiers to the two or more views such that the view identifiers correspond to the relative horizontal locations of the camera perspectives, form a representation comprising a subset of the two or more views, and, in response to a request from a client device, send information indicative of a maximum view identifier and a minimum view identifier for the representation to the client device. | 02-09-2012 |
20120036544 | Signaling Attributes for Network-Streamed Video Data - In one example, an apparatus for signaling information for video data includes a processor configured to receive video data for two or more views of a scene, form a representation comprising a subset of the two or more views, and send, to a client device, as a part of a manifest of the representation, information indicative of a maximum number of views in the representation that can be targeted for output. An apparatus for receiving information for video data may receive the manifest including the information indicating the maximum number of views and request at least a portion of the video data of the representation based at least in part on a maximum number of views that can be output by the apparatus and the information indicative of the maximum number of views in the representation that can be targeted for output. | 02-09-2012 |
20120063515 | Efficient Coding of Video Parameters for Weighted Motion Compensated Prediction in Video Coding - This disclosure relates to techniques for efficient coding of video parameters for weighted motion compensated prediction in video encoding and decoding. A video coding device may code a video block using weighted motion compensated prediction with respect to prediction data generated based on at least one motion vector and video parameter values. The video parameter values may include scale and/or offset parameter values. The techniques reduce signaling overhead by only signaling video parameter values when the motion vector points to a predefined sub-pixel position of a reference block. The techniques include storing a list of predefined sub-pixels associated with the video parameters. When the motion vector points to a sub-pixel position included in the list of predefined sub-pixels, the video coding device may code the video parameter values. The list of predefined sub-pixels may be signaled to a video decoder at a video coding unit or higher level. | 03-15-2012 |
20120082225 | SELECTIVE INDICATION OF TRANSFORM SIZES - During a video encoding process, a video encoder may transform the residual data of a coding unit (CU) using a single transform if the CU only has a single prediction unit (PU). If the CU has multiple PUs, the video encoder may transform the residual data using multiple transforms. The video encoder outputs an indication of the size of a transform used to transform residual data of the CU only when the CU has more than one PU. If a video decoder receives such an indication, the video decoder may reconstruct residual data of the CU using a transform of the indicated size. Otherwise, the video decoder may reconstruct the residual data of the CU using a transform having same size as the CU. | 04-05-2012 |
20120093226 | ADAPTIVE MOTION VECTOR RESOLUTION SIGNALING FOR VIDEO CODING - A video encoder may be configured to adaptively select a sub-pixel precision for motion vectors used to encode video data. The video encoder may further entropy encode an indication of the sub-pixel precision using context adaptive binary arithmetic coding, where the context may correspond to the size of a block of video data for the motion vector. For example, the size may correspond to the depth of a coding unit, the size of a prediction unit of the coding unit, and/or a type for the prediction unit. The video encoder may also interpolate values for one-sixteenth pixel positions of chrominance data using bilinear interpolation. The video encoder may further encode a motion vector difference value for the motion vector using an encoding scheme corresponding to the sub-pixel precision of the motion vector. A video decoder may use similar, reciprocal techniques for decoding the video data. | 04-19-2012 |
20120106649 | JOINT CODING OF SYNTAX ELEMENTS FOR VIDEO CODING - In one example, a video decoder is configured to determine whether a component of a transform unit of a coding unit of video data includes at least one non-zero coefficient based on a codeword for the transform unit, determine whether the transform unit is split into sub-transform units based on the codeword, and decode the transform unit based on the determinations. In another example, a video encoder is configured to determine whether a component of a transform unit of a coding unit of video data includes at least one non-zero coefficient, determine whether the transform unit is split into sub-transform units, select a codeword from a variable length code table, wherein the variable length code table provides an indication that the codeword corresponds to the determinations, and provide the codeword for the transform unit. | 05-03-2012 |
20120147961 | USE OF MOTION VECTORS IN EVALUATING GEOMETRIC PARTITIONING MODES - During a video encoding process, rectangular prediction units (PUs) for a coding unit (CU) are generated. Furthermore, a geometric partitioning mode is used to generate a first and a second geometric PU for the CU. The first and second geometric PUs are associated with different geometric partitions of a sample block of the CU. One of the rectangular PUs is identified as overlapping the first geometric PU. The motion vector of the identified rectangular PU is used to identify a given area of a reference frame. The given area of a reference frame is then used as a starting point of a search to identify a reference sample for the first geometric PU. A motion vector for the first geometric PU indicates a position of the reference sample relative to a position of the first geometric PU. A prediction block is generated using the motion vector for first geometric PU. | 06-14-2012 |
20120147967 | ADAPTIVE SUPPORT FOR INTERPOLATING VALUES OF SUB-PIXELS FOR VIDEO CODING - This disclosure describes techniques for calculating values of sub-integer pixels applied by an encoder and a decoder to encode blocks of video data. In one example, a video encoder is configured to receive values for a full integer pixel positions of a reference sample, apply an interpolation filter to a first set of the values for the full integer pixel positions to calculate a value for a first sub-integer pixel of one of the full integer pixel positions, apply the interpolation filter to a second, different set of the values for the full integer pixel positions to calculate a value for a second, different sub-integer pixel of the one of the full integer pixel positions, encode a current block of pixels using a motion vector that points to one of the first sub-integer pixel and the second sub-integer pixel. | 06-14-2012 |
20120170648 | FRAME SPLITTING IN VIDEO CODING - In one example, this disclosure describes a method of decoding a frame of video data comprising a plurality of block-sized coding units including one or more largest coding units (LCUs) that include a hierarchically arranged plurality of relatively smaller coding units. In this example, the method includes determining a granularity at which the hierarchically arranged plurality of smaller coding units has been split when forming independently decodable portions of the frame. The method also includes identifying an LCU that has been split into a first section and a second section using the determined granularity. The method also includes decoding an independently decodable portion of the frame that includes the first section of the LCU without the second section of the LCU. | 07-05-2012 |
20120170649 | VIDEO CODING USING MAPPED TRANSFORMS AND SCANNING MODES - A video encoder may transform residual data by using a transform selected from a group of transforms. The transform is applied to the residual data to create a two-dimensional array of transform coefficients. A scanning mode is selected to scan the transform coefficients in the two-dimensional array into a one-dimensional array of transform coefficients. The combination of transform and scanning mode may be selected from a subset of combinations that is based on an intra-prediction mode. The scanning mode may also be selected based on the transform used to create the two-dimensional array. The transforms and/or scanning modes used may be signaled to a video decoder. | 07-05-2012 |
20120177114 | ADAPTIVELY PERFORMING SMOOTHING OPERATIONS - During a video coding process, a prediction block is generated for a CU. The CU has two or more prediction units. A computing device determines, based on sizes of one or more of the prediction units, whether to perform a smoothing operation on samples in a transition zone of the prediction block. The transition zone is located at a boundary between samples of the prediction block associated with different prediction units. If the computing device makes the determination to perform the smoothing operation, the smoothing operation is performed to smooth samples of the prediction block in the transition zone. | 07-12-2012 |
20120177120 | IDENTIFICATION OF SAMPLES IN A TRANSITION ZONE - During a video encoding or decoding process, a predicted prediction block is generated for a CU. The CU may have two or more prediction units (PUs). A computing device selects a neighbor region size. After the computing device selects the neighbor region size, samples in a transition zone of the prediction block are identified. Samples associated with a first PU are in the transition zone if neighbor regions that contain the samples also contain samples associated with a second PU. Samples associated with the second PU may be in the transition zone if neighbor regions that contain the samples also contain samples associated with the first PU. The neighbor regions have the selected neighbor region size. A smoothing operation is then performed on the samples in the transition zone. | 07-12-2012 |
20120189053 | COMBINED REFERENCE PICTURE LIST CONSTRUCTION FOR VIDEO CODING - This disclosure relates to techniques for constructing a combined reference picture list, List C, based on List | 07-26-2012 |
20120189055 | MOTION VECTOR PREDICTION - A method of coding video data includes determining a candidate motion vector for each of one or more candidate portions of a video frame and determining a current motion vector for a current portion of a current frame. The current motion vector identifies a portion of a reference frame that at least partially matches the current portion of the current frame. The method also includes calculating a motion vector difference between the current motion vector and each of the candidate motion vectors, selecting one of the candidate motion vectors based on the calculated motion vector differences, signaling an index identifying the candidate portion having the selected one of the candidate motion vectors, and signaling the corresponding motion vector difference calculated with respect to the selected one of the candidate motion vectors. | 07-26-2012 |
20120189058 | SINGLE REFERENCE PICTURE LIST CONSTRUCTION FOR VIDEO CODING - The example techniques described in this disclosure provide for an efficient manner to encode or decode a video block of a picture using a single reference picture list. The single reference picture list may include identifiers for reference picture or pictures used to encode or decode the video block. In some examples, a video encoder or decoder may encode or decode a video block that is predicted from two reference pictures using the single reference picture list, and encode or decode a video block that is predicted from one reference picture using the same, single reference picture list. | 07-26-2012 |
20120195368 | PERFORMING MOTION VECTOR PREDICTION FOR VIDEO CODING - In general, techniques are described for performing motion vector prediction for video coding. An apparatus comprising a motion compensation unit may implement the techniques. The motion compensation unit determines spatial candidate motion vectors (MVPs) associated with a current portion of a video frame and prunes the spatial candidate motion vectors to remove duplicates without removing a temporal candidate motion vector. The motion compensation unit selects one of the temporal candidate motion vector or one of the spatial candidate motion vectors remaining after pruning as a selected candidate motion vector based on a motion vector predictor (MVP) index signaled in a bitstream and performs motion compensation based on the selected candidate motion vector. | 08-02-2012 |
20120219060 | SYSTEM AND METHOD FOR SCALABLE ENCODING AND DECODING OF MULTIMEDIA DATA USING MULTIPLE LAYERS - A method of using a base layer to predict an enhancement layer is disclosed. The method may include using a block of multimedia data to generate a base residual including base quantized coefficients, using the block of multimedia data to generate an enhancement residual including enhancement quantized coefficients, determining a first value based on the base quantized coefficients, determining a second value based on the enhancement quantized coefficients, and determining the enhancement layer using at least one of the base quantized coefficients or the enhancement quantized coefficients. A method of decoding a multimedia bitstream may include receiving a multimedia bitstream having a base layer and an enhancement layer and decoding the base layer to determine whether the enhancement layer should be decoded using intralayer prediction or interlayer prediction. | 08-30-2012 |
20120230421 | TRANSFORMS IN VIDEO CODING - Aspects of this disclosure relate to a method of coding video data. In an example, the method includes determining a first residual quadtree (RQT) depth at which to apply a first transform to luma information associated with a block of video data, wherein the RQT represents a manner in which transforms are applied to luma information and chroma information. The method also includes determining a second RQT depth at which to apply a second transform to the chroma information associated with the block of video data, wherein the second RQT depth is different than the first RQT depth. The method also includes coding the luma information at the first RQT depth and the chroma information at the second RQT depth. | 09-13-2012 |
20120230433 | VIDEO CODING TECHNIQUES FOR CODING DEPENDENT PICTURES AFTER RANDOM ACCESS - In general, this disclosure describes techniques for coding video data for random access. In particular, this disclosure proposes to code a syntax element that indicates if a dependent picture may be successfully decoded in the event of a random access request to a clean decoding refresh (CDR) picture and may be required for decoding the pictures following the clean decoding refresh (CDR) picture in display order. | 09-13-2012 |
20120269270 | MOTION VECTOR PREDICTION IN VIDEO CODING - Aspects of this disclosure relate to, in an example, a method that includes identifying a first block of video data in a first temporal location from a first view, wherein the first block is associated with a first disparity motion vector. The method also includes determining a motion vector predictor for a second motion vector associated with a second block of video data, wherein the motion vector predictor is based on the first disparity motion vector. When the second motion vector comprises a disparity motion vector, the method includes determining the motion vector predictor comprises scaling the first disparity motion vector to generate a scaled motion vector predictor, wherein scaling the first disparity motion vector comprises applying a scaling factor comprising a view distance of the second disparity motion vector divided by a view distance of the first motion vector to the first disparity motion vector. | 10-25-2012 |
20120269271 | MOTION VECTOR PREDICTION IN VIDEO CODING - Aspects of this disclosure relate to a method of coding video data. In an example, the method includes identifying a first block of video data in a first temporal location from a first view, wherein the first block of video data is associated with a first temporal motion vector. The method also includes determining, when a second motion vector associated with a second block of video data comprises a temporal motion vector and the second block is from a second view, a motion vector predictor for the second motion vector based on the first temporal motion vector. The method also includes coding prediction data for the second block using the motion vector predictor. | 10-25-2012 |
20120294360 | CHANNEL SWITCH FRAME - Methods and apparatus to process multimedia data enabling faster channel acquisitions, improved error recovery and improved efficiency. An encoder device encodes a first portion of multimedia data using inter-coding to generate a first version, and encodes the first portion of multimedia data using intra-coding to generate a second version. A decoder device receives a first version of a first portion of multimedia data, wherein the first version is inter-coded, receives a second version of the first portion of multimedia data, wherein the second version is intra-coded, and selectively decodes the first and second received versions. | 11-22-2012 |
20120314026 | INTERNAL BIT DEPTH INCREASE IN VIDEO CODING - In an example aspects of this disclosure generally relate to a method of coding video data that includes determining a first bit depth for outputting video data and a second bit depth for coding the video data, wherein the first bit depth is less than the second bit depth. The method also includes determining whether the video data will be used as reference data when coding other video data. The method also includes storing, based on the determination, the video data at the first bit depth when the video data is not used as reference data, and the video data at the second bit depth when the video data is used as reference data. | 12-13-2012 |
20120314767 | BORDER PIXEL PADDING FOR INTRA PREDICTION IN VIDEO CODING - A video coder performs a padding operation that processes a set of border pixels according to an order. The order starts at a bottom-left border pixel and proceeds through the border pixels sequentially to a top-right border pixel. When the padding operation processes an unavailable border pixel, the padding operation predicts a value of the unavailable border pixel based on a value of a border pixel previously processed by the padding operation. The video coder may generate an intra-predicted video block based on the border pixels. | 12-13-2012 |
20130003824 | APPLYING NON-SQUARE TRANSFORMS TO VIDEO DATA - In one example, a device for coding video data includes a video coder, such as a video encoder or a video decoder, that is configured to code information indicative of whether a transform unit of the video data is square or non-square, and code data of the transform unit based at least in part on whether the transform unit is square or non-square. In this manner, the video coder may utilize non-square transform units. The video coder may be configured to use non-square transform units for certain situations, such as only for chrominance or luminance components or only when a corresponding prediction unit is non-square. The video coder may further be configured to perform an entropy coding process that selects context for coding data of the transform unit based on whether the transform unit is square or non-square. | 01-03-2013 |
20130003849 | VIDEO CODING USING ADAPTIVE MOTION VECTOR RESOLUTION - In one example, a device for coding video data includes a video coder configured to configured to code information representative of whether an absolute value of an x-component of a motion vector difference value for a current block of video data is greater than zero, code information representative of whether an absolute value of a y-component of the motion vector difference value is greater than zero, when the absolute value of the x-component is greater than zero, code information representative of the absolute value of the x-component, when the absolute value of the y-component is greater than zero, code information representative of the absolute value of the y-component, when the absolute value of the x-component is greater than zero, code a sign of the x-component, and when the absolute value of the y-component is greater than zero, code a sign of the y-component. | 01-03-2013 |
20130010865 | REDUCED RESOLUTION PIXEL INTERPOLATION - An offset can be applied to intermediate values obtained while performing an interpolation filtering operation such that applying the offset reduces the bitdepth of the intermediate value. The intermediate value can be stored with the reduced bitdepth, and when retrieved, the offset can be added back such that future calculation can be performed using the intermediate value with the original bitdepth. | 01-10-2013 |
20130077684 | Signaling of prediction size unit in accordance with video coding - Signaling of prediction size unit in accordance with video coding. In accordance with video coding, various binarization may be performed. In accordance with coding related to different types of slices (e.g., I, P, B slices), one or more binary trees may be employed for performing various respective operations (e.g., coding unit | 03-28-2013 |
20130077697 | Adaptive loop filtering in accordance with video coding - Adaptive loop filtering in accordance with video coding. An adaptive loop filter (ALF) and/or other in-loop filters (e.g., sample adaptive offset (SAO) filter, etc.) may be implemented within various video coding architectures (e.g., encoding and/or decoding architectures) to perform both offset and scaling processing, only scaling processing, and/or only offset processing. Operation of such an ALF may be selective in accordance with any of multiple respective operational modes at any given time and may be adaptive based upon various consideration(s) (e.g., desired complexity level, processing type, local and/or remote operational conditions, etc.). For example, an ALF may be applied to a decoded picture before it is stored in a picture buffer (or digital teacher buffer (DPB)). An ALF can provide for coding noise reduction of a decoded picture, and the filtering operations performed thereby may be selective (e.g., on a slice by slice basis, block by block basis, etc.). | 03-28-2013 |
20130094580 | DETECTING AVAILABILITIES OF NEIGHBORING VIDEO UNITS FOR VIDEO CODING - As part of a video encoding or decoding operation on video data, a video coder performs a coding operation for a current video unit of the video data. As part of performing the coding operation for the current video unit, the video coder determines the availabilities of one or more video units that neighbor the current video unit. In order to determine the availability of a video unit that neighbors the current video unit, the video coder identifies, based on availabilities of video units that neighbor a parent video unit of the current video unit, an entry in a lookup table. The identified entry indicates the availability of the video unit that neighbors the current video unit. The video coder then performs a coding operation on the current video unit based on whether the video unit that neighbors the current video unit is available. | 04-18-2013 |
20130208788 | Sample adaptive offset (SAO) in accordance with video coding - Sample adaptive offset (SAO) in accordance with video coding. SAO filtering may be performed before e-blocking processing (e.g., in accordance with video signal decoding and/or encoding). For example, a receiver and/or decoder communication device may receive signaling from a transmitter and/or encoder communication device that includes various band offsets. Corresponding band indices may be determined via analysis of the received video signal (e.g., received from the transmitter and/or encoder communication device), inferentially without requiring signaling of such band indices from the transmitter and/or encoder communication device. Upon appropriate analysis of one or more largest coding units (LCUs) generated from the video signal to determine a pixel value distribution (e.g., which may be using a histogram in one instance), then based on that pixel value distribution, the band indices are identified and the band offsets applied thereto. | 08-15-2013 |
20130208810 | Frequency domain sample adaptive offset (SAO) - Frequency domain sample adaptive offset (SAO). Video processing of a first signal operates to generate a second video signal such that at least one characteristic of a first portion of video information of the first video signal is replicated in generating a second portion of video information, such that the first portion of video information and the second portion of video information undergo combination to generate the second video signal. Such use of the first video signal may involve replication and scaling of the first video information to generate the second portion of video information. One possible characteristic of the first portion of video information may correspond to an energy profile as a function of frequency. One or more portions of the first video signal may be employed to generate different respective portions of the second signal. Such video processing operations may be performed on a block by block basis. | 08-15-2013 |
20130235926 | Memory efficient video parameter processing - Memory efficient video parameter processing. A communication system including at least two respective devices, namely, a transmitter device and a receiver device, operates with significant reduction in the amount of signaling provided between those respective devices. Such devices may be transceiver devices. Considering such a transmitter device that includes an encoder, such as a video encoder, and a receiver device that includes a decoder, such as a video decoder, and output bitstream corresponding to an encoded video signal may be provided from the transmitter device and received by the receiver device. Such an output bitstream may be generated by a video encoder within the transmitter device and may subsequently undergo appropriate processing by a video decoder within the receiver device. One or more frame-based signals, corresponding respectively to the number of blocks, may be communicated as being respectively limited to at most one step of recursion among the various blocks. | 09-12-2013 |
20130272408 | MOTION VECTOR PREDICTION IN VIDEO CODING - Aspects of this disclosure relate to, in an example, a method that includes identifying a first block of video data in a first temporal location from a first view, wherein the first block is associated with a first disparity motion vector. The method also includes determining a motion vector predictor for a second motion vector associated with a second block of video data, wherein the motion vector predictor is based on the first disparity motion vector. When the second motion vector comprises a disparity motion vector, the method includes determining the motion vector predictor comprises scaling the first disparity motion vector to generate a scaled motion vector predictor, wherein scaling the first disparity motion vector comprises applying a scaling factor comprising a view distance of the second disparity motion vector divided by a view distance of the first motion vector to the first disparity motion vector. | 10-17-2013 |
20130308701 | VIDEO CODING WITH LARGE MACROBLOCKS - Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels. In one example, an apparatus includes a video encoder configured to encode a coded unit comprising a plurality of video blocks, wherein at least one of the plurality of video blocks comprises a size of more than 16×16 pixels and to generate syntax information for the coded unit that includes a maximum size value, wherein the maximum size value indicates a size of a largest one of the plurality of video blocks in the coded unit. The syntax information may also include a minimum size value. In this manner, the encoder may indicate to a decoder the proper syntax decoder to apply to the coded unit. | 11-21-2013 |
20130308707 | METHODS AND DEVICE FOR DATA ALIGNMENT WITH TIME DOMAIN BOUNDARY - Apparatus and methods of using content information for encoding multimedia data are described. A method of processing multimedia data includes obtaining content information of multimedia data, and encoding the multimedia data so as to align a data boundary with a frame boundary in a time domain, wherein said encoding is based on the content information. In another aspect, a method of processing multimedia data includes obtaining a content classification of the multimedia data, and encoding blocks in the multimedia data as intra-coded blocks or inter-coded blocks based on the content classification to increase the error resilience of the encoded multimedia data. Apparatus that can process multimedia data described in these methods are also disclosed. | 11-21-2013 |
20130343447 | Adaptive loop filter (ALF) padding in accordance with video coding - Adaptive loop filter (ALF) padding in accordance with video coding. Various types of video processing are performed including performing virtual padding. When a filter coefficients collocated pixel is not available, that pixel may be replaced using an available pixel within a given location within a filter to process a number of pixels. For example, an available pixel located within the center of such a filter (e.g., which may be a cross shaped filter including a predetermined number of pixels, such as 18 pixels in one instance) may be used to replace those pixel locations which are not available in accordance with such virtual padding. With respect to the implementation of such an adaptive loop filter (ALF), such an ALF may be implemented to process a signal output from a de-blocking filter, from a sample adaptive offset (SAO) filter, and/or from a combined de-blocking/SAO filter in various implementations. | 12-26-2013 |
20140003503 | VIDEO CODING SYSTEM | 01-02-2014 |
20140019830 | Joint application-layer forward error/erasure correction (FEC) and video coding - Layered and scalable coding scheme is applied to one or more communication pathways between a transmitter and one or more receivers. Forward error/erasure correction (FEC) is applied for application layer erasure recovery. Additional FEC may also be employed at the physical layer (PHY) layer or channel coding layer for additional error correction capability and to provide joint application and PHY layer FEC coding. Source information (e.g., data, media such as image, video or audio, etc., or any other type of information) is encoded using two or more layers. These layers may include a base layer and one or more enhancement layers that, when combined with the base layer, modify the quality of the base layer. In a packet-based application, transmission of redundancy packets may be separately time-limited in the two or more layers. Also, adaptation (of signaling, FEC, etc.) may be made based on operating condition changes. | 01-16-2014 |
20140050265 | VIDEO CODING BASED ON FIRST ORDER PREDICTION AND PRE-DEFINED SECOND ORDER PREDICTION MODE - This disclosure describes video encoding and decoding techniques in which a first order prediction process and a second order prediction process are used in combination to generate predictive video blocks for video coding. First order prediction may be similar to conventional motion estimation and motion compensation that generates residual video blocks. The second order prediction may involve a process similar to conventional intra-prediction, but is performed on the residual video blocks. The techniques of this disclosure may pre-define the second order prediction to a specific mode, such as a mode similar to the intra-DC mode used in intra coding. In addition, the techniques of this disclosure may combine aspects of the first order and second order prediction into a single process so that the effects of second order prediction on the residuals are taken into account during the first order prediction process, which may improve compression. | 02-20-2014 |
20140341297 | VIDEO CODING USING ADAPTIVE MOTION VECTOR RESOLUTION - In one example, a device for coding video data includes a video coder configured to configured to code information representative of whether an absolute value of an x-component of a motion vector difference value for a current block of video data is greater than zero, code information representative of whether an absolute value of a y-component of the motion vector difference value is greater than zero, when the absolute value of the x-component is greater than zero, code information representative of the absolute value of the x-component, when the absolute value of the y-component is greater than zero, code information representative of the absolute value of the y-component, when the absolute value of the x-component is greater than zero, code a sign of the x-component, and when the absolute value of the y-component is greater than zero, code a sign of the y-component. | 11-20-2014 |