Patent application number | Description | Published |
20080273810 | VIDEO CODING MODE SELECTION USING ESTIMATED CODING COSTS - This disclosure describes techniques for coding mode selection using estimated coding costs. To provide high compression efficiency, for example, an encoding device may attempt to select a coding mode for coding blocks of pixels that codes the data of the blocks with high efficiency. To this end, the encoding device may perform coding mode selection based on estimates of coding cost for at least a portion of the possible modes. In accordance with the techniques described herein, the encoding device estimates the coding cost for the different modes without actually coding the blocks. In fact, in some aspects, the encoding module device may estimate the coding cost for the modes without quantizing the data of the block for each mode. In this manner, the coding cost estimation techniques of this disclosure reduce the amount of computationally intensive calculations needed to perform effective mode selection. | 11-06-2008 |
20100020886 | SCALABILITY TECHNIQUES BASED ON CONTENT INFORMATION - Apparatus and methods of using content information for encoding multimedia data are described. A method of processing multimedia data includes classifying content of multimedia data, and encoding the multimedia data in a first data group and in a second data group based on the content classification. The first and second groups are associated with quality levels. A user can request a target quality level. | 01-28-2010 |
20110080955 | METHOD AND APPARATUS FOR MOTION VECTOR PROCESSING - A method for processing a plurality of motion vectors is disclosed. The method includes determining a number of different block sizes in the video frame; and, performing a variable block size motion vector process if the number of different block sizes in the video frame is greater than one, the variable block size motion vector process comprising constructing a pyramid of motion vectors from the plurality of motion vectors, the pyramid having at least a first layer and a second layer of motion vectors, each of the first and second layers having a set of motion vectors based on a particular block size. An apparatus for performing the inventive method is also disclosed. | 04-07-2011 |
20120057629 | Rho-domain Metrics - Video encoders, systems and methods are described that characterize video encoding processes using a ρ-domain deviation metric. The deviation metric represents a weighted difference between actual non-zero coefficients and the expected non-zero coefficients, the actual and expected coefficients corresponding to quantization of a macroblock in a video frame during video encoding of the frame. The deviation metric is used to adjust the video encoding process to obtain an optimized encoding bit rate for a desired video encoding quality by selecting a quantizing parameter based on a normalized value of the deviation metric. The quantizing parameter can be selected from a table indexed using the deviation metric. | 03-08-2012 |
20120057633 | Video Classification Systems and Methods - Video encoder systems and methods are described that employ table-based content classification. One or more tables relate quantization parameters and P-points for a frame of video that typically comprises macroblocks. A deviation representative of a difference between original and decoded versions of a macroblock is determined, the deviation being further representative of a distribution frequency of the value of a distortion for a P-point. The P-point corresponds to a distortion value that is associated with a minimum rate difference between encoding modes for a macroblock. A motion complexity index is updated using a quantization parameter and non-zero coefficients of the encoded frame. An encoding mode for the macroblock can be retrieved from the tables using the motion complexity index to reference mode information maintained in the tables. | 03-08-2012 |
20120057634 | Systems and Methods for Video Content Analysis - Video analytics systems and methods are described that typically comprise a video encoder operable to generate macroblock video analytics metadata (VAMD) from a video frame. Functional modules receive the VAMD and an encoded version of the video frame is configured to generate video analytics information related to the frame using the VAMD and the encoded video frame. The downstream decoder can use the VAMD to obtain a global motion vector related to the frame, detect and track motion of an object within the frame and monitor a line provided or found within the frame. Traversals of the line by a moving object can be detected and counted using information in the VAMD and the line may be part of a polygon that delineates an area to be monitored within the encoded frame. The VAMD can comprise macroblock level and video frame level information. | 03-08-2012 |
20120057640 | Video Analytics for Security Systems and Methods - Video processing, encoding and decoding systems are described. A processor receives video frames representative of a sequence of images captured by a video sensor and the video frames are encode according to a desired video encoding standard. A video analytics processor receives video analytics metadata generated by the video encoder from the sequence of images and produces video analytics messages for transmission to a client device which performs client side video analytics processing. The video analytics metadata may comprise pixel domain video analytics information directly from an analog-to-digital front end or directly from an encoding engine as the engine is performing compression. | 03-08-2012 |
20120294360 | CHANNEL SWITCH FRAME - Methods and apparatus to process multimedia data enabling faster channel acquisitions, improved error recovery and improved efficiency. An encoder device encodes a first portion of multimedia data using inter-coding to generate a first version, and encodes the first portion of multimedia data using intra-coding to generate a second version. A decoder device receives a first version of a first portion of multimedia data, wherein the first version is inter-coded, receives a second version of the first portion of multimedia data, wherein the second version is intra-coded, and selectively decodes the first and second received versions. | 11-22-2012 |
20130188742 | METHOD AND APPARATUS FOR ENCODER ASSISTED-FRAME RATE UP CONVERSION (EA-FRUC) FOR VIDEO COMPRESSION - An Encoder Assisted Frame Rate Up Conversion (EA-FRUC) system that utilizes video coding and pre-processing operations at the video encoder to exploit the FRUC processing that will occur in the decoder in order to improve compression efficiency and reconstructed video quality is disclosed. One operation of the EA-FRUC system involves determining whether to encode a frame in a sequence of frames of a video content by determining a spatial activity in a frame of the sequence of frames; determining a temporal activity in the frame; determining a spatio-temporal activity in the frame based on the determined spatial activity and the determined temporal activity; determining a level of a redundancy in the source frame based on at least one of the determined spatial activity, the determined temporal activity, and the determined spatio-temporal activity; and, encoding the non-redundant information in the frame if the determined redundancy is within predetermined thresholds. | 07-25-2013 |
20130308707 | METHODS AND DEVICE FOR DATA ALIGNMENT WITH TIME DOMAIN BOUNDARY - Apparatus and methods of using content information for encoding multimedia data are described. A method of processing multimedia data includes obtaining content information of multimedia data, and encoding the multimedia data so as to align a data boundary with a frame boundary in a time domain, wherein said encoding is based on the content information. In another aspect, a method of processing multimedia data includes obtaining a content classification of the multimedia data, and encoding blocks in the multimedia data as intra-coded blocks or inter-coded blocks based on the content classification to increase the error resilience of the encoded multimedia data. Apparatus that can process multimedia data described in these methods are also disclosed. | 11-21-2013 |
20140369417 | SYSTEMS AND METHODS FOR VIDEO CONTENT ANALYSIS - Video analytics systems and methods are described that typically comprise a video encoder operable to generate macroblock video analytics metadata (VAMD) from a video frame. Functional modules receive the VAMD and an encoded version of the video frame is configured to generate video analytics information related to the frame using the VAMD and the encoded video frame. The downstream decoder can use the VAMD to obtain a global motion vector related to the frame, detect and track motion of an object within the frame and monitor a line provided or found within the frame. Traversals of the line by a moving object can be detected and counted using information in the VAMD and the line may be part of a polygon that delineates an area to be monitored within the encoded frame. The VAMD can comprise macroblock level and video frame level information. | 12-18-2014 |