| AT&T Intellectual Property II, L.P. via transfer from AT&T Corp. Patent applications |
| Patent application number | Title | Published |
| 20110181605 | SYSTEM AND METHOD OF CUSTOMIZING ANIMATED ENTITIES FOR USE IN A MULTIMEDIA COMMUNICATION APPLICATION - In an embodiment, a method is provided for creating a personal animated entity for delivering a multi-media message from a sender to a recipient. An image file from the sender may be received by a server. The image file may include an image of an entity. The sender may be requested to provide input with respect to facial features of the image of the entity in preparation for animating the image of the entity. After the sender provides the input with respect to the facial features of the image of the entity, the image of the entity may be presented as a personal animated entity to the sender to preview. Upon approval of the preview from the sender, the image of the entity may be presented as a sender-selectable personal animated entity for delivering the multi-media message to the recipient. | 07-28-2011 |
| 20110103484 | Scalable Video Encoder/Decoder with Drift Control - A system, method and computer-readable media are introduced that relate to data coding and decoding. A computing device encodes received data such as video data into a base layer of compressed video and an enhancement layer of compressed video. The computing device controls drift introduced into the base layer of the compressed video. The computing device, such as a scalable video coder, allows drift by predicting the base layer from the enhancement layer information. The amount of drift is managed to improve overall compression efficiency. | 05-05-2011 |
| 20110072466 | Browsing and Retrieval of Full Broadcast-Quality Video - A method includes steps of indexing a media collection, searching an indexed library and browsing a set of candidate program segments. The step of indexing a media collection creates the indexed library based on the content of the media collection. The step of searching the indexed library identifies the set of candidate program segments based on search criteria. The step of browsing the set of candidate program segments selects a segment for viewing. | 03-24-2011 |
| 20110022379 | On-Demand Language Translation for Television Programs - In an embodiment, a method of providing an on demand translation service is provided. A subscriber may be charged a reduced fee or no fee for use of the on demand translation service in exchange for displaying commercial messages to the subscriber, the commercial messages being selected based on subscriber information. A multimedia signal including information in a source language may be received. The information may be obtained as text in the source language from the multimedia signal. The text may be translated from the source language to a target language. Translated information, based on the translated text, may be transmitted to a processing device for presentation to the subscriber. The received multimedia signal may be sent to a multimedia device for viewing. | 01-27-2011 |
| 20100324893 | SYSTEM AND METHOD FOR IMPROVING ROBUSTNESS OF SPEECH RECOGNITION USING VOCAL TRACT LENGTH NORMALIZATION CODEBOOKS - Disclosed are systems, methods, and computer readable media for performing speech recognition. The method embodiment comprises (1) selecting a codebook from a plurality of codebooks with a minimal acoustic distance to a received speech sample, the plurality of codebooks generated by a process of (a) computing a vocal tract length for each of a plurality of speakers, (b) for each of the plurality of speakers, clustering speech vectors, and (c) creating a codebook for each speaker, the codebook containing entries for the respective speaker's vocal tract length, speech vectors, and an optional vector weight for each speech vector, (2) applying the respective vocal tract length associated with the selected codebook to normalize the received speech sample for use in speech recognition, and (3) recognizing the received speech sample based on the respective vocal tract length associated with the selected codebook. | 12-23-2010 |
| 20100286986 | Methods and Apparatus for Rapid Acoustic Unit Selection From a Large Speech Corpus - A speech synthesis system can select recorded speech fragments, or acoustic units, from a very large database of acoustic units to produce artificial speech. The selected acoustic units are chosen to minimize a combination of target and concatenation costs for a given sentence. However, as concatenation costs, which are measures of the mismatch between sequential pairs of acoustic units, are expensive to compute, processing can be greatly reduced by pre-computing and caching the concatenation costs. Unfortunately, the number of possible sequential pairs of acoustic units makes such caching prohibitive. However, statistical experiments reveal that while about 85% of the acoustic units are typically used in common speech, less than 1% of the possible sequential pairs of acoustic units occur in practice. A method for constructing an efficient concatenation cost database is provided by synthesizing a large body of speech, identifying the acoustic unit sequential pairs generated and their respective concatenation costs, and storing those concatenation costs likely to occur. By constructing a concatenation cost database in this fashion, the processing power required at run-time is greatly reduced with negligible effect on speech quality. | 11-11-2010 |
| 20100272173 | METHOD OF CONTENT ADAPTIVE VIDEO ENCODING - A method of content adaptive video encoding is disclosed. The method comprises segmenting video content into segments based on predefined classifications or models. Examples of such classifications include action scenes, slow scenes, low or high detail scenes, and brightness of the scenes. Based on the segment classifications, each segment is encoded with a different encoder chosen from a plurality of encoders. Each encoder is associated with a model. The chosen encoder is particularly suited to encoding the unique subject matter of the segment. The coded bitstream for each segment includes information regarding which encoder was used to encode that segment. A matching decoder of a plurality of decoders is chosen using the information in the coded bitstream to decode each segment using a decoder suited for the classification or model of the segment. If scenes exist which do not fall in a predefined classification, or where classification is more difficult based on the scene content, these scenes are segmented, coded and decoded using a generic coder and decoder. | 10-28-2010 |
| 20100253703 | Computer Readable Medium for Modifying an Animation Wire Frame - An animation wireframe is modified with three-dimensional (3D) range and color data having a corresponding shape surface. The animation wireframe is vertically scaled based on distances between consecutive features within the 3D range and color data and corresponding distances within the generic animation wireframe. For each animation wireframe point, the location of the animation wireframe point is adjusted to coincide with a point on the shape surface. The shape surface point lies along a scaling line connecting the animation wireframe point, the shape surface point and an origin point. The scaling line is within a horizontal plane. | 10-07-2010 |
| 20100242072 | NETWORK-BASED SERVICE TO PROVIDE ON-DEMAND VIDEO SUMMARIES OF TELEVISION PROGRAMS - A network-based device allows customers to receive television programming and to view summaries of the programming. A method of providing the summaries comprises: dividing a received program into program segments each identified by index marks, summarizing each program segment into summary segments identified by similar index marks, generating metadata files for delimiting a beginning and an end of summary segments and program segments, storing the summary segments and accessing the summary segments to supply the summary segments in lieu of program segments upon demand. | 09-23-2010 |
| 20100241430 | SYSTEM AND METHOD FOR USING META-DATA DEPENDENT LANGUAGE MODELING FOR AUTOMATIC SPEECH RECOGNITION - Disclosed are systems and methods for providing a spoken dialog system using meta-data to build language models to improve speech processing. Meta-data is generally defined as data outside received speech; for example, meta-data may be a customer profile having a name, address and purchase history of a caller to a spoken dialog system. The method comprises building tree clusters from meta-data and estimating a language model using the built tree clusters. The language model may be used by various modules in the spoken dialog system, such as the automatic speech recognition module and/or the dialog management module. Building the tree clusters from the meta-data may involve generating projections from the meta-data and further may comprise computing counts as a result of unigram tree clustering and then building both unigram trees and higher-order trees from the meta-data as well as computing node distances within the built trees that are used for estimating the language model. | 09-23-2010 |
| 20100241420 | AUTOMATED SENTENCE PLANNING IN A TASK CLASSIFICATION SYSTEM - The invention relates to a system that interacts with a user in an automated dialog system ( | 09-23-2010 |
| 20100223354 | Method for Creating and Providing Layered Syndicated Data for Multimedia Content to Users - A layered syndicated feed or content is provided having layers defined according to a media type (e.g., text, graphic, audio, or video) wherein each layer has a plurality of levels such that each level provides either inclusive or exclusive content of the layer's media type in varying degrees of detail or depth. Furthermore, a system is provided for requesting and storing user and mobile device profiles indicative of the syndicated content layers and levels that the user is interested in receiving and that the user's mobile device is capable of receiving. | 09-02-2010 |
| 20100217580 | On-Demand Language Translation for Television Programs - A method, a system and a machine-readable medium are provided for an on demand translation service. A translation module including at least one language pair module for translating a source language to a target language may be made available for use by a subscriber. The subscriber may be charged a fee for use of the requested on demand translation service or may be provided use of the on demand translation service for free in exchange for displaying commercial messages to the subscriber. A video signal may be received including information in the source language, which may be obtained as text from the video signal and may be translated from the source language to the target language by use of the translation module. Translated information, based on the translated text, may be added into the received video signal. The video signal including the translated information in the target language may be sent to a display device. | 08-26-2010 |
| 20100215097 | System, Method and Computer-Readable Medium for Encoding a Signal into Macroblocks - A quantizer and dequantizer for use in a video coding system that applies non linear, piece-wise linear scaling functions to video information signals based on a value of a variable quantization parameter. The quantizer and dequantizer apply different non linear, piece-wise linear scaling functions to a DC luminance signal, a DC chrominance signal and an AC chrominance signal. A code for reporting updates of the value of the quantization parameter is interpreted to require larger changes when the quantization parameter initially is large and smaller changes when the quantization parameter initially is small. | 08-26-2010 |
| 20100211396 | System and Method for Speech Recognition System - A digital speech enabled middleware module is disclosed that facilitates interaction between a large number of client devices and network-based automatic speech recognition (ASR) resources. The module buffers feature vectors associated with speech received from the client devices when the number of client devices is greater than the available ASR resources. When an ASR decoder becomes available, the module transmits the feature vectors to the ASR decoder and a recognition result is returned. | 08-19-2010 |