Patent application number | Description | Published |
20080226256 | Systems and methods of providing modified media content - A method of providing modified media content is disclosed that includes providing media content to a destination device via a network, where the media content comprises video data and audio data having a first viewing rate. The method further includes receiving data indicating a selection of a second viewing rate via the network and modifying the media content to produce modified media content having approximately the second viewing rate. The modified media content includes modified video data and modified audio data synchronized at approximately the second viewing rate. | 09-18-2008 |
20080232775 | Systems and methods of providing modified media content - In an embodiment, a method of providing modified media content is disclosed and includes receiving media content that includes audio data and video data having a first number of video frames. The method also includes generating abstracted media content that includes portions of the video data and audio elements of the audio data, where the abstracted media content includes less than all of the video data and includes fewer video frames than the first number of video frames. | 09-25-2008 |
20080235741 | Systems and Methods of providing modified media content - A method and system of providing media content is disclosed. In a particular embodiment, the method includes receiving media content from a content source at a set-top box device. The media content includes video data having a first playback rate and audio data having the first playback rate. The method further includes transforming the audio data via a non-linear transformation to produce modified audio data having a second playback rate, modifying the video data to produce modified video data having the second playback rate, and synchronizing the modified audio data and the modified video data to produce modified media content having the second playback rate. A network-based media content storage device and associated logic to provide adjusted rate audio content are also disclosed. | 09-25-2008 |
20090259465 | LOW LATENCY REAL-TIME VOCAL TRACT LENGTH NORMALIZATION - A method and system for training an automatic speech recognition system are provided. The method includes separating training data into speaker specific segments, and for each speaker specific segment, performing the following acts: generating spectral data, selecting a first warping factor and warping the spectral data, and comparing the warped spectral data with a speech model. The method also includes iteratively performing the steps of selecting another warping factor and generating another warped spectral data, comparing the other warped spectral data with the speech model, and if the other warping factor produces a closer match to the speech model, saving the other warping factor as the best warping factor for the speaker specific segment. The system includes modules configured to control a processor in the system to perform the steps of the method. | 10-15-2009 |
20120227078 | Systems and Methods of Providing Modified Media Content - A method includes receiving a command to provide media content configured to be sent to a display device for display at a particular scan rate. The media content includes audio data and video data. The method includes identifying high priority segments of the media content based on the audio data. The high priority segments are to be displayed by the display device at a presentation rate such that the high priority segments displayed at the presentation rate correspond to the media content displayed at the particular scan rate. The method also includes sending the high priority segments to the display device to provide video content and audio content of the requested media content for display. | 09-06-2012 |
20120271635 | SPEECH RECOGNITION BASED ON PRONUNCIATION MODELING - A system and method for performing speech recognition is disclosed. The method comprises receiving an utterance, applying the utterance to a recognizer with a language model having pronunciation probabilities associated with unique word identifiers for words given their pronunciations and presenting a recognition result for the utterance. Recognition improvement is found by moving a pronunciation model from a dictionary to the language model. | 10-25-2012 |
20130051753 | Systems and Methods of Providing Modified Media Content - A method includes processing media content. The media content includes audio data corresponding to a first audio playback rate and video data corresponding to a first video playback rate. Processing the media content includes identifying a speech portion of the audio data. The speech portion includes a consonant portion. The method further includes producing modified media content. The modified media content is produced based on modifying the video data and modifying the audio data. Modifying the audio data includes applying a non-linear transformation to the speech portion identified in the audio data. The method further includes storing the modified media content. | 02-28-2013 |
20130216210 | Systems and Methods of Providing Modified Media Content - In a particular embodiment, a method includes displaying a playback rate slide bar having a plurality of increments. The plurality of increments is equally spaced along the playback rate slide bar and each increment of the plurality of increments corresponds to a different playback rate of media content. | 08-22-2013 |
20150088498 | LOW LATENCY REAL-TIME VOCAL TRACT LENGTH NORMALIZATION - A method and system for training an automatic speech recognition system are provided. The method includes separating training data into speaker specific segments, and for each speaker specific segment, performing the following acts: generating spectral data, selecting a first warping factor and warping the spectral data, and comparing the warped spectral data with a speech model. The method also includes iteratively performing the steps of selecting another warping factor and generating another warped spectral data, comparing the other warped spectral data with the speech model, and if the other warping factor produces a closer match to the speech model, saving the other warping factor as the best warping factor for the speaker specific segment. The system includes modules configured to control a processor in the system to perform the steps of the method. | 03-26-2015 |
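
The warping-factor search described in the vocal tract length normalization abstracts (20090259465 and 20150088498) can be illustrated with a minimal sketch. It is not the patented implementation. The abstracts do not specify the warping function, the form of the speech model, or the match criterion, so the linear frequency scaling in `warp_spectrum`, the single reference spectrum used as the "speech model", and the squared-error `distance` are all assumptions made for illustration, as are every function and variable name.

```python
def warp_spectrum(spectrum, alpha):
    """Resample a spectrum at bin positions scaled by alpha (assumed linear warp)."""
    n = len(spectrum)
    warped = []
    for k in range(n):
        pos = min(k * alpha, n - 1)      # clamp to the last bin
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        # Linear interpolation between the two neighbouring bins.
        warped.append((1 - frac) * spectrum[lo] + frac * spectrum[hi])
    return warped


def distance(a, b):
    """Squared error between two spectra (assumed match criterion)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def best_warping_factor(spectrum, model, factors):
    """Try each candidate factor and keep the one closest to the model."""
    best_alpha = factors[0]
    best_score = distance(warp_spectrum(spectrum, best_alpha), model)
    for alpha in factors[1:]:
        score = distance(warp_spectrum(spectrum, alpha), model)
        if score < best_score:           # closer match: save as the best factor
            best_alpha, best_score = alpha, score
    return best_alpha


def train_warping_factors(segments, model, factors):
    """Map each speaker-specific segment to its best warping factor."""
    return {
        speaker: best_warping_factor(spectrum, model, factors)
        for speaker, spectrum in segments.items()
    }


# Hypothetical toy data: one reference spectrum and two speaker segments.
model = [float(k) for k in range(8)]
segments = {
    "speaker_a": [float(k) for k in range(8)],      # already matches the model
    "speaker_b": [2.0 * k for k in range(8)],       # matches after warping by 0.5
}
result = train_warping_factors(segments, model, [0.5, 0.9, 1.0, 1.1])
```

A real system would presumably score warped features against a statistical acoustic model instead of a single reference spectrum. The loop structure (select a factor, warp, compare, keep the better factor) is the only part taken from the abstract.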