Avery Li Chun Wang, Palo Alto, CA US

Patent application number | Description | Published
20080208891 | SYSTEM AND METHODS FOR RECOGNIZING SOUND AND MUSIC SIGNALS IN HIGH NOISE AND DISTORTION - A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproducible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample. The method can be used for any type of sound or music, and is particularly effective for audio signals subject to linear and nonlinear distortion such as background noise, compression artifacts, or transmission dropouts. The sample can be identified in a time proportional to the logarithm of the number of entries in the database; given sufficient computational power, recognition can be performed in nearly real time as the sound is being sampled. | 08-28-2008
20090012849 | METHOD AND SYSTEM FOR INTERACTING WITH A USER IN AN EXPERIENTIAL ENVIRONMENT - A method and system provide a user with an ability to capture a sample of an experiential environment and deliver that sample to an interactive service to trigger one or more predetermined events. In exemplary embodiments of the invention, such triggered events include the delivery of information and services to the user, the execution of tasks and instructions by the service on the user's behalf, communication events, surveillance events, and other control-oriented events that are responsive to the user's wishes. In other exemplary embodiments of the invention, the triggered events include transaction-oriented events, entertainment events, and events associated with enhancements to human ability or function. | 01-08-2009
20090265174 | ROBUST AND INVARIANT AUDIO PATTERN MATCHING - The present invention provides an innovative technique that rapidly and accurately determines whether two audio samples match and is immune to various kinds of transformations, such as playback speed variation. The relationship between the two audio samples is characterized by first matching certain fingerprint objects derived from the respective samples. A set ( | 10-22-2009
20110071838 | SYSTEM AND METHODS FOR RECOGNIZING SOUND AND MUSIC SIGNALS IN HIGH NOISE AND DISTORTION - A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproducible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample. The method can be used for any type of sound or music, and is particularly effective for audio signals subject to linear and nonlinear distortion such as background noise, compression artifacts, or transmission dropouts. The sample can be identified in a time proportional to the logarithm of the number of entries in the database; given sufficient computational power, recognition can be performed in nearly real time as the sound is being sampled. | 03-24-2011
20110099197 | Method and System for Identification of Distributed Broadcast Content - A method and system of performing high-throughput identifications of broadcast content is provided. A device can send a content identification query, which includes a sample of content being broadcast, to a server to request an identity of the content. The server will perform a computational identification of the content, return the result to the device, and store the result. For all subsequently received content identification queries requesting an identity of content being broadcast from the same source and in a time during which the content is still being broadcast from the source, the server will send the stored content identification in response to the subsequent queries. If a subsequent content identification query does not request the identity of content being broadcast from the same source or is not received during the time that the content is still being broadcast, the server will perform a computational identification of a content sample. | 04-28-2011
20110244784 | METHOD AND APPARATUS FOR IDENTIFICATION OF BROADCAST SOURCE | 10-06-2011
20110273455 | Systems and Methods of Rendering a Textual Animation - Systems and methods of rendering a textual animation are provided. The methods include receiving an audio sample of an audio signal that is being rendered by a media rendering source. The methods also include receiving one or more descriptors for the audio signal based on at least one of a semantic vector, an audio vector, and an emotion vector. Based on the one or more descriptors, a client device may render the textual transcriptions of vocal elements of the audio signal in an animated manner. The client device may further render the textual transcriptions of the vocal elements of the audio signal to be substantially in synchrony to the audio signal being rendered by the media rendering source. In addition, the client device may further receive an identification of a song corresponding to the audio sample, and may render lyrics of the song in an animated manner. | 11-10-2011
20110276157 | Methods and Systems for Processing a Sample of a Media Stream - Methods and systems for processing a sample of a media stream are provided. In one example, a method includes receiving a sample of a media stream rendered by a media rendering source, and for matching media recordings from which the sample was obtained, the method also includes receiving a signature file including a temporally mapped collection of features describing content of the matching media recordings. Each received signature file may be stored in memory. In addition, a signature stream may be received that includes features extracted from the media stream, and a number of matching features between the signature stream of the media stream and the signature file for at least one matching media recording can be determined. In one example, when substantially matching features between the signature stream of the media stream and the signature file for a matching media recording stop occurring, a discontinuity can be identified. | 11-10-2011
20110276333 | Methods and Systems for Synchronizing Media - Systems and methods of synchronizing media are provided. A client device may be used to capture a sample of a media stream being rendered by a media rendering source. The client device sends the sample to a position identification module to determine a time offset indicating a position in the media stream corresponding to the sampling time of the sample, and optionally a timescale ratio indicating a speed at which the media stream is being rendered by the media rendering source based on a reference speed of the media stream. The client device calculates a real-time offset using a present time, a timestamp of the media sample, the time offset, and optionally the timescale ratio. The client device then renders a second media stream at a position corresponding to the real-time offset to be in synchrony to the media stream being rendered by the media rendering source. | 11-10-2011
20110276334 | Methods and Systems for Synchronizing Media - Systems and methods of synchronizing media are provided. A client device may be used to capture a sample of a media stream being rendered by a media rendering source. The client device sends the sample to a position identification module to determine a time offset indicating a position in the media stream corresponding to the sampling time of the sample, and optionally a timescale ratio indicating a speed at which the media stream is being rendered by the media rendering source based on a reference speed of the media stream. The client device calculates a real-time offset using a present time, a timestamp of the media sample, the time offset, and optionally the timescale ratio. The client device then renders a second media stream at a position corresponding to the real-time offset to be in synchrony to the media stream being rendered by the media rendering source. | 11-10-2011
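
The two "Synchronizing Media" abstracts name the inputs to the real-time offset (present time, sample timestamp, time offset, and optional timescale ratio) but do not give the formula. The sketch below assumes a simple linear combination of those inputs; the function name `real_time_offset` and its parameter names are hypothetical and are not taken from the applications.

```python
def real_time_offset(present_time, sample_timestamp, time_offset, timescale_ratio=1.0):
    """Estimate the current playback position in the media stream, in seconds.

    Assumed model (not stated in the abstract): the stream has advanced by
    the wall-clock time elapsed since the sample was captured, scaled by the
    rendering speed relative to the reference speed.
    """
    elapsed = present_time - sample_timestamp
    return time_offset + timescale_ratio * elapsed


# A sample captured at t = 100.0 s was located 42.0 s into the stream.
# At t = 103.5 s, with the stream playing at reference speed:
position = real_time_offset(103.5, 100.0, 42.0)
print(position)  # 45.5
```

Under this assumption, a client would start the second media stream (for example, lyrics) at `position` so that it lines up with the stream being rendered.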

Patent applications by Avery Li Chun Wang, Palo Alto, CA US
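
The recognition step described in applications 20080208891 and 20110071838 (retrieve matching fingerprints, then check whether the corresponding landmarks are linearly related) can be illustrated with a small sketch. It makes several simplifying assumptions that are not in the abstracts: landmarks are integer time indices, fingerprints are opaque hashable values, the linear relation has unit slope (no playback speed change) so that a consistent time offset is enough, and the index is an in-memory hash map in place of the logarithmic-time database lookup the abstracts mention. The names `build_index`, `best_match`, and `min_votes` are hypothetical.

```python
from collections import Counter, defaultdict


def build_index(files):
    """Map each fingerprint to the (file_id, landmark_time) pairs where it occurs.

    files: {file_id: [(landmark_time, fingerprint), ...]}
    """
    index = defaultdict(list)
    for file_id, pairs in files.items():
        for landmark_time, fingerprint in pairs:
            index[fingerprint].append((file_id, landmark_time))
    return index


def best_match(index, sample_pairs, min_votes=3):
    """Return (file_id, offset, votes) for the best candidate, or None.

    Each matching fingerprint votes for (file, file_time - sample_time).
    Many votes for one offset mean the landmarks share the same time
    evolution, which is the unit-slope case of "linearly related".
    """
    votes = Counter()
    for sample_time, fingerprint in sample_pairs:
        for file_id, file_time in index.get(fingerprint, ()):
            votes[(file_id, file_time - sample_time)] += 1
    if not votes:
        return None
    (file_id, offset), count = votes.most_common(1)[0]
    return (file_id, offset, count) if count >= min_votes else None


# Toy data: the sample corresponds to file "a" starting at landmark time 5.
files = {
    "a": [(0, "x"), (5, "y"), (9, "z"), (12, "w")],
    "b": [(1, "x"), (2, "q"), (7, "z")],
}
index = build_index(files)
sample = [(0, "y"), (4, "z"), (7, "w")]
print(best_match(index, sample))  # ('a', 5, 3)
```

File "b" shares one fingerprint with the sample but receives only a single vote, so it falls below the `min_votes` threshold; this is the sense in which isolated fingerprint matches are not sufficient without consistent landmark timing.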