Patent application number | Description | Published |
20130120438 | Methods and Apparatuses for Generating Co-Salient Thumbnails for Digital Images - Methods and apparatus for generating context-sensitive, co-salient thumbnails are described. Co-salient thumbnails may effectively convey more useful information than conventional thumbnails while using the same or similar display space. As such, co-salient thumbnails may make it easier for a user to locate a particular image in a large collection of similar images. In some embodiments, for each of a plurality of patches in a first image, an illustrative method may identify a corresponding similar patch in a second image based on one or more similarity measures. The method may then determine a distance from each of the plurality of patches to its corresponding similar patch and select a crop in the first image relative to the second image based on the determined distance. Optionally, the method may repeat these operations in the opposite direction to select a crop in the second image relative to the first image. | 05-16-2013 |
20130120454 | Methods and Apparatuses for Generating Thumbnail Summaries for Image Collections - Methods and apparatuses for generating thumbnail summaries for image collections are described. Thumbnails summaries may be representative of the image collection while being disjoint from each other. In some embodiments, a method may sample crops from some or all images in an image collection. From these candidate crops, the method may select representative but disjoint crops. In some embodiments, a desired representativeness and/or disjointedness of the candidate crops may be defined by a user. The method may then generate and/or select thumbnail summaries for the collection of images based at least in part on the selected candidate crops. | 05-16-2013 |
20130121409 | Methods and Apparatus for Face Fitting and Editing Applications - Various embodiments of methods and apparatus for face fitting are disclosed. In one embodiment, sets of two-dimensional local feature points on a face in each image of a set of images are identified. The set of images includes a sequence of frames a video stream. A three-dimensional face model for the face in the each image is generated as a combination of a set of predefined three-dimensional face models. In some embodiments, the generating includes reducing an error between a projection of vertices of the set of predefined three-dimensional face models and the two-dimensional local feature points of the each image, and constraining facial expression of the three-dimensional face model to change smoothly from image to image in the sequence of video frames. | 05-16-2013 |
20130129141 | Methods and Apparatus for Facial Feature Replacement - Three dimensional models corresponding to a target image and a reference image are selected based on a set of feature points defining facial features in the target image and the reference image. The set of feature points defining the facial features in the target image and the reference image are associated with corresponding 3-dimensional models. A 3D motion flow between the 3-dimensional models is computed. The 3D motion flow is projected onto a 2D image plane to create a 2D optical field flow. The target image and the reference image are warped using the 2D optical field flow. A selected feature from the reference image is copied to the target image. | 05-23-2013 |
20130129158 | Methods and Apparatus for Facial Feature Replacement - A set of feature points for defining features of a face is detected in a target image and a reference image. The target image is warped to accommodate a selected feature from the reference image using the set of feature points and a set of three-dimensional models. The reference image is warped to a pose of the target image using the set of feature points and the set of three-dimensional models. The selected feature is copied from the reference image to the target image. | 05-23-2013 |
20130129213 | Non-Rigid Dense Correspondence - Non-rigid dense correspondence (NRDC) for image enhancement may be performed. In one embodiment, a correspondence may be computed for each one of a plurality of regions of a source image to one of a plurality of regions of a reference image. Computing the correspondences may include searching within a search range for each of a plurality of image characteristics. One or more of the correspondences may be aggregated into matched regions. A global color transform and/or deblurring may be applied to the source image. In one embodiment, the global color transform and/or deblurring may be based on the matched regions. At least one of the search ranges may optionally be adjusted. In some embodiments, computing, aggregating, applying and/or deblurring, and adjusting may be iteratively performed. | 05-23-2013 |
20130163874 | Determining Correspondence Between Image Regions - Determining correspondence between image regions can include: selecting first and second regions of visual content including pixels in a computer system, the first region comprising a first patch to be mapped to the second region; selecting at least two heuristics for use in mapping the first patch to the second region, the heuristics selected from the group consisting of: (i) nearby-pixel mapping evaluation; (ii) random-perturbation mapping evaluation; (iii) evaluation of multiple mapping candidates identified in an iterative search process; and (iv) enrichment to increase a collection of mapping candidates; and identifying, using the selected heuristics, at least one patch in the second region for the first patch. | 06-27-2013 |
20130163884 | Image Processing Using Image Web - A computer-implemented method for determining correspondence between images includes: receiving images in a computer system; performing iterations using the computer system to find respective mappings for each patch of pixels in the images to a patch in another one of the images such that the mappings have minimal patch distance, the iterations including at least: (i) evaluation of a nearby-pixel mapping in a current image, (ii) evaluation of a randomly selected mapping in the current image, and (iii) evaluation of a randomly selected mapping in another one of the images; and generating a mapping record that results from the iterations. | 06-27-2013 |
20130254688 | Content Aware Image Editing - An image is displayed using a computer system. The image includes contents that have a visible feature therein at a first location. A first input is received that includes a user movement of at least the visible feature from the first location. During the user movement, the first location is synthesized with content from where the visible feature is currently located. A second input is received that specifies an end of the user movement at a second location. A source area in the image is identified. The method further includes identifying additional contents within the source area. The additional contents are identified using a patch-based optimization algorithm on the image. The method further includes updating the image to have the additional contents at least in the first location. | 09-26-2013 |
20130286221 | Camera Calibration and Automatic Adjustment of Images - Techniques and apparatus for automatic upright adjustment of digital images. An automatic upright adjustment technique is described that may provide an automated approach for straightening up slanted features in an input image to improve its perceptual quality. This correction may be referred to as upright adjustment. A set of criteria based on human perception may be used in the upright adjustment. A reprojection technique that implements an optimization framework is described that yields an optimal homography for adjustment based on the criteria and adjusts the image according to new camera parameters generated by the optimization. An optimization-based camera calibration technique is described that simultaneously estimates vanishing lines and points as well as camera parameters for an image; the calibration technique may, for example, be used to generate estimates of camera parameters and vanishing points and lines that are input to the reprojection technique. | 10-31-2013 |
20130287318 | Automatic Adjustment of Images using a Homography - Techniques and apparatus for automatic upright adjustment of digital images. An automatic upright adjustment technique is described that may provide an automated approach for straightening up slanted features in an input image to improve its perceptual quality. This correction may be referred to as upright adjustment. A set of criteria based on human perception may be used in the upright adjustment. A reprojection technique that implements an optimization framework is described that yields an optimal homography for adjustment based on the criteria and adjusts the image according to new camera parameters generated by the optimization. An optimization-based camera calibration technique is described that simultaneously estimates vanishing lines and points as well as camera parameters for an image; the calibration technique may, for example, be used to generate estimates of camera parameters and vanishing points and lines that are input to the reprojection technique. | 10-31-2013 |
20140105499 | Determining Correspondence Between Image Regions - Techniques for determining correspondence between image regions are described. A computing system stores images that are comparable to determine corresponding image patches of the images. An approximation algorithm is implemented, and for multiple image patches in a region in a first image, corresponding image patches are determined in a second image. The approximation algorithm performs iterations utilizing a nearby-pixel mapping evaluation and a random-perturbation mapping evaluation to determine and select the corresponding image patches in the second image. | 04-17-2014 |
20140212054 | Image Classification for Adjustment - Image classification techniques are described for adjustment of an image. In one or more implementations, an image is classified by one or more computing device based on suitability of the image for adjustment to correct perspective distortion of the image. Responsive to a classification of the image as not suitable for the adjustment, suitability of the image is detected for processing by a different image adjustment technique by the one or more computing devices. | 07-31-2014 |
20140333644 | LOW MEMORY CONTENT AWARE FILL - A first image at a first resolution is received, the first image having a first hole therein. Based on the first image, a second image is generated at a second resolution lower than the first resolution, the second image having a second hole therein corresponding to the first hole. In the second image, one or more second-image source patches for the second hole are identified. At least one first-image source patch in the first image is identified based on a location of the identified second-image source patch. The identified at least one first-image source patch are stored in memory. Fill content are identified in the at least one first-image source patch stored in the memory. The identified fill content are placed in the first hole. | 11-13-2014 |
20140337721 | Content Aware Slideshows - A method, system, and computer-readable storage medium for performing content based transitions between images. Image content within each image of a set of images are analyzed to determine at least one respective characteristic metric for each image. A respective transition score for each pair of at least a subset of the images is determined with respect to each transition effect of a plurality of transition effects based on the at least one respective characteristic metric for each image. Transition effects implementing transitions between successive images for a sequence of the images are determined based on the transitions between successive images for a sequence of the images are determined based on the transition scores. An indication of the determined transition effects is stored. The determined transition effects are useable to present the images in a slideshow or other image sequence presentation. | 11-13-2014 |
20140376807 | Patch-Based Synthesis Techniques Using Color and Color Gradient Voting - Methods, apparatus, and computer-readable storage media for patch-based image synthesis using color and color gradient voting. A patch matching technique provides an extended patch search space that encompasses geometric and photometric transformations, as well as color and color gradient domain features. The photometric transformations may include gain and bias. The patch-based image synthesis techniques may also integrate image color and color gradients into the patch representation and replace conventional color averaging with a technique that performs voting for colors and color gradients and then solves a screened Poisson equation based on values for colors and color gradients when blending patch(es) with a target image. | 12-25-2014 |
20150030238 | VISUAL PATTERN RECOGNITION IN AN IMAGE - A system may be configured as an image recognition machine that utilizes an image feature representation called local feature embedding (LFE). LFE enables generation of a feature vector that captures salient visual properties of an image to address both the fine-grained aspects and the coarse-grained aspects of recognizing a visual pattern depicted in the image. Configured to utilize image feature vectors with LFE, the system may implement a nearest class mean (NCM) classifier, as well as a scalable recognition algorithm with metric learning and max margin template selection. Accordingly, the system may be updated to accommodate new classes with very little added computational cost. This may have the effect of enabling the system to readily handle open-ended image classification problems. | 01-29-2015 |
20150063713 | GENERATING A HIERARCHY OF VISUAL PATTERN CLASSES - A hierarchy machine may be configured as a clustering machine that utilizes local feature embedding to organize visual patterns into nodes that each represent one or more visual patterns. These nodes may be arranged as a hierarchy in which a node may have a parent-child relationship with one or more other nodes. The hierarchy machine may implement a node splitting and tree-learning algorithm that includes hard-splitting of nodes and soft-assignment of nodes to perform error-bounded splitting of nodes into clusters. This may enable the hierarchy machine, which may form all or part of a visual pattern recognition system, to perform large-scale visual pattern recognition, such as font recognition or facial recognition, based on a learned error-bounded tree of visual patterns. | 03-05-2015 |
Patent application number | Description | Published |
20090174761 | Device, Method and Computer Program Product for Responding to Media Conference Deficiencies - A method for responding to media conference deficiencies, the method includes: monitoring, by at least one receiver, a quality of media conference signals being received by at least one receiver during the media conference; sending, in response to the monitoring, to at least an end user transmitter that transmitted the media conference signals, a quality indication representative of a quality of the received media conference signals; recording inadequately received media conference signals that were inadequately received by a certain end user receiver and participating in an activity related to a transmission, to the certain end user receiver, of the inadequately received media conference signals or of a representation of the inadequately received media conference signals. | 07-09-2009 |
20110144990 | RATING SPEECH NATURALNESS OF SPEECH UTTERANCES BASED ON A PLURALITY OF HUMAN TESTERS - A method that includes: generating an utterance-specific scoring model for each one of a plurality of obtained speech utterances, each scoring model usable to estimate a level of speech naturalness for a respective one of the obtained speech utterances; presenting a plurality of human-testers with some of the obtained speech utterances; receiving, for each presented speech utterance, a plurality of human tester generated speech utterances being at least one human repetition of the presented speech utterance; updating the scoring model for each presented speech utterance, based on respective human-tester generated speech utterances; and obtaining a speech naturalness score for each presented speech utterance by respectively applying the updated utterance-specific scoring model to each presented speech utterance. | 06-16-2011 |
20120239746 | Device, Method and Computer Program Product for Responding to Media Conference Deficiencies - A method for responding to media conference deficiencies, the method includes: monitoring, by at least one receiver, a quality of media conference signals being received by at least one receiver during the media conference; sending, in response to the monitoring, to at least an end user transmitter that transmitted the media conference signals, a quality indication representative of a quality of the received media conference signals; recording inadequately received media conference signals that were inadequately received by a certain end user receiver and participating in an activity related to a transmission, to the certain end user receiver, of the inadequately received media conference signals or of a representation of the inadequately received media conference signals. | 09-20-2012 |
20120271637 | DERIVING GEOGRAPHIC DISTRIBUTION OF PHYSIOLOGICAL OR PSYCHOLOGICAL CONDITIONS OF HUMAN SPEAKERS WHILE PRESERVING PERSONAL PRIVACY - A method including: obtaining, via a plurality of communication devices, a plurality of speech signals respectively associated with human speakers, the speech signals including verbal components and non-verbal components; identifying a plurality of geographical locations, each geographic location associated with a respective one of the plurality of the communication devices; extracting the non-verbal components from the obtained speech signals; deducing physiological or psychological conditions of the human speakers by analyzing, over a specified period, the extracted non-verbal components, using predefined relations between characteristics of the non-verbal components and physiological or psychological conditions of the human speakers; and providing a geographical distribution of the deduced physiological or psychological conditions of the human speakers by associating the deduced physiological or psychological conditions of the human speakers with geographical locations thereof. | 10-25-2012 |
20130013313 | STATISTICAL ENHANCEMENT OF SPEECH OUTPUT FROM A STATISTICAL TEXT-TO-SPEECH SYNTHESIS SYSTEM - A method, system and computer program product are provided for enhancement of speech synthesized by a statistical text-to-speech (TTS) system employing a parametric representation of speech in a space of acoustic feature vectors. The method includes: defining a parametric family of corrective transformations operating in the space of the acoustic feature vectors and dependent on a set of enhancing parameters; and defining a distortion indictor of a feature vector or a plurality of feature vectors. The method further includes: receiving a feature vector output by the system; and generating an instance of the corrective transformation by: calculating a reference value of the distortion indicator attributed to a statistical model of the phonetic unit emitting the feature vector; calculating an actual value of the distortion indicator attributed to feature vectors emitted by the statistical model of the phonetic unit emitting the feature vector; calculating the enhancing parameter values depending on the reference value of the distortion indicator, the actual value of the distortion indicator and the parametric corrective transformation; and deriving an instance of the corrective transformation corresponding to the enhancing parameter values from the parametric family of the corrective transformations. The instance of the corrective transformation may be applied to the feature vector to provide an enhanced feature vector. | 01-10-2013 |
20130317825 | DERIVING GEOGRAPHIC DISTRIBUTION OF PHYSIOLOGICAL OR PSYCHOLOGICAL CONDITIONS OF HUMAN SPEAKERS WHILE RESERVING PERSONAL PRIVACY - A method including: obtaining, via a plurality of communication devices, a plurality of speech signals respectively associated with human speakers, the speech signals including verbal components and non-verbal components; identifying a plurality of geographical locations, each geographic location associated with a respective one of the plurality of the communication devices; extracting the non-verbal components from the obtained speech signals; deducing physiological or psychological conditions of the human speakers by analyzing, over a specified period, the extracted non-verbal components, using predefined relations between characteristics of the non-verbal components and physiological or psychological conditions of the human speakers; and providing a geographical distribution of the deduced physiological or psychological conditions of the human speakers by associating the deduced physiological or psychological conditions of the human speakers with geographical locations thereof. | 11-28-2013 |
20140074468 | System and Method for Automatic Prediction of Speech Suitability for Statistical Modeling - An embodiment according to the invention provides a capability of automatically predicting how favorable a given speech signal is for statistical modeling, which is advantageous in a variety of different contexts. In Multi-Form Segment (MFS) synthesis, for example, an embodiment according to the invention uses prediction capability to provide an automatic acoustic driven template versus model decision maker with an output quality that is high, stable and depends gradually on the system footprint. In speaker selection for a statistical Text-to-Speech synthesis (TTS) system build, as another example context, an embodiment according to the invention enables a fast selection of the most appropriate speaker among several available ones for the full voice dataset recording and preparation, based on a small amount of recorded speech material. | 03-13-2014 |
20150066512 | Method and Apparatus for Detecting Synthesized Speech - Computer systems employing speaker verification as a security approach to prevent un-authorized access by intruders may be tricked by a synthetic speech with voice characteristics similar to those of an authorized user of the computer system. According to at least one example embodiment, a method and corresponding apparatus for detecting a synthetic speech signal include extracting a plurality of speech features from multiple segments of the speech signal; analyzing the plurality of speech features to determine whether the plurality of speech features exhibit periodic variation behavior; and determining whether the speech signal is a synthetic speech signal or a natural speech signal based on whether or not a periodic variation behavior of the plurality of speech features is detected. The embodiments of synthetic speech detection result in security enhancement of the computer system employing speaker verification. | 03-05-2015 |