Patent application number | Description | Published |
20080212882 | Pattern Encoded Dictionaries - The present invention is related to a method and system providing a pattern-classifier encoded dictionary for use in language processing systems implemented in computer systems. The pattern encoded dictionary according to the present invention may be utilized in Optical Character Recognition (OCR) systems or Automatic Speech Recognition (ASR) systems to retrieve reliably identified words used in an adaptive manner or as a tool to configure said OCR or ASR system. | 09-04-2008 |
20090016606 | METHOD, SYSTEM, DIGITAL CAMERA AND ASIC FOR GEOMETRIC IMAGE TRANSFORMATION BASED ON TEXT LINE SEARCHING - The present invention provides a method, system and/or a digital camera providing a geometrical transformation of deformed images of documents comprising text, by text line tracking, resulting in an image comprising parallel text lines. The transformed image is provided as an input to an OCR program either running in a computer system or in a processing element comprised in said digital camera. | 01-15-2009 |
20090067756 | METHOD AND SYSTEM FOR VERIFICATION OF UNCERTAINLY RECOGNIZED WORDS IN AN OCR SYSTEM - The present invention provides a method and system for confirming uncertainly recognized words as reported by an Optical Character Recognition process by using spelling alternatives as search arguments for an Internet search engine. The measured number of hits for each spelling alternative is used to provide a confirmation measure for the most probable spelling alternative. Whenever the confirmation measure is inconclusive, a plurality of search strategies are used to reach a measured result comprising zero hits except for one spelling alternative that is used as the correct alternative. | 03-12-2009 |
20100272359 | METHOD FOR RESOLVING CONTRADICTING OUTPUT DATA FROM AN OPTICAL CHARACTER RECOGNITION (OCR) SYSTEM, WHEREIN THE OUTPUT DATA COMPRISES MORE THAN ONE RECOGNITION ALTERNATIVE FOR AN IMAGE OF A CHARACTER - The present invention is related to a method for resolving contradicting output data from an Optical Character Recognition (OCR) system providing a conversion of pixelized documents into computer coded text as the output data, wherein the OCR output data comprises at least a first and second character listed as being likely candidates for an exemplar of a same sampled character instance from the pixelized document, by providing steps that identify locations of differences in graphical appearance between the candidate characters, and then using the location information to identify corresponding locations in the sampled character instance. Based on a correlation technique, this location information is used to select the correct candidate character as the identification of the sampled character instance. | 10-28-2010 |
20100303356 | METHOD FOR PROCESSING OPTICAL CHARACTER RECOGNITION (OCR) DATA, WHEREIN THE OUTPUT COMPRISES VISUALLY IMPAIRED CHARACTER IMAGES - The present invention provides a method for an Optical Character Recognition (OCR) system providing recognition of characters that are partly hidden by cross-outs due to, for example, an imprint of a stamp, handwritten signatures, etc. The method establishes a set of template images of certainly recognized characters from the image of the text being processed by the OCR system, wherein the effect of the crossed out section is modelled into the template images before comparing these images with the image of a visually impaired crossed out character. The modelled template image having the highest similarity with the visually impaired crossed out character is the correct identification for the visually impaired character instance. | 12-02-2010 |
20110026813 | RELATIVE THRESHOLD AND USE OF EDGES IN OPTICAL CHARACTER RECOGNITION PROCESS - Converting images to binary image representations is part of an Optical Character Recognition program in a computer system. The method and system according to the present invention use a relative threshold level to convert the image to its binary image representation. | 02-03-2011 |
20110103713 | WORD LENGTH INDEXED DICTIONARY FOR USE IN AN OPTICAL CHARACTER RECOGNITION (OCR) SYSTEM - A method for organizing a dictionary look up process in an Optical Character Recognition (OCR) system is described. Word length and an additional relative position within the words of a graphical feature, for example a stem, ascender, descender, etc., are used in combination to index a dictionary. Unrecognized characters are analysed the same way, i.e. word length and relative position within the unrecognized word are then used to address the dictionary, resulting in an output of one or more candidate words as an identification of the unrecognized word. An iterative process may reduce the number of candidate words identified in the dictionary look up process. | 05-05-2011 |
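
The word length indexed dictionary in the last entry (20110103713) can be illustrated with a minimal Python sketch. It is not the patented method, only one plausible reading of the abstract. The letter classes `ASCENDERS` and `DESCENDERS`, the function names `shape_key`, `build_index` and `candidates`, and the use of already-recognized characters as the narrowing step are all assumptions made for illustration. The sketch also assumes that the word length and feature positions of the unrecognized word have already been measured from its image.

```python
from collections import defaultdict

# Assumed feature classes for lowercase Latin letters (illustrative only).
ASCENDERS = set("bdfhklt")
DESCENDERS = set("gjpqy")


def shape_key(word):
    """Return (word length, tuple of (position, feature)) for a word."""
    features = tuple(
        (i, "A" if ch in ASCENDERS else "D")
        for i, ch in enumerate(word)
        if ch in ASCENDERS or ch in DESCENDERS
    )
    return len(word), features


def build_index(words):
    """Index dictionary words by length and feature positions."""
    index = defaultdict(list)
    for word in words:
        word = word.lower()
        index[shape_key(word)].append(word)
    return index


def candidates(index, length, features, known=None):
    """Look up words matching the measured length and feature positions.

    `known` maps position -> character for characters the OCR engine did
    recognize; filtering on it is an assumed stand-in for the iterative
    narrowing step mentioned in the abstract.
    """
    hits = index.get((length, tuple(features)), [])
    known = known or {}
    return [w for w in hits if all(w[i] == c for i, c in known.items())]


index = build_index(["cat", "dog", "dig", "bag", "the", "tie"])
# Unrecognized 3-letter word: ascender at position 0, descender at position 2.
all_hits = candidates(index, 3, [(0, "A"), (2, "D")])
# A second pass that also uses a recognized middle character.
narrowed = candidates(index, 3, [(0, "A"), (2, "D")], known={1: "i"})
```

Here the first lookup returns every word sharing the same length and feature layout, and the second pass narrows that list once one character is known. A real system would derive the feature positions from the character images rather than from the letters themselves.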