Inventors list

Assignees list

Classification tree browser

Top 100 Inventors

Top 100 Assignees


Limited to specially coded, human-readable characters

Subclass of:

382 - Image analysis

382181000 - PATTERN RECOGNITION

Patent class list (only not empty are listed)

Deeper subclasses:

Class / Patent application numberDescriptionNumber of patent applications / Date published
382183000 Characters formed entirely of parallel bars (e.g., CMC-7) 5
20130071029Dynamic Multidimensional Barcodes For Information Handling System Service Information - Multi-dimensional barcodes at a product include service identifiers for the product so that an end user with a portable information handling system captures an image of the multi-dimensional barcode and extracts the service identifiers to obtain service information from a service network location. For example, a service identifier embeds a URL that links to a video demonstrating how to assemble the product. As another example, a service identifier links to a service network location and includes a unique identifier so that an end user retrieves warranty or purchase information for the product.03-21-2013
20090148044Device and Method for Virtualizing an Image Sensor - A method virtualizes an image sensor. The method comprises selecting one of at least two portions of a filter to use as a function of a mode of operation. The method comprises capturing an image using the selected one of the at least two portions. The method comprises executing a functionality using data extracted from the image. The functionality corresponds to the mode of operation.06-11-2009
20090285484PORTABLE IMAGE PROCESSING AND MULTIMEDIA INTERFACE - A portable device configured to provide enhanced shopping information is provided. The portable device has a display screen and an image capture device and the portable device is configured to access databases through a wireless network. The portable device includes image recognition logic that is configured to perform analysis of an image of an object that includes a bar code associated with a product. The analysis determines if the graphics found on the object correspond to a bar code and a portion of an image with the bar code is communicated through the wireless network to the databases to identify the product. The portable device further includes image generation logic that is configured to obtain product information for the identified product from the databases and present the product information on the display screen of the portable device.11-19-2009
20090041353Method and system for collecting event attendee information - A method and system for a user to collect point of contact information for one or more event attendees, in which identification information associated with an attendee is scanned and transmitted to a data server. More particularly, a method and system is provided for a user to scan a bar code on a visitor identification badge and extract an identification code which can be sent to a data server in order to obtain certain visitor information associated with the identification code. The user can access the data server to obtain the information associated with the user's data collection activity which can stored on the data server such as the information on each scanned visitor badge and associated visitor information, as well as time and location of the data capture.02-12-2009
20080304747IDENTIFIERS FOR DIGITAL MEDIA - A computer-implemented method includes receiving a piece of content, wherein the piece of content comprises a machine-readable identifier, identifying the machine-readable identifier in the piece of content, and associating the machine-readable identifier to the piece of media content.12-11-2008
382184000 With separate timing or alignment marks 2
20080253658METHOD AND SYSTEM FOR PERFORMING IMAGE MARK RECOGNITION - A method and system for performing image mark recognition for a document is disclosed. A document is scanned into a digital image. Reference image marks are sensed from the digital image. The reference image marks may include trigger row marks and/or corner/crop marks. Coordinates denoting the location of cells within the digital image are determined base on the locations of the reference image marks. Response marks are evaluated for darkness, opacity, and/or grayness on a pixel-by-pixel basis. The response marks are each assigned a percentage value based on the ratio between a total color value for a cell and a maximum color value for a cell.10-16-2008
20100092089Image And Part Recognition Technology - A system and method for recognition of images may include the use of alignment markers. The image recognized may be a pattern from an array, a character, a number, a shape, and/or irregular shapes. The pattern may be formed by elements in an array such as an identification marking and/or a sensor array. More particularly, the system and method relate to discriminating between images by accounting for the orientation of the image. The size and/or location of alignment markers may provide information about the orientation of an image. Information about the orientation of an image may reduce false recognitions. The system and method of image recognition may be used with identification markings, biosensors, micro-fluidic arrays, and/or optical character recognition systems.04-15-2010
Entries
DocumentTitleDate
20090214115IMAGE PROCESSING APPARATUS AND COMPUTER READABLE MEDIUM - An image processing apparatus includes: an image acceptance unit that accepts an image; a character information adding unit that adds a character identifier for uniquely identifying a character, a character position indicating a position of the character in the image, a character size indicating a size of the character, and a character color indicating a color of the character as character information, to the image accepted by the image acceptance unit; a font allocation unit that allocates a font without drawing element as a font corresponding to the character identifier within the character information added by the character information adding unit; and an electronic document generation unit that generates an electronic document for reference to the font information allocated by the font allocation unit, based on the character information added to the image by the character information adding unit.08-27-2009
20130034302CHARACTER RECOGNITION APPARATUS, CHARACTER RECOGNITION METHOD AND PROGRAM - The character recognition apparatus recognizes characters from a read document original to correct a character string as a character recognition result in a word unit with a space character as a separator. The character recognition apparatus includes a circumscribed rectangle formation portion which forms a circumscribed rectangle for each recognized alphabet character string, a fixed-pitch font determination portion which determines whether or not a font is a fixed-pitch font based on a distance between center lines in a width direction of adjacent circumscribed rectangles, a portion for determining an excess space character which determines, in the case of a fixed-pitch font, that the space character is an excess based on that a width of a space character in the character string is narrower than a predetermined width, and a portion for deleting the space character determined as an excess from the character string.02-07-2013
20120263380DATA PROCESSING APPARATUS, METHOD FOR CONTROLLING DATA PROCESSING APPARATUS, AND NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM - When a display language is different from an OCR language, which is used for document name OCR, the name of a document to be sent may not be correctly displayed on a screen. A data processing apparatus is provided that includes a document name setting unit configured to set a document name including a character string recognized on the basis of document data for the document data generated by a read unit, and a control unit configured to restrain the document name setting unit from setting the document name when a language specified by a character recognition language specifying unit is different from a language specified by a display language setting unit.10-18-2012
20120183222COMPUTING DEVICE AND METHOD FOR AUTOMATICALLY TYPESETTING PATENT IMAGES - A method for automatically typesetting patent images extracts a brief introduction of each patent image from a description part of a patent document, and records a keyword of the brief introduction. The method distinguishes an image label of each patent image from an image part of the patent document. The method rotates the patent image by ninety degrees clockwise in response that the image label of the patent image does not contain the keyword, and outputs the rotated image onto a display device.07-19-2012
20090154810IMAGE PROCESSING DEVICE, IMAGE PROCESSING METHOD, AND PROGRAM AND RECORDING MEDIUM THEREOF - In an electronic document of drawing descriptions of a page image and a character, it is desired that although a font data necessary for drawing the character is held in the electronic document, the size of the electronic document is minimized. Furthermore, it is desired to ensure visibility at the time of highlighting of search. There is generated an electronic document in which a document image, a plurality of character codes obtained by executing a character recognition processing with respect to the document image, and a plurality of kinds of glyph data to be utilized in common with respect to the plurality of character codes when drawing characters corresponding to the plurality of character codes are stored. The plurality of kinds of glyph data are selectively used when characters corresponding to the character codes are drawn. It is desirable that the glyph data be the one in a simple form.06-18-2009
20130084009SYSTEMS, METHODS AND USER INTERFACES IN A PATENT MANAGEMENT SYSTEM - A system and method of providing data for a patent management system is proposed. The system presents one or more data fields of interest to a user and the method comprises downloading at least one patent document from an external patent database; applying optical character recognition to the downloaded document to provide a text-readable version of the at least one patent document; automatically applying electronic text analysis to the text-readable version to extract one or more data elements associated with a field of interest, and transmitting the data elements to a user.04-04-2013
20130084011PROOF READING OF TEXT DATA GENERATED THROUGH OPTICAL CHARACTER RECOGNITION - A novel system includes: a first proof reading tool for performing carpet proof reading on text data; a second proof reading tool for performing side-by-side proof reading on the text data; a storage unit configured to store a log of proof reading operations having been performed by using the first and second proof reading tools; and an analysis unit configured to determine, for each attribute serving as units in which carpet proof reading is performed with the first proof reading tool, whether or not to use the first proof reading tool in proof reading of the attribute, by comparing a first estimated value of a time taken when proof reading is performed by using the first proof reading tool with a second estimated value of a time taken when proof reading is performed by using the second proof reading tool without using the first proof reading tool, the first and second estimated values being calculated on the basis of the log.04-04-2013
20130084010IN-FIELD DEVICE FOR DE-CENTRALIZED WORKFLOW AUTOMATION - In one example, a system is provided. The system includes a portable, in-field unit including: a tag reader to acquire an ID tag identifier from a tag located in or on a physical item positioned within functional range of the in-field unit tag reader; a digital processor arranged for executing software code stored in the in-field unit responsive to the acquired ID tag identifier, the stored software code including—a customer application layer; and a database adapter component configured to provide database services to the processor; wherein the database services include accessing a stored database to acquire stored data associated with the acquired ID tag identifier.04-04-2013
20120114243Shape Clustering in Post Optical Character Recognition Processing - Techniques for shape clustering and applications in processing various documents, including an output of an optical character recognition (OCR) process.05-10-2012
20110019915Methods and data structures for multiple combined improved searchable formatted documents including citation and corpus generation - Searchable annotated formatted documents are produced by correlating documents stored as photographic or scanned graphic representations of an actual document (evidence, report, court order, etc.) with textual version of the same documents. A produced document will provide additional details in a data structure that supports citation annotation as well as other types of analysis of a document. The data structure also supports generation of citation reports and corpus reports. Methods of creating searchable annotated formatted documents including citation and corpus reports by correlating and correcting text files with photographic or scanned graphic of the original documents. Data structures for correlating and correcting text files with graphic images. Generation of citation reports, concordance reports, and corpus reports. Data structures for citation reports, concordance reports, and corpus reports generation. Multiple document data structures are used to create multiple citation documents and reports. Embodiments of citation reports and corpus reports contain correlated, comprehensive multiple citations.01-27-2011
20110280483Shape Clustering in Post Optical Character Recognition Processing - Techniques for shape clustering and applications in processing various documents, including an output of an optical character recognition (OCR) process.11-17-2011
20110299779Methods and Systems for Detecting Numerals in a Digital Image - Aspects of the present invention are related to systems and methods for determining the location of numerals in an electronic document image.12-08-2011
20110286669FORM PROCESSING SYSTEM, OCR DEVICE, FORM CREATION DEVICE, AND COMPUTER READABLE MEDIUM - There is provided a form processing system including a form creation device and an OCR device, wherein the form creation device includes a layout generation unit that generates layout information denoting a layout of a form and a layout transmission unit that transmits the layout information generated to the OCR device, and the OCR device includes a layout acquisition unit that acquires the layout information transmitted from the form creation device and an OCR processing unit that performs OCR processing on image data of the form read by a scanner, based on the layout information acquired.11-24-2011
20110286668IMAGE PROCESSING DEVICE, IMAGE PROCESSING METHOD, AND COMPUTER READABLE MEDIUM - An image processing device includes a storage module, character recognition module, a circumscribed rectangle extraction module, a ratio extraction module, and a character size calculation module. The storage module stores a reference ratio between a reference size of a reference circumscribed rectangle and a reference character size in a reference character image representing a reference character in association with a reference character identification code which uniquely identified the reference character. The character recognition module recognizes a character image in an image to get a character identification code from the recognized character image. The circumscribed rectangle extraction module extracts a circumscribed rectangle of the character image. The ratio extraction module extracts the reference ratio corresponding to the reference character identification code stored in the storage module based on the character identification code. The character size calculation module calculates a character size of the character image.11-24-2011
20110103688System and method for increasing the accuracy of optical character recognition (OCR) - A system and/or method for increasing the accuracy of optical character recognition (OCR) for at least one item, comprising: obtaining OCR results of OCR scanning from at least one OCR module; creating at least one OCR seed using at least a portion of the OCR results; creating at least one OCR learn set using at least a portion of the OCR seed; and applying the OCR learn set to the at least one item to obtain additional optical character recognition (OCR) results.05-05-2011
20100150445TEXT VECTORIZATION USING OCR AND STROKE STRUCTURE MODELING - Systems and methods are described that facilitate dominant point detection for text in a scanned document. The dominant points are classified as “major” (e.g., structural) and “minor” (e.g., serif). A set of rules or parameters for each character is determined off-line. During the text vectorization, OCR is performed and the rules (parameters) associated with the recognized character are selected. Both major and minor dominant points are detected as a maximization process with the parameter set. For minor dominant points, additional processes are optionally employed.06-17-2010
20110293184METHOD OF IDENTIFYING PAGE FROM PLURALITY OF PAGE FRAGMENT IMAGES - A method of identifying a physical page containing printed text from a plurality of page fragment images captured by a camera. The method includes the steps of: placing a handheld electronic device in contact with a surface of the physical page; moving the device across the physical page and capturing the plurality of page fragment images at a plurality of different capture points; measuring a displacement or direction of movement; performing OCR on each captured page fragment image; creating a glyph group key for each page fragment image; looking up each created glyph group key in an inverted index of glyph group keys; comparing a displacement or direction between glyph group keys in the inverted index with a measured displacement or direction between the capture points for corresponding glyph group keys created using OCR; and identifying a page identity corresponding to the physical page using the comparison.12-01-2011
20110293185HYBRID SYSTEM FOR IDENTIFYING PRINTED PAGE - A hybrid system for identifying a printed page. The system includes: (i) the printed page having human-readable content and a coding pattern printed in every interstitial space between portions of human-readable content, the coding pattern being either absent from the human-readable content or unreadable when superimposed with the human-readable content; and (ii) a handheld device for overlaying and contacting the printed page. The handheld device includes: a camera for capturing page fragment images; and a processor configured for: decoding the coding pattern and determining the page identity in the event that the coding pattern is visible in and decodable from the captured page fragment image; and otherwise initiating OCR or SIFT techniques to identify the page.12-01-2011
20110293183SCANNING SYSTEM WITH OPTICAL CHARACTER RECOGNITION - A system includes an imaging device and an acquisition layer. The imaging device acquires an image. The acquisition layer is logically located between a source manager and the imaging device, the source manager being called by an application when a user of the system requests to acquire the image. The acquisition layer includes imaging acquisition logic that receives the image from the imaging device and performs optical character recognition (OCR) that extracts machine editable text from the image. The acquisition layer forwards the image to the application and makes the machine editable text available to the user.12-01-2011
20080310722IDENTIFYING CHARACTER INFORMATION IN MEDIA CONTENT - Implementations of identifying character information in media content are described. In one implementation, a frame of media content is marked with a frame identifier including one or more known characters. These known characters can uniquely identify the frame of media content. During transmission, compression, decompression, etc., of the frame, loss can occur. This loss can affect a quality of presentation of one or more of the known characters in the frame identifier. Therefore, when the frame is subsequently examined, the frame identifier can be identified, and best matches of known characters from a character recognition library can be found for characters in the frame identifier.12-18-2008
20080310723TEXT PREDICTION WITH PARTIAL SELECTION IN A VARIETY OF DOMAINS - A computing system may predict a word based on received user input that selects a part of the word (e.g., the first characters, the first root, etc.). Specifically, a program, when run on the computing system, may perform a method including creating a candidate list of words based on received user input. These words may be then organized into a hierarchy, or tree structure, in which each word is associated with a parent and each parent is a partial match for its associated words. The top-tier partial matches may be presented, and user input corresponding to a selected partial match may be received. A set of candidates related to the selected partial match may then be presented for user selection.12-18-2008
20100266205Device and Method to Assist User in Conducting A Transaction With A Machine - A device for assisting a user to perform a transaction on a machine is described. The device receives data that specifies a transaction mode to use for processing an image and accesses a knowledge base to provide data to configure the device for the transaction mode, the data including data specific to the transaction mode. The device receives an image or images of a portion of a machine that the user will use to perform the transaction and processes the image or images to identify a pattern of controls on the machine and to detect the presence of a user-controlled pointing item over controls on the machine. The device announces to the user the name or function of the control closest to an end of the user-controlled pointing item.10-21-2010
20090310863FINDING IMAGE CAPTURE DATE OF HARDCOPY MEDIUM - A method of determining the image capture date of a scanned hardcopy medium having an image side and a non-image side, includes scanning the hardcopy medium to produce a scanned digital image; detecting handwritten annotations in the scanned digital image of the hardcopy medium; and using the handwritten annotations to determine the image capture date of the hardcopy medium by analyzing the handwritten annotations to identify names of people and associated ages; providing the names and lifespan information for a set of persons likely to appear in the hardcopy medium; and using the identified names of people and the associated ages along with the lifespan information to determine the image capture date.12-17-2009
20120106845Replacing word with image of word - First data represents an image of text including words. Second data represents the text in a non-image form. A particular word within the second data is replaced with a corresponding part of the first data representing the image of the particular word.05-03-2012
20090274369IMAGE PROCESSING DEVICE, IMAGE PROCESSING METHOD, PROGRAM, AND STORAGE MEDIUM - An image processing device includes a dividing unit for dividing objects of an input image, a metadata adding unit for adding metadata to each of the divided objects by performing OCR processing and morpheme analysis, a display unit for displaying at least one of the divided objects and the metadata added to the divided object, and a metadata accuracy determining unit for determining accuracies of the added metadata. The display unit preferentially displays metadata determined as being low in accuracy by the metadata accuracy determining unit.11-05-2009
20080212877HIGH SPEED ERROR DETECTION AND CORRECTION FOR CHARACTER RECOGNITION - Systems and methods for high speed error detection and correction are disclosed. An exemplary method may include grouping character images (ci) by suspected character code (cc) to generate a set of CI(cc). The method may also include displaying the set of CI(cc) for manual verification. The method may also include determining a set of RS(cc) of representative shapes (rs) of character images codes for each CI(cc). The method may also include displaying the set of RS(cc) for manual verification.09-04-2008
20080240567Displaying text of a writing system using syntax-directed translation - A method for displaying an input string of character codes as a sequence of glyphs. In one implementation, an ordered list of instructions for transforming an input string of character codes may be generated using syntax-directed translation. The ordered list of instructions may be executed to generate a sequence of glyph indices. A sequence of glyphs corresponding to the sequence of glyph indices may be displayed.10-02-2008
20120141031ANALYSING CHARACTER STRINGS - A method for analyzing a character string, the method including: analyzing a character string to determine one of more characters of the character string; determining from a dictionary source, an alternative character string to the analyzed character string; comparing the analyzed character string with the alternative character string to determine a weighting factor for each of the characters of the analyzed character string relative to the positional arrangement of the characters in the alternative character string; and for each determined weighting factor, generating for each of the characters in the analyzed character string a corresponding character of a particular size as determined by the weighting factor.06-07-2012
20080317348IMAGE PROCESSING APPARATUS, IMAGE REPRODUCTION APPARATUS, SYSTEM, METHOD AND STORAGE MEDIUM FOR IMAGE PROCESSING AND IMAGE REPRODUCTION - An original document image is inputted as multi valued image data (original image data) from an input unit. The multivalued image data is binarized by a binary image generation unit. Then, layout analysis is performed based on the binary image data. Based on the layout information, a partial image having text-attribute is extracted and a partial image having non-text-attribute are extracted from the multi-valued image data. One of the partial images is encrypted, and the encrypted data is stored with the partial image that is not encrypted and the layout information.12-25-2008
20080317346Character and Object Recognition with a Mobile Photographic Device - Character and object recognition are provided from digital photography followed by digitization and integration of recognized textual and non-textual content into a variety of software applications for enabling use of data associated with the photographed content. A digital photograph may be processed by an optical character recognizer or optical object recognizer for generating data associated with a photographed object. A user of the photographed content may tag the photographed content with descriptive or analytical information that may be used for improving recognition of the photographed content and that may be used by subsequent users of the photographed content. Data generated for the photographed object may then be passed to a variety of software applications for use in accordance with respective application functionalities.12-25-2008
20080317347Rendering engine test system - A system to compare a reference image of a text character, word or phrase with another image of the character, word or phrase that was rendered by a text rendering engine. Differences between the reference image and the rendered image may be recorded for subsequent analysis. Performance of a text rendering engine producing text according to typographical rules applicable to a natural language can be evaluated by one with no knowledge or ability to read the natural language.12-25-2008
20110222773PARAGRAPH RECOGNITION IN AN OPTICAL CHARACTER RECOGNITION (OCR) PROCESS - An image processing apparatus for detecting paragraphs in a textual image includes an input component for receiving an input image in which textual lines and words have been identified and a page classification component for classifying the input image as a first or second page type. The apparatus also includes a paragraph detection component for classifying all textual lines on the input image as a beginning paragraph line or a continuation paragraph line. The apparatus is also provided with a paragraph creation component for creating paragraphs that include textual lines between two successive beginning paragraph lines, including a first of the two successive beginning paragraph lines. The paragraphs that have been identified may be classified by the type of alignment they exhibit. For instance, paragraphs may be classified according to whether they are left aligned, right aligned, center aligned or justified.09-15-2011
20110142344BROWSING SYSTEM, SERVER, AND TEXT EXTRACTING METHOD - In order to precisely extract a character in an image displayed at a terminal device in the case that an imaged web page is sent to the terminal device and the web page is browsed at the terminal device, a server acquires the web page from the Internet, generates the image from the acquired web page, and sends the image to a client terminal, the client terminal receives the image, displays the image on a display part, specifies a rectangular area, and sends information regarding the specified rectangular area to the server, and the server extracts the image in the rectangular area from the image of the web page, recognizes a text by an OCR process, extracts a text from a source of an HTML file which matches the recognized text most closely, and sends the extracted text to the client terminal.06-16-2011
20110229037CHARACTER RECOGNITION APPARATUS AND CHARACTER RECOGNITION METHOD - An objective is to eliminate dotted lines in a character box in image data to increase the character recognition rate. There are some cases in which a dotted line candidate cannot be extracted due to many overlapping parts of dotted lines and characters or due to a blurry part in a dotted line. In such cases, the position of a dotted line candidate is estimated referring to features such as the interval, length, width, etc. of a dotted line candidate in the same character box (or in a character box for another relevant item), and image data of the estimated position and image data of a previously extracted dotted line (or a reference dotted line) are compared to determine whether or not they are an identical dotted line.09-22-2011
20090110283METHOD AND APPARATUS FOR OPERATING, INTERFACING AND/OR MANAGING FOR AT LEAST ONE OPTICAL CHARACTERISTIC SYSTEM FOR CONTAINER HANDLERS IN A CONTAINER YARD - Methods and several apparatus embodiments are disclosed operating Optical Characteristic Systems (OCS) in a container storage and/or transfer yard supporting the automated recognition of container codes displayed on various sides of the containers being stored and/or transferred. At least one processor may initiate an operational process by an OCS mounted on a container handler to create an operational result, select the operational process based upon an operational schedule and communicate with at least one OCS to receive an image of a container being handled by the container handler to at least partly create a container code estimate for a container inventory management system. A program system directing at least one computer implementing these operations, and may reside in computer readable memory, an installation package and/or a download server. The computer readable memory may or may not be accessibly coupled to the computer.04-30-2009
20090245644OPTICAL CHARACTER READERS FOR READING CHARACTERS PRINTED ON WIRES OR WIRE SLEEVES - A scanning system for scanning a wire to determine the characters provided on the wire.10-01-2009
20110229036METHOD AND APPARATUS FOR TEXT AND ERROR PROFILING OF HISTORICAL DOCUMENTS - The present invention enables the computation of various types of information for a particular scanned and OCR recognised or retyped historical input document. It provides a global view on the “patterns” for historical language variation (text profiling) and the OCR errors most frequently found in the text (error profiling). For each of the individual tokens of the OCR output, an interpretation is given which based on the document specific information attempts to describe both, the underlying correct word of the text and the corresponding modern spelling of the word. This not only provides input for optimised OCR recognition of historical documents, but also for quality assurance and improved information retrieval.09-22-2011
20090220154IMAGE PROCESSING APPARATUS, IMAGE READING APPARATUS, IMAGE DATA OUTPUT PROCESSING APPARATUS, AND IMAGE PROCESSING METHOD - A ruled-line extraction section can be performed with high precision by providing a main-scanning ruled-line extraction section for determining whether a target pixel of binary image data of a document image is a black pixel or a white pixel, for counting the number of black pixels connected one after another upstream in a main scanning direction with respect to the target pixel of the binary image data and for, when the target pixel of the binary image data is a black pixel and when a value counted for the target pixel is not less than a main-scanning run determination threshold value that has been set in advance, generating ruled-line image data by correcting, to pixel values corresponding to black pixels, pixel values of a predetermined number of pixels connected to the target pixel upstream in the main scanning direction.09-03-2009
20100254608 METHOD AND SYSTEM FOR AIDED INPUT ESPECIALLY FOR COMPUTER MANAGEMENT TOOLS - A method of aided input especially for a computer management tool, the management tool being executed in a computer system possessing an operating system furnished with instrumentation services, characterized in that it comprises the following steps: (a) entering raw data from an exterior source, (b) extracting relevant data from said raw data, (c) using said instrumentation services to transcribe said extracted data to corresponding fields of a preexisting input interface belonging to the management tool, with a view to allowing further inputs and overall validation. Application in particular to the semi-automated input of accounting items such as supplier invoices and the like.10-07-2010
20110058742SYSTEM AND METHOD FOR DETERMINING AUTHORSHIP OF A DOCUMENT - Systems, methods, and computer-readable mediums for determining authorship of a handwritten document for which the authorship is not known. A method includes scanning a document to produce a high-quality scanned image of the document, and identifying stylus information corresponding to the document. The method includes identifying authorship information corresponding to the document, and determining an authorship of the document based on the stylus information and the authorship information. In some cases, content analysis of the document is also performed and used to determine authorship.03-10-2011
20120141030Code Recognition Method, Device and Computer Readable Storage Medium for Storing Code Recognition Method - A code recognition method includes the following steps: a first code-image block is received. Wherein, several first codes are displayed on the first code-image block. The first code-image block is partitioned into several second code-image blocks. Wherein, each of the second code-image blocks displays a second code respectively. Each of the second codes is one of the first codes. Each of the second code-image blocks is recognized as several third codes corresponding to each of the second codes respectively. Some of the neighboring second code-image blocks are combined to form several third code-image blocks. Wherein, each of the third code-image blocks displays a first code set, which comprises some of the second codes. Each of the third code-image blocks is recognized as a second code set corresponding to each of the first code sets respectively. Wherein, each of the second code sets includes the codes selected from the third codes.06-07-2012
20080310721Method And Apparatus For Recognizing Characters In A Document Image - A method of recognizing characters in a document image comprises examining the intensity of pixels in the document image and identifying a peak intensity deemed to represent foreground in the document image. A threshold level for distinguishing the foreground from background in the document image as a function of the identified peak intensity is determined. The document image is thresholded using the threshold level to identify the foreground. Character recognition is performed on the foreground of the document image.12-18-2008
20090074294DOCUMENT-IMAGE-DATA PROVIDING SYSTEM, DOCUMENT-IMAGE-DATA PROVIDING DEVICE, INFORMATION PROCESSING DEVICE, DOCUMENT-IMAGE-DATA PROVIDING METHOD, INFORMATION PROCESSING METHOD, DOCUMENT-IMAGE-DATA PROVIDING PROGRAM, AND INFORMATION PROCESSING PROGRAM - In a document-image-data providing device, a document image inputting unit is configured to input document image data. An area recognition unit is configured to recognize a text area of a document image element containing text data among document image elements constituting the document image data, and another area of a document image element containing data other than the text data. A text data acquiring unit is configured to acquire text data contained in the recognized text area. A providing unit is configured to provide, in response to a document image data request received from the information processing device, both image data generated from the input document image data to have a resolution lower than a resolution of the input document image data and the text data acquired by the text data acquiring unit, to the information processing device.03-19-2009
20130129218SYSTEM AND METHOD FOR PROCESSING RECEIPTS AND OTHER RECORDS OF USERS - A service can perform optical character recognition (OCR) on an image of a record to determine a first set of information items about the record. A second set of information items can be identified that are likely part of the record but not determinable from performing OCR on the image. Another resource can be utilized to determine the second set of information items. A classification for the record can be determined based on first and second sets of information items. The record can be associated with a financial resource of the user based at least in part on the classification.05-23-2013
20110150336Hardware Management Based on Image Recognition - Embodiments of the disclosed technology allow for the control, monitoring, and/or configuration of specialized hardware devices with proprietary interfaces from a central interface capable of interacting with one or a plurality of specialized hardware devices via respective proprietary interfaces. Such embodiments are especially useful in controlling medical equipment, such as radiology equipment at a central and/or remote location, where otherwise, only a proprietary interface at a proximate location could be used to do same.06-23-2011
20100310171METHOD AND APPARATUS FOR ANALYSIS OF A DATABASE - A method for analyzing at least one database which contains a multiplicity of reference data items, in particular for determining the quality of the database in which, in the case of a data field which has a multiplicity of objects each having one information item, data elements are determined from the data field and these are checked and confirmed by comparison with the reference data items and comparison results resulting from this are recorded. It is proposed that a legibility degree is determined for at least some of the data elements, and a state of the database is determined automatically on the basis of the legibility degree and the comparison results.12-09-2010
20110033111Processing Method And Apparatus For Recording Media Having Printed Magnetic Ink Characters - In a method of processing recording media on which magnetic ink characters are printed, the media is transported at a first speed in an upright position along a transportation path from a supply unit to a discharge unit. The magnetic characters are read and output signals representative of the reading generated. The output signals are analyzed, including comparing the output signals with previously stored signal patterns of magnetic ink characters to determine if the magnetic characters can be recognized or not. The transporting of the recording media is paused, or slowed to a second speed substantially lower than the first speed, for a period of time during the analyzing of the output signals. A processing apparatus includes components for carrying out the operations of such method.02-10-2011
20110038542COMPUTER APPLICATION ANALYSIS - A method, system, and computer program product for computer application analysis are provided. The method for computer application analysis includes monitoring a computer system on which an application to be analyzed is executed and interacted with by a user of the computer system. The monitoring includes: capturing screen data of the application as displayed on a display screen of the computer system including interpreting the screen data using optical character recognition (OCR); and capturing user inputs to the application to input devices of the computer system. The method further includes analyzing the captured screen data and user inputs to generate a summary of the usage of the application.02-17-2011
20090214116IMAGE PROCESSING METHOD, IMAGE PROCESSING APPARATUS, IMAGE FORMING APPARATUS, AND STORAGE MEDIUM - In the image processing apparatus of the present invention, when a document is read, a document matching process section determines whether the document is similar to a reference document or not. When the document is similar to the reference document, the document matching process section further determines whether the document has been zoomed (size of the document has been changed). When the document has been zoomed, an editing process section restores the size of the document to the size of the reference document. This provides an image processing apparatus capable of restoring the changed size of a document in a predetermined format such as a form document and an application document to its original size.08-27-2009
20110129153Identifying Matching Canonical Documents in Response to a Visual Query - A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing textual characters. Each character in a contiguous region of the visual query is individually scored according to its quality. The quality score of a respective character is influenced by the quality scores of neighboring or nearby characters. Using the scores, one or more high quality strings of characters are identified. Each high quality string has a plurality of high quality characters. A canonical document containing the one or more high quality textual strings is retrieved. At least a portion of the canonical document is sent to the client system.06-02-2011
20110243447METHOD AND APPARATUS FOR SYNTHESIZING SPEECH - Method and apparatus of synthesizing speech from a plurality of portion of text data, each portion having at least one associated attribute. The invention is achieved by determining (10-06-2011
20110243446CODE READING APPARATUS, SALES REGISTERING APPARATUS, AND SALES REGISTERING METHOD - According to one embodiment, a code reading apparatus includes a commodity-information reading unit, a commodity-information output unit, a benefit-information reading unit, and a benefit-information output unit. The commodity-information reading unit reads commodity information from a code symbol attached to a commodity. The commodity-information output unit outputs the commodity information read by the commodity-information reading unit. The benefit-information reading unit detects an image of benefit indication from an image imaged by an imaging unit and reads benefit information corresponding to the benefit indication from the detected image. The benefit-information output unit outputs the benefit information read by the benefit-information reading unit.10-06-2011
20110085732QR CODE PROCESSING METHOD AND APPARATUS THEREOF - A QR code processing method includes an edge processing process, a QR code positioning process and a projection modification process. The edge processing process converts an original image into a binarized input image. The QR code positioning process includes a group search process and a tag search process. The group search process includes: deriving a plurality of luminance groups according to luminance values of pixels within an input image; identifying a plurality of finder pattern groups complying with QR code finder pattern among the plurality of luminance groups according to a central point of each luminance group; and deriving position information of each finder pattern group. The tag search process derives position information of the QR code according to the position information of the finder pattern groups. The projection modification process converts the input image into a modified image according to the position information of the QR code.04-14-2011
20090028434SYSTEM AND METHOD FOR DISPLAYING CONTEXTUAL SUPPLEMENTAL CONTENT BASED ON IMAGE CONTENT - An image-based content item is analyzed to determine information about a subject of the content item. The analysis may include performing image analysis on at least an image of the content item. An inference may be programmatically made about one or more of (i) a viewer or holder of the content item, or (ii) the subject of content item.01-29-2009
20110081083GESTURE-BASED SELECTIVE TEXT RECOGNITION - An image is displayed on a touch screen. A user's underline gesture on the displayed image is detected. The area of the image touched by the underline gesture and a surrounding region approximate to the touched area are identified. Skew for text in the surrounding region is determined and compensated. A text region including the text is identified in the surrounding region and cropped from the image. The cropped image is transmitted to an optical character recognition (OCR) engine, which processes the cropped image and returns OCR'ed text. The OCR'ed text is outputted.04-07-2011
20120201461CHARACTER DETECTION APPARATUS, CHARACTER DETECTION METHOD, AND COMPUTER-READABLE STORAGE MEDIUM - A character detection apparatus is provided that detects, from an image including a first image representing a character and a second image representing a translucent object, the character. The character detection apparatus includes a calculating portion that, for each of blocks obtained by dividing an overlapping region in which the first image is overlapped by the second image, calculates a frequency of appearance of pixels for each of gradations of a property, and a detection portion that detects the character from the overlapping region based on the frequency for each of the gradations.08-09-2012
20110110592ELECTRONIC APPARATUS AND IMAGE DISPLAY METHOD - According to one embodiment, an electronic apparatus includes a text recognition module, a group creation module, a group extraction module, an arrangement module, and a movie generator. The text recognition module recognizes a character string in a plurality of still images. The group creation module creates a plurality of groups by classifying the plurality of still images. The group extraction module extracts, from the plurality of groups, groups including a still image which meets a predetermined condition. The arrangement module arranges still images included in the extracted groups in a predetermined order, and inserts a still image included in the extracted groups and including the character string at a predetermined position of the still images which are arranged. The movie generator generates movie data for successively displaying the arranged still images in the extracted groups.05-12-2011
20110052064METHOD FOR PROCESSING OPTICAL CHARACTER RECOGNITION (OCR) OUTPUT DATA, WHEREIN THE OUTPUT DATA COMPRISES DOUBLE PRINTED CHARACTER IMAGES - The present invention is related to a method of processing of output data from an Optical Character Recognition (OCR) system, wherein the output data comprises images of double printed characters. The method identifies the respective members of a suspected double printed character image by first providing a set of single character template images from images of characters identified in the text being processed by the OCR system, then combining the single character templates providing candidate models for the suspected double printed character image. Correlation between each respective candidate model and the suspected double printed character image provides an indication of which pair of modelled single template character images that most probable are he correct identification of the respective character images in the double printed character image.03-03-2011
20100215272AUTOMATIC FILE NAME GENERATION IN OCR SYSTEMS - Methods and system for processing document images in OCR systems, particularly for selecting a proper file name for a recognized document. The method comprises generating at least one document type hypothesis for the document; verifying each document type hypothesis; selecting a best document type hypothesis and saving the document with a proper name based on the best type hypothesis and unique features. The method further includes determining a logical structure of a document and selecting a best document model hypothesis that has the best degree of correspondence with the selected best block hypotheses for the document. On the basis of the best document model hypothesis the text document reflecting the logical structure of the source document in extended computer-editable format is formed and saved with a proper file name.08-26-2010
20100067794OPTICAL CHARACTER RECOGNITION VERIFICATION - A method for optical character recognition (OCR) verification, the method includes: receiving a first character image that was obtained from applying an OCR process on a document; wherein the first character image is classified, by the OCR, as being associated with a first character; receiving a first character code of a text; replacing the first character code by the first character image; and evaluating a correctness of the OCR based upon a response of a user to a display of the text first character image.03-18-2010
20100067795METHOD AND APPARATUS FOR PATTERN PROCESSING - An apparatus for pattern processing exhibits a discretizing device for discretizing an input pattern, a device for generating a number n of discrete variants of the quantized input pattern in accordance with established rules, a number n of input stages (03-18-2010
20110305393TECHNIQUES IN OPTICAL CHARACTER RECOGNITION - An image deskew system and techniques are used in the context of optical character recognition. An image is obtained of an original set of characters in an original linear (horizontal) orientation. An acquired set of characters, which is skewed relative to the original linear orientation by a rotation angle, is represented by pixels of the image. The rotation angle is estimated, and a confidence value may be associated with the estimation, to determine whether to deskew the image. In connection with rotation angle estimation, an edge detection filter is applied to the acquired set of characters to produce an edge map, which is input to a linear hough transform filter to produce a set of output lines in parametric form. The output lines are assigned scores, and based on the scores, at least one output line is determined to be a dominant line with a slope approximating the rotation angle.12-15-2011
20120039537METHOD, APPARATUS, AND SYSTEM FOR WORKFLOW PARTICIPATION OF AN IMAGING DEVICE - A method, apparatus, and system for communicating between an apparatus hosting a workflow application and an imaging device, the system including a state engine configured to read and extract data from a first message received from the imaging device, to communicate with an application component, and to advance to a workflow state, a state translator configured to receive the workflow state from the state engine, to convert the workflow state into an imaging device instruction, and to send the imaging device instruction to the imaging device, a state instantiater configured to change a state of a component of the imaging device in accordance with the imaging device instruction, an event responder configured to assemble data in a second message based on the changed state of the component of the imaging device, and an interface configured to send the second message to the apparatus.02-16-2012
20120099792ADAPTIVE OPTICAL CHARACTER RECOGNITION ON A DOCUMENT WITH DISTORTED CHARACTERS - A computer implemented method for adaptive optical character recognition on a document with distorted characters includes performing a distortion-correction transformation on a segmented character of the document assuming the segmented character to be a candidate character. The method further includes comparing the transformed segmented character to the candidate character by calculating a comparison score. If the calculated score is within a predetermined range, the segmented character is identified with the candidate character. The method may be implemented in either of computer hardware configured to perform the method, or in computer software embodied in a non-transitory, tangible, computer-readable storage medium. Also disclosed are corresponding computer program product and data processing system.04-26-2012
20100303356METHOD FOR PROCESSING OPTICAL CHARACTER RECOGNITION (OCR) DATA, WHEREIN THE OUTPUT COMPRISES VISUALLY IMPAIRED CHARACTER IMAGES - The present invention provides a method for an Optical Character Recognition (OCR) system providing recognition of characters that are partly hidden by crossing outs due to for example an imprint of a stamp, handwritten signatures, etc. The method establishes a set of template images of certainly recognized characters from the image of the text being processed by the OCR system, wherein the effect of the crossed out section is modelled into the template images before comparing these images with the image of a visually impaired crossed out character. The modelled template image having the highest similarity with the visually impaired crossed out character is the correct identification for the visually impaired character instance.12-02-2010
20120207392OPTICAL IMAGING AND ANALYSIS OF A GRAPHIC SYMBOL - Method, computer program product, and apparatus are provided for identifying a graphic symbol within an image obtained by optical scanning. An image intensity is measured for each of a plurality of columns of the image, wherein each column has a length that extends across the graphic symbol in a first direction, and wherein the plurality of columns collectively extend across the graphic symbol in a second direction. The graphic symbol is then identified by matching a profile of the image intensity to a predetermined image intensity profile associated with a given graphic symbol. Optionally, the image is a digital image and the image intensity for each column is the sum of the image intensity for each pixel in that individual column. An image intensity differential between adjacent columns may be calculated for matching with a predetermined differential profile or comparison with an electronic profile generated by a magnetic scan.08-16-2012
20120008865SYSTEM AND METHOD OF DETERMINING BUILDING NUMBERS - A system and method is provided for automatically recognizing building numbers in street level images. In one aspect, a processor selects a street level image that is likely to be near an address of interest. The processor identifies those portions of the image that are visually similar to street numbers, and then extracts the numeric values of the characters displayed in such portions. If an extracted value corresponds with the building number of the address of interest such as being substantially equal to the address of interest, the extracted value and the image portion are displayed to a human operator. The human operator confirms, by looking at the image portion, whether the image portion appears to be a building number that matches the extracted value. If so, the processor stores a value that associates that building number with the street level image.01-12-2012
20110081084CHARACTER RECOGNITION DEVICE, MOBILE COMMUNICATION SYSTEM, MOBILE TERMINAL DEVICE, FIXED STATION DEVICE, CHARACTER RECOGNITION METHOD AND CHARACTER RECOGNITION PROGRAM - Words possibly included in a scene image shot by a mobile camera can be efficiently extracted using a word dictionary or a map database. Positional information acquiring means 04-07-2011
20110103689SYSTEM AND METHOD FOR OBTAINING DOCUMENT INFORMATION - A method and system for determining at least one target value of at least one target in at least one document, comprising: determining, utilizing at least one scoring application; at least one possible target value, wherein the at least one scoring application utilizes information from at least one training document, and applying the information, utilizing the at least one scoring application, on the at least one new document to determine at least one value of the at least one target on the at least one new document.05-05-2011
20100092088Methods and data structures for improved searchable formatted documents including citation and corpus generation - Searchable annotated formatted documents are produced by correlating documents stored as a photographic or scanned graphic representations of an actual document (evidence, report, court order, etc.) with textual version of the same documents. A produced document will provide additional details in a data structure that supports citation annotation as well as other types of analysis of a document. The data structure also supports generation of citation reports and corpus reports. A method of creating searchable annotated formatted documents including citation and corpus reports by correlating and correcting text files with photographic or scanned graphic of the original documents. Data structures for correlating and correcting text files with graphic images. Generation of citation reports, concordance reports, and corpus reports. Data structures for citation reports, concordance reports, and corpus reports generation.04-15-2010
20120213442CHARACTER RECOGNITION APPARATUS, CHARACTER RECOGNITION METHOD, AND COMPUTER READABLE MEDIUM STORING PROGRAM - A character recognition apparatus includes an acquisition unit, a specification unit, a movement unit, and a recognition unit. The acquisition unit acquires data representing a character string. The specification unit specifies an element of a compound character satisfying a predetermined condition for determining the compound character from the character string. The movement unit moves the element of the compound character close to an adjacent character. The recognition unit recognizes a changed character string in which the movement unit has moved the element of the compound character, based on a shape of characters and relevance between adjacent characters.08-23-2012
20120128251Identifying Matching Canonical Documents Consistent with Visual Query Structural Information - A server system receives a visual query from a client system, performs optical character recognition (OCR) on the visual query to produce text recognition data representing textual characters, including a plurality of textual characters in a contiguous region of the visual query. The server system also produces structural information associated with the textual characters in the visual query. Textual characters in the plurality of textual characters are scored. The method further includes identifying, in accordance with the scoring, one or more high quality textual strings, each comprising a plurality of high quality textual characters from among the plurality of textual characters in the contiguous region of the visual query. A canonical document that includes the one or more high quality textual strings and that is consistent with the structural information is retrieved. At least a portion of the canonical document is sent to the client system.05-24-2012
20120128250Generating a Combination of a Visual Query and Matching Canonical Document - A server system receives a visual query from a client system distinct from the server system, performs optical character recognition (OCR) on the visual query to produce text recognition data representing textual characters, including a plurality of textual characters in a contiguous region of the visual query, and scores each textual character in the plurality of textual characters. The server system identifies, in accordance with the scoring, one or more high quality textual strings, each comprising a plurality of high quality textual characters from among the plurality of textual characters in the contiguous region of the visual query; retrieves a canonical document having the one or more high quality textual strings; generates a combination of the visual query and at least a portion of the canonical document; and sends the combination to the client system.05-24-2012
20120134589Optical character recognition (OCR) engines having confidence values for text types - An image of a known text sample having a text type is generated. The image of the known text sample is input into each OCR engine of a number of OCR engines. Output text corresponding to the image of the known text sample is received from each OCR engine. For each OCR engine, the output text received from the OCR engine is compared with the known text sample, to determine a confidence value of the OCR engine for the text type of the known text sample.05-31-2012
20120134590Identifying Matching Canonical Documents in Response to a Visual Query and in Accordance with Geographic Information - A server system receives a visual query from a client system distinct from the server system. The server system performs optical character recognition (OCR) on the visual query to produce text recognition data representing textual characters, including a plurality of textual characters in a contiguous region of the visual query. The server system scores each textual character in the plurality of textual characters in accordance with the geographic location of the client system. The server system identifies, in accordance with the scoring, one or more high quality textual strings, each comprising a plurality of high quality textual characters from among the plurality of textual characters in the contiguous region of the visual query. Then the server system retrieves a canonical document having the one or more high quality textual strings and sends at least a portion of the canonical document to the client system.05-31-2012
20120076415COMPUTER AIDED VALIDATION OF PATENT DISCLOSURES - A method and system for analyzing a patent disclosure is disclosed. The method and system comprise a computerized cross-check of reference labels within drawings of a disclosure to reference labels found within the text of the disclosure, and generating warnings for reference labels that are missing from either the drawings or the text.03-29-2012
20120314954EMBEDDED FORM EXTRACTION DEFINITION TO ENABLE AUTOMATIC WORKFLOW CONFIGURATION - A system and methods are disclosed to automatically extract data from documents, such as scanned paper forms and/or digital forms that need to be pre-configured to understand a layout for the forms to be processed. The system extracts data from the form definition at a two dimensional barcode and dynamically configures a workflow with services for extracting desired user filled information from the data fields present on the form. Support for a re-flowable service is provided.12-13-2012
20100272360METHOD FOR OUTPUTTING CONSECUTIVE CHARACTERS IN VIDEO-RECORDING MODE - The invention discloses a method for outputting consecutive characters in a video-recording mode. The method includes obtaining a first image and a second image from an object, comparing the first image and the second image to obtain a third image which is the overlapping part of the first image and the second image, removing the third image from the second image to generate a fourth image, integrating the fourth image with the first image to obtain a fifth image and recognize characters on the fifth image by OCR software and output the characters of the fifth image.10-28-2010
20100272359METHOD FOR RESOLVING CONTRADICTING OUTPUT DATA FROM AN OPTICAL CHARACTER RECOGNITION (OCR) SYSTEM, WHEREIN THE OUTPUT DATA COMPRISES MORE THAN ONE RECOGNITION ALTERNATIVE FOR AN IMAGE OF A CHARACTER - The present invention is related to a method for resolving contradicting output data from an Optical Character Recognition (OCR) system providing a conversion of pixelized documents into computer coded text as the output data, wherein the OCR output data comprises at least a first and second character listed as being likely candidates for an exemplar of a same sampled character instance from the pixelized document, by providing steps that identify locations of differences in graphical appearance between the candidate characters, and then using the location information to identify a corresponding locations in the sampled character instance. Based on correlation technique, this location information is used to select the correct candidate character as the identification of the sampled character instance.10-28-2010
20120251004IMAGE PROCESSING APPARATUS AND IMAGE PROCESSING METHOD - An image processing apparatus supports image processing in multiple languages via a user interface, a determining unit, a setting unit, and a character recognizing unit. The user interface sets an instruction from a user for various functions performed by the image processing apparatus. The user interface displays characters in a language. The determining unit automatically determines the language currently used for the characters displayed in the user interface of the various functions. The setting unit sets, in response to the determining unit automatically determining the language currently used for the characters displayed in the user interface, the determined language as a scanned document language for use in recognizing characters in a scanned document which is obtained by scanning a paper document. The character recognizing unit utilizes the scanned document language set by the setting unit to recognize characters in the scanned document and create text data.10-04-2012
20120257832INFORMATION PROCESSING APPARATUS AND METHOD, PROGRAM, AND IMAGING APPARATUS - An information processing apparatus includes: a character recognition processing portion which performs a character recognition processing with respect to a character string region in an image; a character string information extraction portion which extracts character string information being information related to a character string from the character string in which a character is recognized by the character recognition processing portion; a display character string generation portion which generates a display character string of a character font corresponding to the character string information which is extracted by the character string information extraction portion; and a display control portion which performs control so as to display the display character string in the vicinity of the character string region in the image.10-11-2012
20120082382DISTRIBUTED DOCUMENT PROCESSING - A system for document processing including decomposing an image of a document into at least one data entry region sub-image, providing the data entry region sub-image to a data entry clerk available for processing the data entry region sub-image, receiving from the data entry clerk a data entry value associated with the data entry region sub-image, and validating the data entry value.04-05-2012
20120230587SYSTEMS AND METHODS FOR TESTING CONTENT OF MOBILE COMMUNICATION DEVICES - The embodiments described herein relates to systems and method for testing user content of given mobile communication devices. According to one aspect, there is provided a method for testing user content of given mobile communication devices that includes the steps of providing at least one model image associated with at least one graphical user interface (“GUI”) screen of a model mobile communication device corresponding to the given mobile communication device, obtaining at least one test image associated with at least one GUI screen of the given mobile communication device, comparing the test image with the model image, and determining whether the user content of the given mobile communication device is different from the desired content of the model mobile communication device.09-13-2012
20120093415Dynamic Recognition of Web Addresses in Video - One embodiment described herein may take the form of a system or method for dynamically recognizing an Internet address within a video or audio component of a multimedia presentation on a distribution system or network such as, but not limited to, a satellite, cable or Internet network. In general, the embodiment may analyze the audio portion of the presentation or one or more frames of a video component to detect the presence of a web address within the one or more frames. In the embodiment where the audio portion is analyzed, the system may perform a voice recognition or a similar analysis on the audio portion to detect the utterance of a web address. Similarly, one embodiment analyzing the one or more frames of the video component may comprise performing an optical character recognition (OCR) of the frame.04-19-2012
20120269438IMAGE PROCESSING APPARATUS - The object of this invention is to provide an image processing apparatus in which, in processing of a document image read by a document reading device, an inclination of a character string in the document image which is recognized in character recognition is obtained more accurately. The image processing apparatus includes a similar character extraction portion which extracts and outputs a character group comprised of characters having a shape and a size that are same with or similar to each other from among characters constituting a character string comprised of a character recognized in optical character recognition from a document image read by a document reading device; and an inclination calculation portion which calculates an inclination value of the character string based on position information of each character of the character group output from the similar character extraction portion.10-25-2012
20110255784SYSTEMS AND METHODS FOR AUTOMATICALLY EXTRACTING DATA FROM ELETRONIC DOCUMENTS USING MULTIPLE CHARACTER RECOGNITION ENGINES - In a document analysis system that receives and processes jobs from a plurality of users, in which each job may contain multiple electronic documents, to extract data from the electronic documents, a method of automatically extracting data from each received electronic document using a plurality of character recognition engines is provided. The method includes: automatically processing each received electronic document page using each of a plurality of recognition engines to extract data; comparing quality of data extracted from each of the recognition engines to assign a confidence score to the extracted data; and selecting extracted data having highest confidence score as the correct extracted data.10-20-2011
20120087587Binarizing an Image - The invention provides various methods and techniques for binarizing an image, generally in advance of further processing such as optical character recognition (OCR). One step includes establishing boundaries of image objects of an image and classifying each image object as either suspect or non-suspect. Another step includes creating a local binarization threshold map that may include or store threshold binarization values associated with image objects classified as non-suspect. Yet another step includes expanding the local binarization threshold map to cover the entire image thereby creating a global binarization threshold map for the entire image. The methods and techniques are capable of identifying and working with separation objects and incuts in images.04-12-2012
20120321191READING ORDER DETERMINATION APPARATUS, METHOD, AND PROGRAM FOR DETERMINING READING ORDER OF CHARACTERS - A method and apparatus for determining a reading order of characters The method includes preparing a list of character information, which is character information extracted from image data by character recognition processing and preparing a list of line information, which is made up of a line box surrounding a set of characters which are continuously aligned in the same direction in image data and an alignment direction of characters in the line box. In response to a request for adding character information to the list of character information, extracting a line box containing a character region of the character to be added, obtaining all character information having the character region contained in the concerned line box from the list of character information and rearranging according to the position with respect to the alignment direction of characters corresponding to the line box to determine a new reading order of characters.12-20-2012
20120288202INTERIOR LOCATION IDENTIFICATION - A parse module calibrates an interior space by parsing objects and words out of an image of the scene and comparing each parsed object with a plurality of stored objects. The parse module further selects a parsed object that is differentiated from the stored objects as the first object and stores the first object with a location description. A search module can detect the same objects from the scene and use them to determine the location of the scene.11-15-2012
20100177965IMAGE PROCESSING APPARATUS, CONTROL METHOD THEREFOR, AND RECORDING MEDIUM - Even if an image processing apparatus which can recognize a certain character string is available on the network, processing results of an OCR process are determined by character recognition ability of an image processing apparatus which has happened to perform the OCR process. Thus, after an MFP performs a character recognition process based on image data contained in a character region of an image, if it is determined that processing results of the character recognition process are highly likely to contain recognition errors, the processing results are output to another MFP together with first information which indicates a high likelihood of the processing results containing recognition errors. Upon acquiring the processing results, the other MFP with higher character recognition capabilities performs a character recognition process on the image data contained in the character region if the first information is attached.07-15-2010
20130022272METHOD OF AND DEVICE FOR IDENTIFYING DIRECTION OF CHARACTERS IN IMAGE BLOCK - The present embodiments disclose a method of and a device for identifying the direction of characters in an image block. The method includes: performing optical character recognition processing on the image block by assuming various directions as assumed character directions, respectively, to obtain sub image blocks, recognized characters corresponding to the sub image blocks and correctness measures thereof in each of the assumed character directions; determining a language group to which the characters in the image block belong; adjusting a correctness measure corresponding to a sub image block which corresponds to a recognized character not belonging to the determined language group in each of the assumed character directions; calculating an accumulative correctness measure in each of the assumed character directions based on the adjusted correctness measure; and identifying the direction of the characters in the image block according to the accumulative correctness measures.01-24-2013
20130022270Optical Character Recognition of Text In An Image for Use By Software - A computer implemented method and apparatus that can OCR an image, or selected portions of an image, and then provide options to a user for use of the results of the OCR, including passing the results of the OCR to a software program so the software program can perform some action on the results of the OCR.01-24-2013
20130022271METHOD OF AND DEVICE FOR IDENTIFYING DIRECTION OF CHARACTERS IN IMAGE BLOCK - The embodiments disclose a method of and a device for identifying direction of characters in image block. The method includes: performing optical character recognition processing on the image block by assuming various directions as assumed character directions to obtain sub image blocks, recognized characters and correctness measures in each assumed direction; in sub image blocks in the assumed directions with a 180° mutual relation, searching for a minimum matching pair; when there is one sub image block in each assumed direction in a minimum matching pair and recognized characters belonging to the minimum matching pair are the same rotation invariant character or belong to the same rotation invariant character pair, adjusting their correctness measures to the same; calculating an accumulative correctness measure in each assumed direction based on the adjusted results; and identifying the direction of the characters in the image block according to the accumulative correctness measures.01-24-2013
20080253657Geometric parsing of mathematical expressions - A processing device may parse a group of strokes representing a mathematical expression. The group of strokes may be examined to determine whether the group of strokes satisfies any of a finite set of rules. When the group of strokes, included in a region, satisfies any of the finite set of rules, the region may be partitioned according to a satisfied one of the finite set of rules. The group of strokes included in the region may be further examined to determine whether the group of strokes may be further partitioned according to any of the finite set of rules. After all regions have been examined and no further partitioning of regions may be performed, all mathematical symbols of the mathematical expression may be isolated in at least some of the regions and may be recognized.10-16-2008
20100086209METHOD OF IMAGING POSITION-CODING PATTERN HAVING TAG COORDINATES ENCODED BY BIT-SHIFTED SUBSEQUENCES OF CYCLIC POSITION CODE - A method of decoding a position-coding pattern disposed on a surface of a substrate. The method comprises the steps of: (a) operatively positioning an optical reader relative to the surface and capturing an image of a portion of the coding pattern; (b) sampling a windowed subsequence of a cyclic code sequence; (c) identifying a coordinate codeword using the windowed subsequence; and (d) determining a position of the optical reader from the coordinate codeword. The imaged portion has a diameter of more than one tag diameter and less than two tag diameters.04-08-2010
20130177246Identification and Separation of Form and Feature Elements from Handwritten and Other User Supplied Elements - A system and methods for progressive feature evaluation of an electronic document image to identify user supplied elements is disclosed. The system includes a controller in communication with a storage device configured to receive and accessibly store a generated plurality of candidate images. The controller is operable to analyze the electronic document image to identify a first feature set and a second feature set, wherein each of the first and second feature sets represent a different form feature, compare the first feature set to the second feature set, and define a third feature set based on the intersection of the first and second feature sets, wherein the third feature sets represents the user provided elements.07-11-2013
20130170752IMAGE, AUDIO, AND METADATA INPUTS FOR KEYWORD RESOURCE NAVIGATION LINKS - A system, method, and computer-readable medium, is described that implements a resource navigation links tool that receives one or more inputs, extracts information from the inputs into a submission string, submits the submission string to a resource navigation links tool, and receives resource navigation links based on the submission string. Inputs types may include images, audio clips, and metadata. The inputs sources may be processed to extract information related to the image source to build the submission string.07-04-2013
20130129217COLLECTION AND USE OF MONITORED DATA - A device is configured to capture an image of a monitoring device display, perform optical character recognition to identify alphanumeric data in the image, apply a device profile to map each identified alphanumeric datum to a parameter associated with the monitoring device; and store each datum along with its associated parameter.05-23-2013
20130114900METHODS AND APPARATUSES FOR MOBILE VISUAL SEARCH - Methods, apparatuses, and computer program products are herein provided for providing a REVV system that is configured to provide an MVS that is operable on a mobile terminal. One example method may include causing a plurality of vector word residuals to be aggregated for at least one visual word using local feature descriptors extracted from an image. The method may further include causing the dimensionality of the aggregated at least one vector word residual for each visual word to be reduced by using a classification aware linear discriminant analysis. The method may further include computing, using a processor, a weighted correlation for at least one compact image signature that is binarized from the aggregated at least one vector word residual when compared to a list of candidates. The method may further include determining a ranked list of candidates based on the computed weighted correlation.05-09-2013
20130121580ANALYSIS OF SERVICE DELIVERY PROCESSES BASED ON INTERROGATION OF WORK ASSISTED DEVICES - A method of monitoring input devices to discover units of work and type of work includes recording uses of input devices of a computer, analyzing the recorded uses against pre-defined use patterns to determine sets of the recorded uses that correspond to one of a plurality of units of work, and outputting an indicator indicating which of the units of work have occurred. A method of accessing a call center includes performing speech to text transcription on audio recordings from the center, determining an identifier identifying an operator for a call from the text, estimating a phase of the call based on the text, recording ant entry including the identifier, the phase, and a time period of the phase, correlating the entry with another entry including information on an application run during the estimated phase to generate a correlated entry, and determining quality level of operator based on correlated entry.05-16-2013
20130121581IDENTIFICATION METHOD AND APPARATUS OF CONFUSABLE CHARACTER - An identification method and apparatus of confusable character are provided. The method involves: the detected character image is identified to gain the initial character information which is corresponding to the character image; the step change times of the corresponding external outline of the character image are counted if the initial character information is the confusable character; the final character information corresponding to the character image is confirmed according to the step change times; The final character information of the character image can be known conveniently according to the step change times, therefore the corresponding correct character information of the character image can be identified more precisely. The possibility of wrong identification of the character image because of the appearing confusable character can be reduced, and the identification precision rate of the confusable character can be improved.05-16-2013
20110268361Method for Locating and Decoding Distorted Two-Dimensional Matrix Symbols - A method is presented for processing an image of a two-dimensional (2D) matrix symbol having a plurality of data modules and a discontinuous finder pattern, each distorted by “donut effects”. A resulting processed image contains an image of the 2D matrix symbol having a continuous finder pattern suitable for conventional 2D matrix symbol locating techniques, and having a plurality of data modules, each data module having a center more truly representative of intended data, and suitable for conventional 2D matrix symbol sampling and decoding. The method includes sharpening the distorted image of the 2D matrix symbol to increase a difference between low frequency and high frequency image feature magnitudes, thereby providing a sharpened image, and smoothing the sharpened image using a moving window over the sharpened image so as to provide a smoothed image, the moving window and a module of the 2D matrix code being of substantially similar size.11-03-2011
20130129219PATTERN RECOGNITION APPARATUS, PATTERN RECOGNTION METHOD, IMAGE PROCESSING APPARATUS, AND IMAGE PROCESSING METHOD - An image and supplementary information of the image, such as a photographing point and time, are input by an image input section and are stored in an image data storage section. Character recognition in the image is performed by a character recognition section, and the recognition result is stored in a character recognition result storage section. An analysis section extracts object character information relevant to an object from the image, the supplementary information, and the character recognition result on the basis of the analysis conditions input in a designation section to thereby analyze an object, and the analysis result is output to a result output section. Accordingly, a change in the object can be analyzed by analyzing a change in character patterns indicating the identical object.05-23-2013
20130142430INFORMATION PROCESSING APPARATUS AND INFORMATION PROCESSING METHOD - An information processing apparatus encodes an input pattern to a code including a plurality of bits, calculates reliabilities for respective bits of the code, generates a similar codes each similar to the code based on the reliabilities, and recognizes the input pattern based on the code and the similar codes.06-06-2013
20130156317Enhanced Note Processing - Techniques and systems are disclosed to perform, in some examples, the steps of receiving a note or an image of a note, imaging at least a portion of the note, determining a value of at least one field indicated by a predetermined identifier of the note through character and mark recognition, and storing information regarding the note in a memory. The information regarding the note that may be stored in a memory may be forwarded to a regulatory agency or an external entity for reporting or record-keeping.06-20-2013
20110211759CHARACTER RECOGNITION APPARATUS AND METHOD BASED ON CHARACTER ORIENTATION - A character recognition apparatus and method based on a character orientation are provided, in which an input image is binarized, at least one character area is extracted from the binarized image, a slope value of the extracted at least one character area is calculated, the calculated slope value is set as a character feature value, and a character is recognized by using a neural network for recognizing a plurality of characters by receiving the set character feature value. Accordingly, the probability of wrongly recognizing a similar character decreases, and a recognition ratio of each character increases.09-01-2011
20110222772RESOLUTION ADJUSTMENT OF AN IMAGE THAT INCLUDES TEXT UNDERGOING AN OCR PROCESS - An optical character recognition process characterizes text lines in a textual image by their base-line, mean-line and x-height. The base-line for at least one text line in the image is determined by finding a parametric curve that maximizes a first fitness function that depends on the values of pixels through which the parametric curve passes and pixels below the parametric curve. The base-line corresponds to the parametric curve for which the first fitness function is maximized. The first fitness function is designed so that it increases with increasing lightless or brightness of pixels immediately below the parametric curve while also increasing with decreasing lightness of pixels through which the parametric curve passes. The mean-line is determined by incrementally shifting the base-line upward by predetermined amounts (e.g., a single pixel) until a second fitness function for the shifted base-line is maximized. The second fitness function is essentially the inverse of the first fitness function. Specifically, the second fitness function increases with increasing lightless of pixels immediately above the shifted base-line while also increasing with decreasing lightness of pixels through which the shifted base-line passes. The x-height is equal to the sum of the predetermined amounts by which the base-line is shifted upward in order to maximize the second fitness function. In some cases different groups of text-lines in the textual image may be characterized differently from one another. For example, each group may be characterized by a most probable x-height for that group.09-15-2011
20110255785Character area extracting device, imaging device having character area extracting function, recording medium saving character area extracting programs, and character area extracting method - A character area extracting device includes a reflective and non-reflective area separation unit separating image data into reflective and non-reflective areas, and binarizing the image data by changing a first threshold value when it is inappropriate; a reflective area binarizing unit separating the reflective area into character and background areas, and binarizing it by changing a second threshold value when it is inappropriate; a non-reflective area binarizing unit separating the non-reflective area into the character and background areas, and binarizing it by changing a third threshold value when it is inappropriate; a reflective and non-reflective area separation evaluation unit; and a line extracting unit connecting the character areas of the reflective and non-reflective areas and extracting positional information of the connected character areas in the image data.10-20-2011
20100316295IMAGE PROCESSING METHOD, IMAGE PROCESSING APPARATUS, IMAGE FORMING APPARATUS, AND STORAGE MEDIUM - An image processing apparatus includes: a division section for dividing input image data into portions; an orientation determining section for calculating reliabilities of directions of image data of each portion when the directions are regarded as orientations, and setting an orientation with the highest reliability as an orientation of each portion; a display control section for generating display image data including an image of a target portion whose reliability of an orientation is less than a predetermined value and images of designation regions from which a user's input to designate the orientation of the target portion is entered; and a character recognition section for recognizing characters of each portion in such a manner that the orientation is designated from the designation regions or set by the orientation determining section. This allows prompt recognition of characters of a portion whose reliability of orientation is low, in accordance with a right orientation.12-16-2010
20120281920PARALLEL TEST PAYLOAD - A parallel test payload includes a bit sequence configured to be segmented into a plurality of sub-sequences having variable bit length carriers. Respective carriers are represented uniformly in each one of the plurality of sub-sequences.11-08-2012
20110311140Selecting Representative Images for Establishments - Establishments are identified in geo-tagged images. According to one aspect, text regions are located in a geo-tagged image and text strings in the text regions are recognized using Optical Character Recognition (OCR) techniques. Text phrases are extracted from information associated with establishments known to be near the geographic location specified in the geo-tag of the image. The text strings recognized in the image are compared with the phrases for the establishments for approximate matches, and an establishment is selected as the establishment in the image based on the approximate matches. According to another aspect, text strings recognized in a collection of geo-tagged images are compared with phrases for establishments in the geographic area identified by the geo-tags to generate scores for image-establishment pairs. Establishments in each of the large collection of images as well as representative images showing each establishment are identified using the scores.12-22-2011
20130188872INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND RECORDING MEDIUM THAT HAS RECORDED INFORMATION PROCESSING PROGRAM - An appropriate search is carried out even with images including a complicated layout structure, decorated characters, and so on. An image search device 07-25-2013
20120020565Selecting Representative Images for Establishments - Establishments are identified in geo-tagged images. According to one aspect, text regions are located in a geo-tagged image and text strings in the text regions are recognized using Optical Character Recognition (OCR) techniques. Text phrases are extracted from information associated with establishments known to be near the geographic location specified in the geo-tag of the image. The text strings recognized in the image are compared with the phrases for the establishments for approximate matches, and an establishment is selected as the establishment in the image based on the approximate matches. According to another aspect, text strings recognized in a collection of geo-tagged images are compared with phrases for establishments in the geographic area identified by the geo-tags to generate scores for image-establishment pairs. Establishments in each of the large collection of images as well as representative images showing each establishment are identified using the scores.01-26-2012
20120020564SHAPE CLUSTERING IN POST OPTICAL CHARACTER RECOGNITION PROCESSING - Techniques for shape clustering and applications in processing various documents, including an output of an optical character recognition (OCR) process. The output of an OCR process is classified into a plurality of clusters of clip images and a representative image for each cluster is generated to identify clusters whose clip images were incorrectly assigned character codes by the OCR process.01-26-2012
20120020563Systems and Methods for Automated Extraction of Measurement Information in Medical Videos - Systems and methods providing automated extraction of information contained in video data and uses thereof are described. In particular, systems and associated methods are described that provide techniques for extracting data embedded in video, for example measurement-value pairs of medical videos, for use in a variety of applications, for example video indexing, searching and decision support applications.01-26-2012
20120020562CAMERA-VISION SYSTEMS, USED IN COLLABORATION WHITEBOARDS, FOR PRE-FORMATTED, REUSABLE, ANNOTATABLE, MOVABLE MENUS AND FORMS. - Systems and devices for, and methods of, image-based processing where a device embodiment comprises: (a) a processor; (b) an addressable memory, the memory comprising a set one or more image references, and where the set of image references comprises a rule of interpretation and a rule of execution; and the processor is configured to: (1) compare captured surface indicia of a sheet with the set of at least one image reference; (2) determine the image reference associated with the surface indicia based on the comparison of the surface indicia and the set of at least one image reference; (3) extract a marking by differencing the surface indicia and the image reference; (4) interpret the extracted marking based on the rule of interpretation associated with the image reference; and (5) invoke the rule of execution based on the rule of interpretation.01-26-2012
20130195360LOWER MODIFIER DETECTION AND EXTRACTION FROM DEVANAGARI TEXT IMAGES TO IMPROVE OCR PERFORMANCE - Systems, apparatus and methods for extracting lower modifiers from a word image, before performing optical character recognition (OCR), based on a plurality of tests comprising a first test, a second test and a third test are presented. The method obtains the word image and performing a plurality of tests (e.g., a first test, a second test and a third test). The first test determines whether a vertical line spanning the height of the word image exists. The second test determines whether a jump of a number of components in the lower portion of the word image exists. The third test determines sparseness in a lower portion of the word image. The plurality of tests may run sequentially and/or in parallel. Results from the plurality of tests are used to decide whether a lower modifier exists by comparing and accumulating test results from the plurality of tests.08-01-2013
20130202208INFORMATION PROCESSING DEVICE AND INFORMATION PROCESSING METHOD - An information processing device comprises a word string acquirer which acquires a word string that is a target of analysis; a partial string extractor which extracts, using two words on either side of each space in the word string, a partial string containing one word but not the other, a partial string not containing the one word but containing the other, and a partial string containing both words from the word string; a division coefficient acquirer which acquires, for each partial string, division coefficients indicating degree of reliability in dividing the partial string by respective division patterns that divide the partial string into words; a probability coefficient acquirer which calculates a coefficient indicating probability that the word string is divided at the space based on the division coefficients; and an ouputter which determines division of the word string based on the coefficient, and divides and outputs the word string.08-08-2013
20130202207METHOD, SERVER, AND COMPUTER-READABLE RECORDING MEDIUM FOR ASSISTING MULTIPLE USERS TO PERFORM COLLECTION SIMULTANEOUSLY - The present invention relates to a method for assisting multiple users to perform a collection simultaneously. The method includes the steps of: (a) acquiring digital data created with respect to recognition reference information of an object from a terminal of each of the multiple users; (b) determining or recognizing whether the respective digital data on the recognition reference information acquired through the terminals were created within a preset place condition and whether the respective digital data on the recognition reference information acquired through the terminals were created within a preset scope of the time; (c) selecting a specified group of users, including a first to an n-th user among the multiple users, who create the digital data within the preset place condition and within the preset scope of the time; and (d) providing information on rewards corresponding to the object for users included in the specified group of users.08-08-2013

Patent applications in class Limited to specially coded, human-readable characters

Patent applications in all subclasses Limited to specially coded, human-readable characters