20120317472 | Creation of data extraction rules to facilitate web scraping of unstructured data from web pages - The present invention provides a method, system, and computer program to help a user without any programming knowledge create data extraction rules for collecting data from websites at scale. A user only needs to provide a web page Universal Resource Locator (URL), then mark and assign the needed data to its type. For example, on an e-commerce website, this data can be the product name, price, description, and so forth. Marking is done by highlighting the correct part of the web page. This creates a data extraction rule that describes the web template of full website and can be used thereafter for automated web scraping from all pages on a particular website. | 12-13-2012 |