Cloudera, Inc. Patent applications |
Patent application number | Title | Published |
20130282668 | AUTOMATIC REPAIR OF CORRUPT HBASES - Systems and methods for checking for region consistency and table integrity problems and automatically repairing a corrupted HBase cluster. The methods and systems operate in a diagnostic mode and a diagnostic and repair mode. The methods include fixing table integrity problems, such as backwards table regions, table region holes, table region overlap, and the like to restore table integrity invariant. Once the table integrity has been restored, each row key resolves to exactly one region. The methods further include fixing region inconsistencies, such as bad region assignment, no region present in the meta table, region information not in the Hadoop Distributed File System (HDFS), and the like to restore region consistency invariant. The information in the HDFS is taken as ground truth and any meta table or assignment problems that are inconsistent with the HDFS is deemed wrong and removed. | 10-24-2013 |
20130204948 | CENTRALIZED CONFIGURATION AND MONITORING OF A DISTRIBUTED COMPUTING CLUSTER - Systems and methods for centralized configuration and monitoring of a distributed computing cluster are disclosed. One embodiment of the disclose technology enables deployment and central operation a complete Hadoop stack. The application automates the installation process and reduces deployment time from weeks to minutes. One embodiment further provides a cluster-wide, real time view of the services running and the status of the host machines in a cluster via a single, central place to enact configuration changes across the computing cluster which further incorporates reporting and diagnostic tools to optimize cluster performance and utilization. | 08-08-2013 |
20130185337 | MEMORY ALLOCATION BUFFER FOR REDUCTION OF HEAP FRAGMENTATION - Systems and methods of a memory allocation buffer to reduce heap fragmentation. In one embodiment, the memory allocation buffer structures a memory arena dedicated to a target region that is one of a plurality of regions in a server in a database cluster such as an HBase cluster. The memory area has a chunk size (e.g., 2 MB) and an offset pointer. Data objects in write requests targeted to the region are received and inserted to the memory arena at a location specified by the offset pointer. When the memory arena is filled, a new one is allocated. When a MemStore of the target region is flushed, the entire memory arenas for the target region are freed up. This reduces heap fragmentation that is responsible for long and/or frequent garbage collection pauses. | 07-18-2013 |
20120254722 | INTERACTIVE USER INTERFACE IMPLEMENTATION AND DEVELOPMENT ENVIRONMENT THEREFOR - Systems and methods of interactive user interface implementation and development environment therefor are disclosed. One embodiment of implementing interactive elements in a web page on a client device includes, sending, by the client device, a request to a web server in response to an event triggered in a frame of the web page having multiple frames, processing a web server response from the web server for the frame, and/or unloading contents of the frame, after receipt of the web server response, independent of the other frames in the web page when in accordance with rules defined for the web server response. | 10-04-2012 |
20120254292 | USER INTERFACE IMPLEMENTATION FOR PARTIAL DISPLAY UPDATE - Systems and methods for user interface implementation for partial display update are disclosed. One embodiment of the method, which may be embodied on a system includes, in a response received from a web server, identifying, for a web page, a set of elements able to he updated partially as displayed without refreshing the user interface in its entirety, detecting, in the response, updated elements in the set of elements that have been updated from a value displayed in the user interface, and/or partially updating the user interface to reflect changes to the updated elements in the web page without refreshing other portions of the user interface. | 10-04-2012 |
20110246826 | COLLECTING AND AGGREGATING LOG DATA WITH FAULT TOLERANCE - Systems and methods of collecting and aggregating log data with fault tolerance are disclosed. One embodiment includes, one or more devices that generate log data, the one or more machines each associated with an agent node to collect the log data, wherein, the agent node generates a batch comprising multiple messages from the log data and assigns a tag to the batch. In one embodiment, the agent node further computes a checksum for the batch of multiple messages. The system may further include a collector device, the collector device being associated with a collector tier having a collector node to which the agent sends the log data; wherein, the collector determines the checksum for the batch of multiple messages received from the agent node. | 10-06-2011 |
20110246816 | CONFIGURING A SYSTEM TO COLLECT AND AGGREGATE DATASETS - Methods for configuring a system to collect and aggregate datasets are disclosed. One embodiment includes, identifying a data source in the system from where dataset is to be collected, configuring a machine in the system that generates the dataset to be collected, to send the dataset to the data source, identifying an arrival location where the dataset that is collected is to be aggregated or written, and/or configuring an agent node by specifying a source for the agent node as the data source in the system and specifying a sink for the agent node as the arrival location. | 10-06-2011 |
20110246528 | DYNAMICALLY PROCESSING AN EVENT USING AN EXTENSIBLE DATA MODEL - Systems and methods of dynamically processing an event using an extensible data model are disclosed. One embodiment includes, specifying attributes of the event in a data model; the data model being extensible to add properties to the event as the dataset is streamed from the source to the sink. | 10-06-2011 |
20110246460 | COLLECTING AND AGGREGATING DATASETS FOR ANALYSIS - Systems and methods of facilitating collecting and aggregating datasets that are machine or user-generated for analysis are disclosed. One embodiment includes, collecting a dataset on a machine on which the dataset is received or generated, wherein, the dataset is collected from a data source on the machine, aggregating the dataset collected from the data source at a receiving location, performing analytics on the dataset upon collection or aggregation, and/or writing the dataset aggregated at the receiving location to a storage location. | 10-06-2011 |