HStreaming Enables Continuous Real-time Analytics on All Hadoop Distributions

HStreaming LLC announced today a new release of its scalable continuous data analytics platform built on Hadoop that is available for all major Hadoop distributions including Apache Hadoop, Cloudera, MapR, Amazon EMR, Hortonworks, EMC, and IBM. The software allows companies to realize the full value of big data through analyzing, visualizing, and acting upon massive continuous data in real time directly on their existing Hadoop infrastructure. The new release is available as of today.

  • Share on TwitterShare on FacebookShare on Google+Share on LinkedInEmail a friend
"67% of Hadoop users see the lack of real-time capabilities as the #1 technology obstacle in analyzing big data on Hadoop." says David Menninger, VP and Research Director at Ventana Research.

Hadoop World, New York, NY (PRWEB) November 08, 2011

HStreaming LLC announced today a new release of its scalable continuous data analytics platform built on Hadoop that is available for all major Hadoop distributions including Apache Hadoop, Cloudera, MapR, Amazon EMR, Hortonworks, EMC, and IBM. The software allows companies to realize the full value of big data through analyzing, visualizing, and acting upon massive continuous data in real time directly on their existing Hadoop infrastructure. The new release is available as of today.

Big data problems start whenever there is machine-generated data. “Apache Hadoop is one of the leading platforms to store and analyze big data with 48% of Hadoop users producing more than 100 GB of data per day.” says David Menninger, VP and Research Director at Ventana Research. While machine-generated data is usually generated continuously, Apache Hadoop allows for batch processing only, taking anywhere from minutes to hours to run an analysis. According to the same study done by Ventana Research, 67% of Hadoop users see the lack of real-time capabilities as the #1 technology obstacle in analyzing big data on Hadoop. “Processing latency is a true challenge with Hadoop since real-time data often loses its value very rapidly. For example, by immediately acting on a network outage from sensor data or preventing fraud as it happens can substantially reduce cost for an enterprise. HStreaming provides exactly these capabilities for Hadoop.” says Jana Uhlig, CEO of HStreaming.

HStreaming enables running advanced analytics on Hadoop in real-time to create live dashboards, identify and recognize patterns within one or across multiple data streams, and trigger action based on predefined rules or heuristics. HStreaming can handle even the most challenging data volumes and complex analytical problems leveraging the scalability of Hadoop and MapReduce.

HStreaming enables users to process data continuously as they are generated, reducing the latency for analytics results to milliseconds. With the addition of HStreaming’s real-time processing, ETL (extract-transform-load) capabilities, and a rich set of data and storage connectors to SQL and noSQL databases, HStreaming-powered Hadoop can now handle the full big data life cycle from real-time data acquisition, analytics, storage, query and archival on a single platform. Built on MapReduce and Hadoop, HStreaming is the most scalable stream processing / complex event processing (CEP) system on the market able to process and analyze hundreds of millions of events per second.

The new release of HStreaming technology is built upon Hadoop and is now fully compatible with all major Hadoop distributions including Apache Hadoop, Cloudera’s CDH, MapR’s M3 and M5, Amazon Elastic MapReduce (EMR), Hortonworks Data Platform, EMC’s Greenplum HD, and IBM’s InfoSphere BigInsights. No installation of a custom distribution is necessary for real-time analytics on Hadoop – instead, HStreaming software runs completely agnostic atop any of these Hadoop distributions.

Tweet This: #HStreaming continuous real-time analytics platform now runs on all major #Hadoop distros. #CEP #MapReduce http://bit.ly/vbWelf

About HStreaming
HStreaming LLC, based in Chicago, IL, provides the most scalable real-time continuous data analytics platform powered by Hadoop. HStreaming enables organizations to realize the full value of data by analyzing, visualizing, and acting on massive data correctly and in real-time. HStreaming adds real-time processing and ETL capabilities to Hadoop consolidating the full big-data life cycle including pre-processing, ETL, storage, analytics, post-processing, and archival on a single platform. HStreaming is compatible with all major Hadoop distributions. HStreaming offers two products: HStreaming Cloud and HStreaming Enterprise. For more on HStreaming, please visit http://www.hstreaming.com.

###


Contact