"The combination of our technologies will help continue the adoption of Hadoop across business units with easier data self-service and a foundation for data governance," said Jon Posnik, VP of business development, MapR Technologies.
Hadoop World New York, New York (PRWEB) October 16, 2014
Strata + Waterline Data Science today announced at Strata + Hadoop World New York that it has joined the MapR Advantage Partner Program. Waterline Data Science will integrate the MapR Distribution including Apache™ Hadoop® with Waterline Data Inventory to enable data self-service on Hadoop, allowing users to find, understand, and help govern Hadoop data.
Oliver Claude, Waterline Data Science CMO, states, “We’re pleased to have Anoop Dawar, senior director, product management, MapR, join our Advisory Board to help steer the partnership and make Hadoop more business and mission critical by leveraging the enterprise-grade MapR Distribution and Waterline Data Inventory’s enterprise-ready data self-service with built-in data governance.”
Jon Posnik, vice president of business development, MapR Technologies, said, “Waterline Data Science is a great addition to our growing partner ecosystem. The combination of our technologies will help continue the adoption of Hadoop across business units with easier data self-service and a foundation for data governance.”
Companies are deploying Hadoop “data lakes” to provide unprecedented access to data for data science and analytics to uncover new business insight. But Hadoop’s advantages of frictionless ingest, flexible schema on read, and lack of data governance, present problems for users trying to find and understand the data. Waterline Data Inventory addresses these problems by building a complete inventory of data assets in Hadoop and by opening access to Hadoop data through data self-service. As a result, data scientists can be more productive, business analysts can easily augment reporting and BI with Hadoop data without coding, and data governance teams can start controlling Hadoop data.
“There is no point building a predictive model of the wrong column, and without a data inventory, you don’t know if you have the wrong column,” said John Mount, co-author of the book, Practical Data Science with R. A data inventory is also valuable for Hadoop data governance, according to Sunil Soares, author of Big Data Governance.
Alex Gorelik, Founder and CEO, states, “A major complaint with Hadoop is once you’ve loaded the data, extracting value is like finding a needle in a stack of needles. Waterline Data Inventory lets business users find the best needles in the stack of needles, without having to write code, and without having to wrangle the entire stack. That's our secret sauce, and key to deliver faster time to value and broad Hadoop adoption."
MapR delivers on the promise of Hadoop with a proven, enterprise-grade platform that supports a broad set of mission-critical and real-time production uses. MapR brings unprecedented dependability, ease-of-use and world-record speed to Hadoop, NoSQL, database and streaming applications in one unified distribution for Hadoop. MapR is used by more than 500 customers across financial services, government, healthcare, manufacturing, media, retail and telecommunications as well as by leading Global 2000 and Web 2.0 companies. Amazon, Cisco, Google and HP are part of the broad MapR partner ecosystem. Investors include Google Capital, Lightspeed Venture Partners, Mayfield Fund, NEA, Qualcomm Ventures and Redpoint Ventures. MapR is based in San Jose, CA. Connect with MapR on Twitter, LinkedIn, and Facebook.
About Waterline Data Science
Waterline Data Science is an early-stage Big Data software company, founded in December 2013, backed by Menlo Ventures and Sigma West. The inspiration for the name "Waterline" came from the metaphor of the Big Data Lake. Waterline solves the challenges of data self-service for the Hadoop data lake. It's easy to get data into Hadoop, but it's not easy to get it out in a self-service manner and derive business value from it. The idea behind Waterline is that data self-service for Hadoop should be like finding the data you need easily, without having to dive for it -- you should be able to Hadoop "above the waterline."