Zaloni Changes the Game in the Data Lake with New Machine-learning Data Matching Technology

Share Article

Next-gen data lake management platform enables data matching to provide enriched data views for 360° initiatives

Zaloni: The Data Lake Company

“As companies mature into data-driven organizations, automation becomes the key to scaling and adding more use cases, improving the accuracy and quality of data, and accelerating business insights.” - Ben Sharma, CEO, Zaloni

As an extension to its Data Lake Management Platform, Zaloni today introduced a machine-learning data matching engine, which leverages the data lake to create “golden” records and enable enriched data views for multiple use cases across business sectors. Zaloni’s data matching engine provides a new approach for creating an integrated, consistent view of data that is updated, efficiently maintained and can drive customer-facing applications. It addresses a gap in the marketplace for a simplified, much less expensive and faster-to-implement solution for data mastering.

“Many master data records solutions are complex, inflexible, expensive and underperform for the cost,” said Ben Sharma, Zaloni’s CEO. “Zaloni’s data matching engine, which is offered as an extension to Zaloni’s Data Lake Management Platform, enables a practical, unique solution at a great value that will interest any organization that has a Customer or Product 360° initiative. For example, we implemented a Patient 360° project with one of our healthcare customers."

With Zaloni’s Data Master extension, companies can leverage their data lake environment to achieve an enriched view of customer or product data for applications such as intelligent pricing, personalized marketing, smart alerts, customized recommendations, and more. Because it works directly in the data lake, organizations can capture and combine any data type, including unstructured data, which allows the engine to create a more robust single version of truth. Further, Zaloni’s data matching engine can use your sample data to train its machine-learning algorithms.

“As companies mature into data-driven organizations, automation becomes the key to scaling and adding more use cases, improving the accuracy and quality of data, and accelerating business insights,” said Sharma. “Zaloni’s data matching engine provides this critical automation without significant new investment – a huge win for CIOs.”

Zaloni’s data matching engine is built on top of the powerful Zaloni Data Lake Management Platform and uses Spark machine-learning libraries and analytic approaches to integrate data silos. This includes probabilistic matching for record linkage, advanced data clustering, and data classification techniques. In addition, Zaloni’s data matching extension uses reinforced learning techniques that enable customers to train the matching models based on live sample data. This approach provides maximum accuracy that may be adjusted as the data changes.

Zaloni’s data matching engine also leverages the Zaloni Data Lake Management Platform for metadata, data quality, scalability, user interface, and operational data pipelines for creating master records. In addition, Zaloni’s total, integrated package provides a clear advantage and faster time to value over more limited deduplication open source or point products that lack analytics data preparation capabilities such as joins and data profiling.

Zaloni’s data matching engine is currently in beta with select Zaloni customers. General availability is scheduled for fall 2017.

Come see us during the Strata Data Conference in New York

Find us in the Expo Hall at Booth #515 to view product demos, meet with Zaloni experts, and get a copy of Understanding Metadata by Scott Gidley, Zaloni’s vice president of Product.

Speaking Session:

Operationalizing your data lake: The key to a scalable, modern data architecture
Ben Sharma, CEO, Zaloni
Carlos Matos, CTO of Big Data, AIG

Wednesday, September 27
2:05pm - 2:45pm
Location: 1E 06

About Zaloni
Zaloni simplifies big data for transformative business insights. We work with pioneering enterprises to modernize their data architecture and operationalize their data lakes to incorporate data into everyday business practices. The Zaloni Data Lake Management Platform provides total control throughout the data pipeline from ingestion to analytics, with comprehensive data management, governance and self-service data preparation capabilities for IT and business users. A leader in big data for more than a decade, Zaloni’s expertise is deep, spans multiple industries, and has proven invaluable to customers at many of the world’s top companies. We are proud to be recognized by CRN’s 2017 Big Data 100 list, Forbes top 20 big data companies to work for, and Red Herring’s Top 100 North America Award. To learn more, visit

Share article on social media or email:

View article via:

Pdf Print

Contact Author

Annie Bishop
919-323-4050 (x7054)
Email >
Follow us on
Visit website