South Brunswick, NJ (PRWEB) May 11, 2010
Infosolve Technologies, industry leading experts in the area of data quality solutions, are pleased to announce the price/performance benchmark results of their massively scalable OpenDQ fuzzy matching solution on the Amazon EC2 cloud platform.
- 100 million records on a 50 node cluster are matched in 40 Minutes at a cost of $25.
- 150 million records on a 50 node cluster are matched in 1 hour 20 Minutes at a cost of $50.
- 200 million records on a 75 node cluster are matched in 2 hours 12 minutes at a cost of $112.50.
- 250 million records on a 75 node cluster are matched in 3 hours 32 minutes at a cost of $150.
- Matching performed on five attributes using the Jaro-Winkler algorithm.
- Each node is an Amazon EC2 M1.Xlarge instance type.
- The cost of each EC2 instance is 50 cents/hour (as of 04/30/2010).
- Benchmarks are performed on a generated test data set; real world data benchmarks may be different.
- Matching can be run on any general purpose cluster of computers.
Record Linkage/Data Matching using fuzzy matching algorithms on data sets that do not have a unique identifier between records is one of the most important process steps in applications such as: single view of the customer, fraud detection, electronic health care records, terrorist and no fly watch lists, auditing and compliance, master data management, data de-duplication, and social media/multi channel marketing programs. These processes impose heavy compute loads on servers and take a considerable amount of time to complete on large data sets. With OpenDQ’s massively scalable data matching solution, organizations now have the ability to match large data sets from multiple data sources and scale seamlessly from a single server to a cluster of servers using general purpose commodity hardware.
OpenDQ, Infosolve's zero license cost enterprise data quality solution provides organizations with one of the most comprehensive platforms for achieving clean, consistent and accurate enterprise data. Features of OpenDQ include profiling, data matching, de-duplication, dictionary-based standardization, reporting with comprehensive extraction-transformation-loading capabilities, built-in address parsing and correction, external data enhancement, real-time matching and duplicate detection. “While the scalability and the performance of the matching solution are important features, it is really the affordable price point at which we are able to deliver an end-to-end solution to our customers that differentiates Infosolve’s OpenDQ platform from the rest of the competition” said Subbu Manchiraju, Vice President at Infosolve Technologies.
Using their powerful Zero Based data solutions, Infosolve Technologies has assisted businesses, ranging in size from Fortune 100 companies, mid-sized companies to non-profit organizations, to maximize the business value of their data. Infosolve will continue to improve and expand their suite of zero license cost software with new enhancements and offerings in the upcoming months. To learn more about the features/capabilities of OpenDQ please visit http://www.dataqualitysolution.com
About Infosolve Technologies
Infosolve Technologies (http://www.infosolvetech.com), founded in South Brunswick, New Jersey in 2003, provides clients with comprehensive data solutions that leverage the power of their enterprise data to achieve business objectives and create strategic opportunities without the burdens of cumbersome licensing agreements, complex term contracts, and expensive hardware requirements. Infosolve’s solutions are client-driven in every aspect, from inception to results, focusing on client business drivers and required return on investment. The company's industry-leading expertise derives from both in-depth domain knowledge and hands-on experience implementing solutions for numerous businesses, ranging in size from non-profit organizations to Fortune 100 companies.
All trademarks or registered trademarks are property of their respective owners.
Zero Defect Data, Zero Defect Data Guarantee, OpenDQ, OpenCDI, OpenMDM and The Power of Zero Based Data Solutions are trademarks of Infosolve Technologies, Inc
Infosolve Technologies, Inc
# # #