Equivio Near-Duplicate Benchmark Processes One Million Files in Less Than Three Hours

Equivio, a provider of software for near-duplicates and email threads, today announced benchmark results on the performance of version 2.3.7. The Generally Available (GA) version of Equivio 2.3.7 was released on May 6, 2009. As demonstrated in the benchmark, version 2.3.7 delivers the performance and operational ease required of a product that has become an integral component of the standard workflow for electronic discovery.

Key results from the benchmark include:

  • Case of 1,000,000 documents completed in less than 3 hours
  • Case of 7,000,000 documents completed in 30 hours
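
As a rough check on these figures, the implied average throughput can be worked out as follows. This is a back-of-envelope sketch only: the function name is illustrative, the one-million-document run is reported only as "less than 3 hours" so three hours is treated as an upper bound, and the averages say nothing about how throughput varies with configuration.

```python
# Implied average throughput from the two benchmark figures quoted above.
# Assumption: 3.0 hours is an upper bound for the 1,000,000-document case,
# so the rate computed for that case is a lower bound.

def docs_per_hour(documents: int, hours: float) -> float:
    """Return the average number of documents processed per hour."""
    return documents / hours

small_case = docs_per_hour(1_000_000, 3.0)   # lower bound on the rate
large_case = docs_per_hour(7_000_000, 30.0)

print(f"1M-document case: at least {small_case:,.0f} documents/hour")
print(f"7M-document case: about {large_case:,.0f} documents/hour")
```

On these numbers, the smaller case averages at least roughly 333,000 documents per hour and the larger case roughly 233,000 documents per hour.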

Amir Milo, CEO of Equivio, noted, "The market has transitioned near-duping and email threads from a premium service, used mainly by technologically savvy customers, to a default bundled service, applied universally in all cases. This benchmark demonstrates that Equivio can handle the massive volumes and throughput demanded of a standard service in the e-discovery arena."

The benchmark results show a 100% improvement in processing throughput in Equivio version 2.3.7 relative to prior versions. The new version also contains important operational enhancements, including support for additional data sources, full Unicode encoding, additional email formats, and customizable output in Equivio>Extract.

These enhancements translate into mission-critical savings in time-sensitive business environments. In e-discovery, for example, the majority of cases include up to 1 million documents. Equivio's new throughput levels, combined with the operational improvements, mean that most cases can be processed unsupervised in just a few hours.

Milo continued, "We have been focused on delivering performance, operational and business enhancements that can support the huge demand from our partners and customers for near-duplicate and email thread processing. These performance numbers show that Equivio can support the throughput requirements for even the largest cases and e-discovery operations in the industry."

The benchmark was run on standard PCs and was designed to analyze the impact of key performance parameters, including threshold level of near-duplicate resemblance, databases, email thread processing, distributed configuration, number of threads, and use of local or remote data. Full details on the benchmark results are available in an Equivio white paper, which can be downloaded from the resource center at http://www.equivio.com.

About Equivio

Equivio enables the management of data redundancy in content-centric business processes. Equivio's technology zooms in on unique data, allowing you to read less, think more, win big™. With products for grouping near-duplicates, capturing email threads and determining document relevance, Equivio powers a broad range of business applications, including eDiscovery, corporate investigations, records management, email archiving, data retention and intelligence. To learn more about winning with Equivio, visit http://www.equivio.com.

# # #

Contact Author

Warwick Sharp
Equivio
800-851-1965