Dagger Data Introduces Ultra-Fast OCR

Share Article

Less-than-a-penny-per-page price aims to undercut prevailing market rates by 25% or more

Projects that formerly took days or weeks can now be completed in minutes or hours. Prices of pennies per page can no longer be justified.

SANTA MONICA, CALIFORNIA, May 5, 2016. Dagger Data, an innovative start-up provider of cloud-based e-discovery self-service solutions, has announced a near-instantaneous optical-character recognition (OCR) offering. Priced at three-quarters of a cent per page, the service outputs OCR’d PDF, full text file, and by-page text files for each image file.

“New, scalable technologies enable us to perform OCR that is accurate, inexpensive, and almost instantaneous” said Bruce Fein, Dagger’s legal director. “This offering has the potential to rationalize and disrupt overpriced and outmoded OCR techniques. Projects that formerly took days or weeks can now be completed in minutes or hours. Prices of pennies per page can no longer be justified.”

Users can upload their images and download the corresponding outputs, in a variety of ways, including drag-and-drop, secure file transfer protocol (S/FTP), Amazon Web Services S3 bucket, or, for an added charge, by mailing a hard drive. The service accepts PDF, TIF (single- or multi-page), JPG, JPEG, PNG, BMP, and GIF formats. 24/7 customer support is free, as is OCR on images emailed as attachments to ocr(at)daggerdata(dot)com.

“Our processing can keep pace with just about any upload speed, with a lag of no more than a few minutes,” said lead Dagger developer Sergey Tsalkov. Tsalkov cited a recent matter in which Dagger performed OCR on approximately one million pages in a matter of hours.

The offering is currently tailored to English-language documents, and all processing occurs in the United States. According to Fein, Dagger will soon offer over 140 languages and processing in Europe to facilitate regulatory compliance, as well as integration with Dropbox.

About Dagger Data. Dagger Data offers a full range of scalable, cloud-based, self-service e-discovery services including processing, analytics, review platform, predictive coding, and OCR, all available a la carte or as a bundle. Dagger’s expertise encompasses bulk document review and categorization in connection with civil and criminal litigation and investigations in fields including financial institutions, FCPA, securities law, antitrust and Hart-Scott-Rodino Second Requests. In its first year, it processed nearly 20 terabytes of data and 100 million documents. Dagger was started in 2015 by DreamHost co-founder Josh Jones, Chunkhost co-founders Sergey Tsalkov and Nate Daiger, and Backstop co-founder Bruce Fein. For more information, please visit https://daggerdata.com/ocr or @daggerdata.

Share article on social media or email:

View article via:

Pdf Print

Contact Author

Bruce Fein
Dagger Data
+1 571-348-4666
Email >

Josh Jones
since: 02/2015
Follow >
Visit website