PDFTextStream v1.3 Released: Enterprise-Class PDF Text Extraction Library for Java Upgraded
(PRWEB) April 26, 2005 -- Snowtide Informatics Systems, Inc. (http://www.snowtide.com) today announced an upgrade to PDFTextStream, its high-performance PDF text and metadata extraction library for Java applications and web services. Version 1.3 significantly expands PDFTextStream's Pure Java API, introduces compatibility with v1.6 of the PDF document specification (used by Acrobat 7), and offers major improvements to the accuracy and quality of text extracted from PDF documents.
PDFTextStream v1.3's expanded API and object model give Java developers more direct control over PDF content. The library now:
- provides page-by-page access to PDF documents
- supports retrieval of PDF document revision and encryption information
- offers high-performance 'piping' routines that simplify the extraction of text from PDF documents and pages.
PDFTextStream v1.3's new compatibility with PDF documents adhering to v1.6 of the PDF document specification (used by Acrobat 7) further enhances PDFTextStream's deep and broad support for different types of PDF documents.
Finally, PDFTextStream v1.3 includes a major upgrade to its internal document understanding processes, which results in more accurate, higher-quality text extracts. This matters most to developers who rely on PDFTextStream text extracts for full-text indexing and data mining applications, where the semantic integrity of the extracted text is mission-critical.
Product Overview
PDFTextStream is demonstrably the fastest Java library for converting PDF documents to text (http://www.snowtide.com/home/PDFTextStream/Performance), and is especially well-suited for high-volume enterprise environments. It supports all versions of the PDF document specification, including decryption of 40- and 128-bit encrypted PDF files. PDFTextStream also includes a module that provides drop-in integration with the popular Apache/Jakarta Lucene full-text indexing and search component.
Availability and Pricing
Version 1.3 of PDFTextStream is available immediately. PDFTextStream is a pure Java library, and is therefore compatible with any computing platform for which a v1.3 or higher Java Runtime Environment is available (including Linux, many Unix variants, Windows 2000/XP, and Mac OS X). Licenses start at $1,900 for a 1-CPU development/deployment license; OEM redistribution licenses are also available. Evaluation downloads are available at Snowtide.com.
Press contact:
Chas Emerick
303 Sargeant Street
Holyoke, MA 01040
press@snowtide.com
413.519.6365
Licensed copies of PDFTextStream are available for qualified members of the press.
PDFTextStream is a trademark of Snowtide Informatics Systems, Inc. Other trademarks are property of their respective owners.
###