PDFTextStream Upgraded, v1.4 Extracts PDF Form Content

Snowtide Informatics Systems, Inc. (http://www.snowtide.com) today announced an upgrade to PDFTextStream, its high-performance PDF content extraction library for Java applications and web services. Version 1.4 expands PDFTextStream's Pure Java API to support extraction of PDF form data, improves performance, and introduces an expanded PDF document model.

PDFTextStream v1.4's new interactive form extraction capability answers many users' requests for a single library that satisfies all of their PDF content extraction needs. In keeping with PDFTextStream's reputation for providing developer-friendly Pure Java APIs, the form extraction API requires no knowledge of the internal workings of PDF documents or PDF forms, and is therefore easy to integrate into existing development environments.

PDFTextStream v1.4 also introduces an expanded document model. Developers can now access block-, line-, and character-level entities for each PDF document. This makes it possible to build user interfaces that display PDF content, and can significantly aid many types of content analysis for search and data mining.

Finally, PDFTextStream v1.4 includes several performance enhancements. Foremost among these is a new in-memory operation mode, which eliminates PDFTextStream's requirement that PDF documents be on disk before they can be processed. For applications that already have PDF content in memory, this mode can boost performance by an order of magnitude.

Product Overview

PDFTextStream is demonstrably the fastest Java library for converting PDF documents to text (http://www.snowtide.com/home/PDFTextStream/Performance), and is especially well suited to high-volume enterprise environments. It supports all versions of the PDF document specification, including decryption of 40- and 128-bit encrypted PDF files. PDFTextStream also includes a module that provides drop-in integration with the popular Apache/Jakarta Lucene full-text indexing and search component.

Availability and Pricing

Version 1.4 of PDFTextStream is available immediately. PDFTextStream is a pure Java library, and is therefore compatible with any computing platform for which a v1.3 or higher Java Runtime Environment is available (including Linux, many Unix variants, Windows 2000/XP, and Mac OS X). Licenses start at $1900 for a 1-CPU development/deployment license; OEM redistribution licenses are available. Evaluation downloads are available at Snowtide.com.

Press contact:

Chas Emerick

303 Sargeant Street

Holyoke, MA 01040

413.519.6365

Evaluation copies of PDFTextStream are available for qualified members of the press.

PDFTextStream is a trademark of Snowtide Informatics Systems, Inc. Other trademarks are property of their respective owners.

# # #
