Read text from pdf

DynamicPDF CoreSuite for .NET (v6) Forum

Advanced Search

I found this thread:

http://www.DynamicPDF.com/Forums/DisplayThread.csp?ForumID=9&ThreadID=526#1571

dated back to 2006 asking if one could read strings from a pdf. The support rep said it was on the wishlist. Is it possible to read the raw text of a pdf with any CeTe product (especially DynamicPdf Merger)? I'm not interested in changing the text; I just want to know if I could search a pdf for a given string and know:

1) Does the string exist? <-- most important
2) What page is the string on?

Posted by a ceTe Software moderator

Hello,

Currently it is not possible to extract text contents from already existing PDF document using our version 6 DynamicPDF Merger for .NET product.

Our developers are working on a feature using which you can extract text content from a page of PDF document but it is still under development stage. This feature will added to our future version of DynamicPDF Merger for .NET product but we do not have any exact time line on this.

Thanks,
ceTe Software Support Team.

Is Reading text into strings from a PDF still a wish feature?

Posted by a ceTe Software moderator

Hello,

Yes, feature for reading text of an existing PDF document is implemented in version 7 DynamicPDF Merger for .NET product. You can get the document and page level text using GetText method of PdfDocument class. Please refer to the documentation on text extraction here.

Feel free to download evaluation edition of DynamicPDF Merger for .NET product from our website here.

Thanks,
ceTe Software Support Team.

Post Reply EMail Alerts

All times are US Eastern Standard time. The time now is 8:35 AM.