Read text from pdf

Skip Navigation LinksHome  /  Support  /  Forums  /  DynamicPDF CoreSuite for .NET (v6)  /  Re: Read text from pdf

DynamicPDF CoreSuite for .NET (v6) Forum

 Oct 28 2011 12:18 PM
I found this thread:

http://www.DynamicPDF.com/Forums/DisplayThread.csp?ForumID=9&ThreadID=526#1571

dated back to 2006 asking if one could read strings from a pdf. The support rep said it was on the wishlist. Is it possible to read the raw text of a pdf with any CeTe product (especially DynamicPdf Merger)? I'm not interested in changing the text; I just want to know if I could search a pdf for a given string and know:

1) Does the string exist?               <-- most important
2) What page is the string on?   

 Oct 28 2011 12:37 PM
Posted by a ceTe Software moderator
Hello,

Currently it is not possible to extract text contents from already existing PDF document using our version 6 DynamicPDF Merger for .NET product.

Our developers are working on a feature using which you can extract text content from a page of PDF document but it is still under development stage. This feature will added to our future version of DynamicPDF Merger for .NET product but we do not have any exact time line on this.

Thanks,
ceTe Software Support Team.
 Feb 10 2014 9:05 AM
Is Reading text into strings from a PDF still a wish feature?
 Feb 10 2014 10:46 AM
Posted by a ceTe Software moderator
Hello,

Yes, feature for reading text of an existing PDF document is implemented in version 7 DynamicPDF Merger for .NET product. You can get the document and page level text using GetText method of PdfDocument class. Please refer to the documentation on text extraction here.

Feel free to download evaluation edition of DynamicPDF Merger for .NET product from our website here.

Thanks,
ceTe Software Support Team.

All times are US Eastern Standard time. The time now is 8:35 AM.