Quantcast
Channel: Adobe Community: Message List - Acrobat SDK
Viewing all articles
Browse latest Browse all 10848

Re: PDETextItem for utf-8 characters

$
0
0

Text is not stored in PDF files in UTF-8 or any other Unicode. PDETextItemCopyText copies text without recoding it. There is a HUGE gap between the internal text format and having it in Unicode. You need to understand text encoding issues from the PDF specification, or for text extraction (without editing) use a different API like a UCS WordFinder.


Viewing all articles
Browse latest Browse all 10848

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>