Channel: Adobe Community: Message List - Acrobat SDK

Using Acrobat SDK to read hyphenated text

I'm using the GetText method to retrieve text, word by word, from a PDF. I'm running into two problems, both of which come from GetText treating hyphens as punctuation rather than as text.

 

If the source text in my document contains a date in the form 30-JUN-2013, GetText returns 30JUN2013.

 

If the source text contains a negative number, for example -90.20, GetText returns just 90.20. Similarly, source text of -$90.20 is returned as two text items: first $, then 90.20.

 

I'm using VBA within an Access db to read PDFs and populate data within database tables.

 

Does anyone know how to set an option so that the SDK treats hyphens as part of the word, or of an alternative to the GetText routine that accomplishes something analogous?
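
One alternative that may be worth trying is the Acrobat JavaScript bridge rather than GetText. The sketch below is not a confirmed fix: it assumes a full Acrobat installation (not Reader), uses late binding, and calls getPageNthWord with its optional third argument (bStrip) set to False, which asks Acrobat not to strip punctuation and white space from each word. The procedure name DumpWordsWithPunctuation is made up for illustration, and whether a leading hyphen (as in -90.20) comes back attached to the number or to the preceding word is something to check against your own PDFs.

```vba
' Sketch only: requires a full Acrobat installation (not Reader).
' DumpWordsWithPunctuation is a hypothetical helper name.
Public Sub DumpWordsWithPunctuation(ByVal pdfPath As String)
    Dim pdDoc As Object
    Dim jso As Object
    Dim pageIndex As Long
    Dim wordIndex As Long
    Dim wordCount As Long
    Dim rawWord As String

    Set pdDoc = CreateObject("AcroExch.PDDoc")
    If Not CBool(pdDoc.Open(pdfPath)) Then
        Debug.Print "Could not open " & pdfPath
        Exit Sub
    End If

    ' GetJSObject exposes the Acrobat JavaScript Doc object to VBA.
    Set jso = pdDoc.GetJSObject

    For pageIndex = 0 To pdDoc.GetNumPages - 1
        wordCount = jso.getPageNumWords(pageIndex)
        For wordIndex = 0 To wordCount - 1
            ' Third argument is bStrip: False keeps punctuation and white space.
            rawWord = jso.getPageNthWord(pageIndex, wordIndex, False)
            ' Brackets make trailing hyphens and spaces visible in the output.
            Debug.Print pageIndex, wordIndex, "[" & rawWord & "]"
        Next wordIndex
    Next pageIndex

    pdDoc.Close
End Sub
```

Running this from the Immediate window against a sample file shows how Acrobat splits 30-JUN-2013 and -$90.20 into words when stripping is turned off. If the hyphens survive, adjacent pieces can be rejoined in VBA before they are written to the tables.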

