Channel: Adobe Community: Message List - Acrobat SDK

↧

Ligature text expansion issue

April 25, 2014, 4:48 am

≫ Next: Re: Ligature text expansion issue

≪ Previous: Re: how to extract Text from a pdf file

Hi,

I am successfully extracting text from pdf, by the PDWordFinder but there are some issue with ligature text.

Can any one help let me know if possible, How to stop ligature expanision.

There is a word "office" in my pdf file. and it is getting expanded as "offi ce".

Here is my code

PDWordFinderConfigRec wfConfig; /* WordFinder configuration record */

memset(&wfConfig, 0, sizeof(PDWordFinderConfigRec));

wfConfig.noXYSort = true;

wfConfig.noLigatureExp = false;

wordFinder = PDDocCreateWordFinderEx (pdDoc, WF_LATEST_VERSION, toUnicode, &wfConfig);

pageNum = AVPageViewGetPageNum (pageView);

         PDWordFinderAcquireWordList (wordFinder, pageNum, &wInfo, NULL, NULL, &count);

for(i=0; i<count; i++)

{

memset (str, '\0', MAX_PATH);

word = PDWordFinderGetNthWord (wordFinder, i);

PDWordGetString (word, str, PDWordGetLength(word));

attrib = PDWordGetAttrEx (word, 0);

if((attrib & WXE_ADJACENT_TO_SPACE) && !(attrib & WXE_LAST_WORD_ON_LINE) && !(attrib & WXE_HAS_LIGATURE))

strcat (str, " ");

fprintf (pFileTexts, "%s", str);

}

Actually for all words the value (attrib & WXE_HAS_LIGATURE) is never getting true.
so not able to detect ligatured texts.

↧

Trending Articles

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

December 22, 2016, 3:50 am

Man charged with July slaying of Jovan Hopkins in Back of the Yards

August 23, 2015, 12:33 pm

Man dies and another in serious condition after A614 crash between Driffield...

August 16, 2012, 2:58 am

Trio remanded on gun, other serious charges

March 13, 2020, 10:07 pm

Who's been in court? A round up of cases heard by Essex magistrates

September 27, 2014, 10:00 pm

The Angry Birds Movie (Tamil Dubbed)

May 29, 2016, 1:08 am

Casualty cut free following three-car collision in Newtown Unthank

September 18, 2014, 11:19 am

Novel : I Love You, Stupid! 2

August 14, 2012, 7:38 pm

Moondru Mudichu 05-04-2017 – Polimer tv Serial

April 5, 2017, 9:07 am

La Liga Font 2017/2018 (Free TTF Version)

November 19, 2017, 6:49 pm

Download: FK ft Shenky – Nakuyewa ”Prod by: Shenky”

February 16, 2017, 4:24 pm

Fushigi no Dungeon – Furai no Shiren 3: Karakuri Yashiki no Nemuri Hime (JPN)

October 24, 2017, 1:48 am

Playboi Carti – MUSIC – SORRY 4 DA WAIT [iTunes Plus M4A + M4V]

March 25, 2025, 5:45 pm

Transformation of Sentence for HSC Students

October 1, 2019, 10:30 pm

Sarah Samis, Emil Bove III

November 17, 2012, 9:36 pm

Throw Back: Samini — Where My Baby Dey (Prod by Kaywa)

May 14, 2015, 11:18 am

የኤሌክትሪክ ሥራዎች ተቋራጭ ሰርተፊኬት ለማግኘት የሚያስፈልጉ ቅድመ ሁኔታዎች

March 12, 2020, 8:24 am

Toughie 3495

June 12, 2025, 6:00 am

A/L Technology Stream – Subject combinations, Syllabuses and Teacher guides

December 17, 2013, 6:12 pm

Inception 2010 Hindi Dual Audio 650MB BRRip 720p ESubs HEVC

December 27, 2016, 4:23 pm

© 2025 //www.rssing.com