Refying PDF with subset embedded fonts fixes text extraction

Hi All,

I know it is not a good idea to (just) refry PDF files (PDF -> EPS -> PDF). Especially when the PDF contains subset embedded fonts. Chances are you will end up with a PDF file which does not contain valid (searchable) text.

I did not know the apposite could also be true. The following zip file contains 2 PDF files echo containing two words: the original and the refried version.

Refried.zip

When selecting text from the original PDF (using acrobat 6 through X) file it contains incorrect text, in this case invalid capitals. If I try the same in the refried version the extracted text is correct.

It seems strange to me that a process which only can result in loss of information "fixes" this text issue. Somewhere the correct text must be hidden in the original PDF file. Not only capitals seem to be effected but also random characters which seem to be fixed once refried.

Could anyone think of an explanation?

Is there a workaround without having to refry the PDF (refrying often results in loss of information). I have no influence on the PDF files I recieve, therefore I cannot embed the full fonts.

I am using de C++ SDK for Acrobat to write plugins.

Any pointers would be great!

Kind regards,

Robert

Refying PDF with subset embedded fonts fixes text extraction

Trending Articles

KMS & Digital & Online Activation Suite v5.7

13 Japanese teen boys caught peeping into girls’ hot spring bath during class...

Mp3 Download: Mdu - Auntie

Kanulanu Thaake Lyrics and translation | Manam (2014)

Karimnagar District Police Office Mobile Numbers List in Telangana State

GTA 5 PPSSPP Zip File Download For Android Mediafire 382 MB

Das MausPad • Req.Bin ein Star usw.

The Personal Assistant (JL Creation) (ENG+RUS) [L] [1.79GB]

Love Status in Punjabi, ਪੰਜਾਬੀ ਲਵ ਸਟੇਟਸ

Practice Sheet of Right form of verbs for HSC Students

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

Moondru Mudichu 01-05-2017 – Polimer tv Serial

Trailer Park Boys Jail S01-S02 1080p NF WEB-DL H264-FLUX

13917

The 10 Tennessee Cities With The Largest Black Population For 2021

Shatta Wale – You Shock Me (Prod. by Willis Beatz)

[GET] Rob Lennon – AI Lead Magnets + Workshop ($199)

Alessia Cara – Know It All (Album) [2015] – FREE DOWNLOAD – ZIP

Allison Russell – The Returner (2023) [FLAC 24bit/48kHz]

Psycho For Love: Alfred Scott Keefe convicted of killing his fiancee, Terri...