[reportlab-users] Reportlab text not searchable in Apple OSX Preview.App? but searchable in Acrobat and google-pdf viewer ?
Robin Becker
robin at reportlab.com
Wed Aug 5 13:03:36 EDT 2009
Tim Roberts wrote:
> Robin Becker wrote:
>> ........
>> It seems that preview is actually looking at the rendered image to
>> find the characters.
>
> I agree that this matches the symptoms, but as a programmer, how would
> you do that? The rendered image is just a matrix of pixels. How would
> you search for words, or even letters, for that matter?
>
> When I encounter a bug, I always like to put myself in the mind of the
> programmer to figure out what thinking would have led to the bug. I'm
> having a hard time coming up with an implementation that would trigger
> this. Maybe they are converting the PDF to some kind of intermediate
> language (like a Windows EMF), where strings that aren't horizontal get
> converted into a series of smaller strings that ARE horizontal, and they
> are searching that intermediate format. I'd call that "overthinking the
> problem".
>
I saw some related Preview search problems on one of the tex lists when I was
googling. They seemed to be recommending the use of standard fonts and the like
to improve searchablity; that would come down to some kind of OCR like weakness.
Mac people: are gifs searchable?
--
Robin Becker
More information about the reportlab-users
mailing list