Preview rendered the text in the new file such that Adobe Acrobat copies the highlighted text with the expected white space between words. I opened the pdf file in question with Mac Preview rather than Adobe Acrobat and resaved the file under a new name. I did find a fix that works for me in this particular case. A big nuisance since I use this feature for generating annodated summaries. Copying and pasting the entire text within the Comment annotation box into BBedit (using the show invisibiles feature) on my Mac showed that in fact there were absolutely no white space characters in between the words. This occurs only certain pdf files from particular journals. The "copy text to a highlighted Comment" feature in Adobe Acrobat XI (Mac) rendered the text with no spaces. I am not a Framemaker user but I discovered this problem with some scientific journal articles downloaded in Adobe pdf format. I also have no idea if using any accessibility (Section 508) practices might provide the PDF reader/viewer with more clues for proper plaintext extraction (particularly keeping flows in proper order). turn off Kerning for the paragraph and/or character tag used,.tag those sections with a fixed-pitch font ( Courier is apt to be the most reliable),.If you are generating PDFs that must support copy&paste, one workaround might be to: The rendering engine (Ps or PDF driver) may have chosen to break " word1 word2" into two strings with two starting coordinates and no U+0020 space character (or alternative space characters) at all. The problem is that the PDF may or may not have the apparent spaces encoded as space characters, particularly at line ends, but also between words and perhaps even between characters. I am not able to understand what the problem is?
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |