PDF to Text Not Rendering Thai Properly
Posted: Mon Oct 15, 2018 12:47 pm
Hello,
I compare several hundred pdf documents in a variety of language. Most of them are in English, but about a third of them are in non-English characters. I have no problem when comparing Japanese, Chinese, or Arabic, but when I compar Thai a large number of boxes show up in the comparison window. I'm certain my issue lies within the PDF to Text plug-in because I had to configure that a few years ago to get useful comparisons for the languages above, but I'm unsure of whether it's an issue with ExamDiff Pro, Xpdf's pdf to text, or something else. I've visually ruled out anything with the PDFs themselves.
I compare several hundred pdf documents in a variety of language. Most of them are in English, but about a third of them are in non-English characters. I have no problem when comparing Japanese, Chinese, or Arabic, but when I compar Thai a large number of boxes show up in the comparison window. I'm certain my issue lies within the PDF to Text plug-in because I had to configure that a few years ago to get useful comparisons for the languages above, but I'm unsure of whether it's an issue with ExamDiff Pro, Xpdf's pdf to text, or something else. I've visually ruled out anything with the PDFs themselves.