Not seeing Unicode support in file differences
Posted: Fri Mar 10, 2017 7:21 pm
Greetings,
I'm working on a project that involves C++ code that is mostly in English, but contains some Korean characters in comments. I just purchased ExamDiff Pro today because of the advertised Unicode support.
However, when I diff two of these files, I see all the Korean characters turn to gibberish.
Strangely, if I view the same files in Notepad++ the characters are displayed as actual Korean characters. If it helps, Notepad++ identifies the file encoding as EUC-KR.
I've tried manually forcing the encoding to any and all of the Unicode options in the open file dialog. Some turn the entire file into what look like Chinese characters, but with UTF-8 (which I gather is the correct choice) the file is correctly displayed with English characters while the Korean characters are gibberish.
Neither of the two files has byte-order marks which I suppose means EDP has to make some assumptions. Does it also make assumptions about the code page to use?
It's a little disconcerting that the only font options I can select are mono-spaced fonts that have a "script" option pull-down. The pull down only supports some alphabets (Hebrew, Greek, Cyrillic, etc.) How these fonts (specifically the default Courier New in Western script) end up displaying what looks like Chinese characters with the forced Unicode encoding is beyond me.
I understand there is more to displaying foreign characters than just using Unicode (the Notepad++ site has a whole page dedicated to explaining character encoding) but I'm wondering if I'm missing something or if ExamDiff Pro simply doesn't support Unicode characters in the file comparison view.
I'm not here to push Notepad++, I'm just using it for comparison, since everything else (document file, OS, fonts, etc.) is the same except the software I'm using to display these documents. Notepad++ shows me the Korean characters, EDP does not.
I'm working on a project that involves C++ code that is mostly in English, but contains some Korean characters in comments. I just purchased ExamDiff Pro today because of the advertised Unicode support.
However, when I diff two of these files, I see all the Korean characters turn to gibberish.
Strangely, if I view the same files in Notepad++ the characters are displayed as actual Korean characters. If it helps, Notepad++ identifies the file encoding as EUC-KR.
I've tried manually forcing the encoding to any and all of the Unicode options in the open file dialog. Some turn the entire file into what look like Chinese characters, but with UTF-8 (which I gather is the correct choice) the file is correctly displayed with English characters while the Korean characters are gibberish.
Neither of the two files has byte-order marks which I suppose means EDP has to make some assumptions. Does it also make assumptions about the code page to use?
It's a little disconcerting that the only font options I can select are mono-spaced fonts that have a "script" option pull-down. The pull down only supports some alphabets (Hebrew, Greek, Cyrillic, etc.) How these fonts (specifically the default Courier New in Western script) end up displaying what looks like Chinese characters with the forced Unicode encoding is beyond me.
I understand there is more to displaying foreign characters than just using Unicode (the Notepad++ site has a whole page dedicated to explaining character encoding) but I'm wondering if I'm missing something or if ExamDiff Pro simply doesn't support Unicode characters in the file comparison view.
I'm not here to push Notepad++, I'm just using it for comparison, since everything else (document file, OS, fonts, etc.) is the same except the software I'm using to display these documents. Notepad++ shows me the Korean characters, EDP does not.