Bug: Silently changing UTF-8 characters
Posted: Sun Nov 10, 2019 1:06 pm
Version: EDP 10.0.1.17 64-bit
Use plug-ins: off
Use document type-settings: on
Scenario:
Comparing 2 directories.
Comparing one of the changed .XML files by double-clicking on it.
The file has some non-encoded UTF-8 emojis (specifically: F0-9F-91-91); those are shown correctly in Notepad++, Windows 10 Notepad and other editors.
Saving the file from EDP silently changes the bytes from F0-9F-91-91 to ED-A0-BD-ED, which breaks the application that uses the file, since it treats the result as a different string.
I found no setting related to UTF-8 that will allow me to save the file without the silent changes.
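A possible explanation, offered as an assumption about EDP's internals rather than anything confirmed: the new bytes match what you get when the character is held as a UTF-16 surrogate pair and each surrogate is written out as its own 3-byte sequence (CESU-8 style) instead of as one 4-byte UTF-8 sequence. The Python sketch below reproduces the byte pattern; the helper name `encode_surrogates_separately` is made up for illustration.

```python
def encode_surrogates_separately(text: str) -> bytes:
    """Encode text the CESU-8 way: every UTF-16 code unit becomes its own
    UTF-8-style sequence, so characters above U+FFFF take 6 bytes, not 4."""
    utf16 = text.encode("utf-16-be")
    # Split the UTF-16 data into 16-bit code units.
    units = [int.from_bytes(utf16[i:i + 2], "big") for i in range(0, len(utf16), 2)]
    # "surrogatepass" lets Python encode lone surrogates, which strict UTF-8 forbids.
    return b"".join(chr(u).encode("utf-8", "surrogatepass") for u in units)


def as_hex(data: bytes) -> str:
    """Format bytes the way they are written in this report, e.g. F0-9F-91-91."""
    return "-".join(f"{b:02X}" for b in data)


original = bytes.fromhex("F09F9191")      # bytes in the file before saving
character = original.decode("utf-8")      # a single code point, U+1F451
rewritten = encode_surrogates_separately(character)

print(as_hex(original))    # F0-9F-91-91
print(as_hex(rewritten))   # ED-A0-BD-ED-B1-91
```

The first four bytes of the result are ED-A0-BD-ED, the same as observed after saving. If this guess is right, the saved file is no longer valid UTF-8 at that position, which would explain why a strict consumer sees a different string.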
Result:
It is impossible to reconcile the files with EDP.
Expected behaviour:
EDP should not change any parts of the file behind the user's back.
Note:
The file is not a "real" XML file; it is just a settings file used by the application which has an XML structure. It has no BOM and can include non-encoded UTF-8 characters. I cannot make arbitrary changes to the file (like adding a BOM), since that would likewise break the application.
Note2: Treating the files as binary is not feasible.
Please fix ASAP!