MLang & MSXML6 don't like UTF-7
In some cases MLang (on which MSXML6 depends) can add extra ? characters to decoded UTF-7 data, which can cause UTF-7 encoded XML to fail to parse.
UTF-7 isn't a great encoding anyway, so this is just another reason to Please Avoid UTF-7.
In particular, I don't see much reason to use encodings other than UTF-8 or UTF-16 with XML data. XML is new enough that Unicode support exists for whatever the XML is being used for.
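If you're stuck with a UTF-7 encoded XML file, one option is to convert it to UTF-8 before handing it to a parser. Here's a rough sketch in Python of what that could look like. It doesn't touch MLang or MSXML6 at all; it relies on Python's built-in "utf-7" codec, and the function name utf7_xml_to_utf8 is made up for illustration. It also assumes the whole document really is UTF-7 and that any XML declaration sits at the very start of the data.

```python
import re
import xml.etree.ElementTree as ET


def utf7_xml_to_utf8(data: bytes) -> bytes:
    """Re-encode UTF-7 XML bytes as UTF-8 (hypothetical helper, not an MSXML API)."""
    # Decode with Python's built-in UTF-7 codec (assumes the input is valid UTF-7).
    text = data.decode("utf-7")
    # If there is an XML declaration with an encoding pseudo-attribute,
    # rewrite its value so it matches the new byte encoding.
    text = re.sub(
        r'^(<\?xml[^>]*?encoding\s*=\s*)(["\'])[^"\']*\2',
        r'\g<1>\g<2>UTF-8\g<2>',
        text,
        count=1,
    )
    return text.encode("utf-8")


# Usage: build a small UTF-7 sample, convert it, then parse the UTF-8 result.
sample = '<?xml version="1.0" encoding="UTF-7"?><r>caf\u00e9</r>'.encode("utf-7")
converted = utf7_xml_to_utf8(sample)
root = ET.fromstring(converted)
print(root.tag, root.text)
```

The declaration has to be rewritten along with the bytes; otherwise the parser would still be told the data is UTF-7 and the two would disagree.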
Comments
Anonymous
May 11, 2007
Not only is XML new enough to support Unicode, but UTF-8 and UTF-16 are mandatory: "All XML processors MUST accept the UTF-8 and UTF-16 encodings of Unicode 3.1" http://www.w3.org/TR/2006/REC-xml-20060816/#charsets The second I see an "XML parser" project explaining that there is no Unicode support "yet," I am out of there. Not Unicode == not compliant!
Anonymous
May 11, 2007
True, but... ahem... I know of at least one internal XML source file we have with a non-Unicode code page. (I just converted it so we're OK now :)