问题描述
我被要求解决文件中的编码问题。 预期使用UTF8,但实际上使用扩展ASCII。
巴西利亚; Eletrônicos eInformáticaCâmeras eAcessóriosmásica
实际上应该是:
巴西利亚电子信箱卡梅拉斯州Acessóriosmúsica
private static string FixEncodingIssues(string str)
{
string fixedStr = str;
foreach (keyvaluePair<string,string> pair in encodingErrosDic)
fixedStr = fixedStr.Replace(pair.Key,pair.Value);
return fixedStr;
}
private static Dictionary<string,string> encodingErrosDic = new Dictionary<string,string>()
{
{ "Ã","Ã" },{ "Ã\x81","Á" },{ "À","À" },{ "Â","Â" },{ "Ä","Ä" },{ "Ã…","Å" },{ "Ç","Ç" },{ "È","È" },{ "É","É" },{ "Ê","Ê" },{ "Ë","Ë" },{ "ÃŒ","Ì" },{ "Ã\x8D","Í" },{ "ÃŽ","Î" },{ "Ã\x8F","Ï" },{ "Ã\x90","Ð" },{ "Ñ","Ñ" },{ "Ã’","Ò"},{ "Ó","Ó" },{ "Ô","Ô" },{ "Õ","Õ" },{ "Ö","Ö" },{ "×","×" },{ "Ø","Ø" },{ "Ù","Ù" },{ "Ú","Ú" },{ "Û","Û" },{ "Ãœ","Ü" },{ "Ã\x9D","Ý" },{ "Ã\xA0","à" },{ "á","á" },{ "â","â" },{ "ã","ã" },{ "ä","ä" },{ "Ã¥","å" },{ "æ","æ" },{ "ç","ç" },{ "è","è" },{ "é","é" },{ "ê","ê"},{ "ë","ë" },{ "ì","ì" },{ "î","î" },{ "ï","ï" },{ "Ã\xAD","í" },{ "ð","ð" },{ "ñ","ñ" },{ "ò","ò" },{ "ó","ó" },{ "ô","ô" },{ "õ","õ" },{ "ö","ö" },{ "ø","ø" },{ "ù","ù" },{ "ú","ú" },{ "û","û" },{ "ü","ü" },{ "ý","ý" }
};
我想知道是否有更好的方法来解决此问题。我觉得我的解决方案太粗糙了,不适用于字典中未列出的字节。我想知道是否有一个更干净的解决方案,不涉及列出所有扩展案例,并用等效的UTF8值替换它们。
解决方法
暂无找到可以解决该程序问题的有效方法,小编努力寻找整理中!
如果你已经找到好的解决方法,欢迎将解决方案带上本链接一起发送给小编。
小编邮箱:dio#foxmail.com (将#修改为@)