如何将unicode文本转换为可读的utf8文本? [英] how to convert unicode text to utf8 text readable?

查看:126
本文介绍了如何将unicode文本转换为可读的utf8文本?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

关于Unicode和utf8,我遇到了一个严重问题,
我将一段阿拉伯语/波斯文字文件保存到记事本中并保存下来,现在我看到的信息是

  $$$ p> 

我的问题是如何取回我的数据,这对我来说很重要,请提前感谢

 字符串加扰= 
ÈåØæáåãæäÚÏÏãËáËíÑæÑÓããíãä;
byte [] bytes = Encoding.GetEncoding( windows-1252)。GetBytes(scrambled);
string plainText = Encoding.GetEncoding( windows-1256)。GetString(bytes);
Console.WriteLine(text);

纯文本输出为:
تواينسورسبرنامهعدددلخواهيروازوروديميگيره وبهطولهمونعددمثلثيرورسمميکنه


I got a serious problem regarding Unicode and utf8, I saved a paragraph of Arabic/Persian text file into notepad and saved it, now I saw my information like

Êæ Çíä ÓæÑÓ ÈÑäÇãå ÚÏÏ ÏáÎæÇåí Ñæ ÇÒ æÑæÏí ãííÑå æ Èå Øæá åãæä ÚÏÏ ãËáËí Ñæ ÑÓã ãí ˜äå 

my question is how to get back my data, it is important for me to get this data back, thanks in advance

解决方案

The paragraph was scrambled by saving as code page 1256 (Arabic/Persian), then interpreted as code page 1252 (Western Europe), and finally saved as Unicode text. You can use C# to reverse this procedure:

string scrambled = "Êæ Çíä ÓæÑÓ ÈÑäÇãå ÚÏÏ ÏáÎæÇåí Ñæ ÇÒ æÑæÏí ãííÑå æ " + 
                   "Èå Øæá åãæä ÚÏÏ ãËáËí Ñæ ÑÓã ãí ˜äå";
byte[] bytes = Encoding.GetEncoding("windows-1252").GetBytes(scrambled);
string plainText = Encoding.GetEncoding("windows-1256").GetString(bytes);
Console.WriteLine(text);

The plain text output is: "تو اين سورس برنامه عدد دلخواهي رو از ورودي ميگيره و به طول همون عدد مثلثي رو رسم مي کنه"

这篇关于如何将unicode文本转换为可读的utf8文本?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆