Force XDocument to write to String with UTF-8 encoding
Question
I want to be able to write XML to a String with the declaration and with UTF-8 encoding. This seems mighty tricky to accomplish.
I have read around a bit and tried some of the popular answers for this, but they all have issues. My current code correctly outputs UTF-8 but does not maintain the original formatting of the XDocument (i.e. indents / whitespace)!
Can anyone offer some advice please?
XDocument xml = new XDocument(new XDeclaration("1.0", "utf-8", "yes"), xelementXML);
MemoryStream ms = new MemoryStream();
using (XmlWriter xw = new XmlTextWriter(ms, Encoding.UTF8))
{
    xml.Save(xw);
    xw.Flush();
    StreamReader sr = new StreamReader(ms);
    ms.Seek(0, SeekOrigin.Begin);
    String xmlString = sr.ReadToEnd();
}
The XML needs to be formatted identically to the way .ToString() would format it, i.e.
<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<root>
  <node>blah</node>
</root>
What I am currently seeing is
<?xml version="1.0" encoding="utf-8" standalone="yes"?><root><node>blah</node></root>
Update
I have managed to get this to work by adding XmlTextWriter settings... It seems VERY clunky though!
MemoryStream ms = new MemoryStream();
XmlWriterSettings settings = new XmlWriterSettings();
settings.Encoding = Encoding.UTF8;
settings.ConformanceLevel = ConformanceLevel.Document;
settings.Indent = true;
using (XmlWriter xw = XmlTextWriter.Create(ms, settings))
{
    xml.Save(xw);
    xw.Flush();
    StreamReader sr = new StreamReader(ms);
    ms.Seek(0, SeekOrigin.Begin);
    String blah = sr.ReadToEnd();
}
Recommended answer
Try this:
using System;
using System.IO;
using System.Text;
using System.Xml.Linq;

class Test
{
    static void Main()
    {
        // Keep the whitespace that is already in the file.
        XDocument doc = XDocument.Load("test.xml",
                                       LoadOptions.PreserveWhitespace);
        doc.Declaration = new XDeclaration("1.0", "utf-8", null);
        StringWriter writer = new Utf8StringWriter();
        doc.Save(writer, SaveOptions.None);
        Console.WriteLine(writer);
    }

    // A StringWriter that reports UTF-8, so the saved declaration says utf-8.
    private class Utf8StringWriter : StringWriter
    {
        public override Encoding Encoding { get { return Encoding.UTF8; } }
    }
}
Of course, you haven't shown us how you're building the document, which makes it hard to test... I've just tried with a hand-constructed XDocument and that contains the relevant whitespace too.
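To see why the Utf8StringWriter subclass matters, here is a minimal sketch that saves the same document through a plain StringWriter and through the subclass. The EncodingDemo class, the Serialize helper and the hand-built document are hypothetical names introduced only for this illustration. As far as I know, a plain StringWriter reports UTF-16 as its Encoding, so the first declaration should come out as utf-16 regardless of what the XDeclaration says, while the second should say utf-8.

```csharp
using System;
using System.IO;
using System.Text;
using System.Xml.Linq;

public static class EncodingDemo
{
    // Same idea as the answer's Utf8StringWriter: only the reported encoding changes.
    public sealed class Utf8StringWriter : StringWriter
    {
        public override Encoding Encoding
        {
            get { return Encoding.UTF8; }
        }
    }

    // Hypothetical helper: saves the document to the given writer and returns the text.
    public static string Serialize(XDocument doc, StringWriter writer)
    {
        // Save(TextWriter) uses SaveOptions.None, so the output is indented.
        doc.Save(writer);
        return writer.ToString();
    }

    public static void Main()
    {
        // Assumed stand-in for the asker's document, built in code instead of loaded from a file.
        XDocument doc = new XDocument(
            new XDeclaration("1.0", "utf-8", "yes"),
            new XElement("root", new XElement("node", "blah")));

        Console.WriteLine("Plain StringWriter:");
        Console.WriteLine(Serialize(doc, new StringWriter()));

        Console.WriteLine("Utf8StringWriter:");
        Console.WriteLine(Serialize(doc, new Utf8StringWriter()));
    }
}
```

Note that the resulting string is still held as UTF-16 in memory, like every .NET string. The override only changes what the declaration says. If you need the actual UTF-8 bytes, pass the string to Encoding.UTF8.GetBytes.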