如何完成对UTF8文件的随机读取 [英] How do I accomplish random reads of a UTF8 file

查看：118 发布时间：2020/7/13 5:18:30 c# unicode utf-8 utf-16 utf8-decode

本文介绍了如何完成对UTF8文件的随机读取的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我的理解是，由于偶然的替代字节(例如，在东方语言中使用)，对UTF8或UTF16编码文件的读取不一定是随机的.

My understanding is that reads to a UTF8 or UTF16 Encoded file can't necessarily be random because of the occasional surrogate byte (used in Eastern languages for example).

如何使用.NET跳到文件中的大致位置，并从半随机位置读取unicode文本?

How can I use .NET to skip to an approximate position within the file, and read the unicode text from a semi-random position?

我是否丢弃代理字节并等待分词继续读取?如果是这样，有效字词是什么中断我应该等到开始解码吗?

Do I discard surrogate bytes and wait for a word break to continue reading? If so, what are the valid word breaks I should wait for until I start the decoding?

如何完成对UTF8文件的随机读取 [英] How do I accomplish random reads of a UTF8 file

问题描述

推荐答案

相关文章

C#/.NET最新文章

热门教程

热门工具

登录关闭

如何完成对UTF8文件的随机读取 [英] How do I accomplish random reads of a UTF8 file

问题描述

推荐答案

相关文章

C#/.NET最新文章

热门教程

热门工具

登录 关闭

登录关闭