使用 BOM 读取 UTF-8 文本文件 [英] Read a UTF-8 text file with BOM

查看：37 发布时间：2021/12/26 13:37:50 r unicode utf-8 character-encoding byte-order-mark

本文介绍了使用 BOM 读取 UTF-8 文本文件的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我有一个以字节顺序标记 (U+FEFF) 开头的文本文件.我正在尝试在 R 中读取文件.是否可以避免字节顺序标记?

I have a text file with Byte order mark (U+FEFF) at the beginning. I am trying to read the file in R. Is it possible to avoid the Byte order mark?

函数fread(来自data.table 包)读取文件，但在第一个开头添加ļ»æ变量名:

The function fread (from the data.table package) reads the file, but adds ļ»æ at the beginning of the first variable name:

> names(frame_pers)[1]
[1] "ļ»æreg_date"

read.csv 函数也是如此.

目前我已经做了一个从第一列名称中删除 BOM 的函数，但我相信应该有一种方法可以自动去除 BOM.

Currently I have made a function which removes the BOM from the first column name, but I believe there should be a way how to automatically strip the BOM.

remove.BOM <- function(x) setnames(x, 1, substring(names(x)[1], 4))

> names(frame_pers)[1]
[1] "ļ»æreg_date"
> remove.BOM(frame_pers)
> names(frame_pers)[1]
[1] "reg_date"

我正在为 R 会话使用本机编码:

I am using the native encoding for the R session:

> options("encoding" = "")
> options("encoding")
$encoding
[1] ""

使用 BOM 读取 UTF-8 文本文件 [英] Read a UTF-8 text file with BOM

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录关闭

使用 BOM 读取 UTF-8 文本文件 [英] Read a UTF-8 text file with BOM

问题描述

推荐答案

相关文章

其他开发最新文章

热门教程

热门工具

登录 关闭

登录关闭