在MongoDb中导入CSV文件(包含一些非UTF8字符) [英] Import CSV file (contains some non-UTF8 characters) in MongoDb
问题描述
如何将包含一些非UTF8字符的CSV文件导入MongoDB?
我尝试了一个推荐的导入代码。
How can I import a CSV file that contains some non-UTF8 characters to MongoDB? I tried a recommended importing code.
mongoimport --db dbname --collection colname --type csv --headerline --file D:/fastfood.xls
错误讯息
exception: Invalid UTF8 character detected
无效字符,但数据大小相当大。
I would remove those invalid characters manually, but the size of the data is considerably big.
尝试Google无法成功。
Tried Google with no success.
PS:mongo -v = 2.4.6
PS: mongo -v = 2.4.6
感谢。
BTW,我在Win7
BTW, I'm on Win7
推荐答案
在Linux中你可以使用 iconv
命令:如何从文本文件中删除非UTF-8字符
In Linux you could use the iconv
command as suggested in: How to remove non UTF-8 characters from text file
iconv -f utf8 -t utf8 -c file.txt
我不熟悉MongoDB,所以我不知道如何在导入期间保留无效字符。
I'm not familiar with MongoDB, so I have no insight on how to preserve the invalid characters during import.
这篇关于在MongoDb中导入CSV文件(包含一些非UTF8字符)的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!