Reading Google Ngram CSV files
Problem description
I'm trying to read the Google Ngram CSV files.
But I found that the contents I read out are different from what is described on the Google website.
The website: http://books.google.com/ngrams/datasets
The contents I read out look like this:
# 1574 1 1 1
# 1584 6 6 1
# 1614 1 1 1
# 1631 115 100 1
The description on the Google website looks like this:
circumvallate 1978 313 215 85
circumvallate 1979 183 147 77
So why do I get a '#' instead of the words?
Help me!
Thanks!
Recommended answer
The description is merely an example of the contents of one of the file types; it is not an absolute description of every file. The first item in each line is an ngram, which is composed of n tokens, as described in the link you posted above. In the lines you read out, the '#' is therefore itself the ngram (a single-token ngram), not a placeholder for a missing word.
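To make the point concrete, here is a minimal Python sketch of reading such lines. It assumes each line holds five tab-separated fields, as the examples above suggest: the ngram, the year, and three counts. The names `NgramRow`, `parse_ngram_lines`, `match_count`, `page_count` and `volume_count` are illustrative choices, not something confirmed by the thread, and the sample lines are the ones quoted in the question with tabs assumed as separators.

```python
from typing import Iterable, Iterator, NamedTuple


class NgramRow(NamedTuple):
    """One line of an ngram file (field names are assumptions)."""
    ngram: str
    year: int
    match_count: int
    page_count: int
    volume_count: int


def parse_ngram_lines(lines: Iterable[str]) -> Iterator[NgramRow]:
    """Yield one NgramRow per non-empty, tab-separated line."""
    for line in lines:
        line = line.rstrip("\r\n")
        if not line:
            continue
        # Split on tabs only: an ngram with n > 1 contains spaces
        # between its tokens, so splitting on whitespace would break it.
        ngram, year, matches, pages, volumes = line.split("\t")
        yield NgramRow(ngram, int(year), int(matches), int(pages), int(volumes))


# Sample lines taken from the question, with tabs assumed as separators.
sample = [
    "#\t1631\t115\t100\t1\n",
    "circumvallate\t1978\t313\t215\t85\n",
]

rows = list(parse_ngram_lines(sample))
for row in rows:
    print(row.ngram, row.year, row.match_count)
```

Both lines parse the same way: in the first one the ngram happens to be the punctuation token '#', in the second it is the word 'circumvallate'. To read a real file, an open file object can be passed in place of `sample`.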