撇号变成 \x92 [英] apostrophe turning into \x92

查看：47 发布时间：2021/6/26 19:28:05 python python-2.7 apostrophe

本文介绍了撇号变成 \x92的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

mycorpus.txt

Human where's machine interface for lab abc computer applications   
A where's survey of user opinion of computer system response time

stopwords.txt

let's
ain't
there's

以下代码

corpus = set()
for line in open("path\\to\\mycorpus.txt"):
    corpus.update(set(line.lower().split()))
print corpus

stoplist = set()
for line in open("C:\\Users\\Pankaj\\Desktop\\BTP\\stopwords_new.txt"):
    stoplist.add(line.lower().strip())
print stoplist

给出以下输出

set(['a', "where's", 'abc', 'for', 'of', 'system', 'lab', 'machine', 'applications', 'computer', 'survey', 'user', 'human', 'time', 'interface', 'opinion', 'response'])
set(['let\x92s', 'ain\x92t', 'there\x92s'])

为什么在第二组中撇号变成了\x92?

Why is the apostrophe turning into \x92 in the 2nd set??

撇号变成 \x92 [英] apostrophe turning into \x92

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录关闭

撇号变成 \x92 [英] apostrophe turning into \x92

问题描述

推荐答案

相关文章

Python最新文章

热门教程

热门工具

登录 关闭

登录关闭