阿拉伯文字的Python ISRIStemmer [英] Python ISRIStemmer for Arabic text

查看：135 发布时间：2020/7/13 3:10:00 python utf-8 arabic stemming

本文介绍了阿拉伯文字的Python ISRIStemmer的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

我正在IDLE(Python)上运行以下代码，我想输入阿拉伯字符串并获取其词根，但实际上不起作用

I am running the following code on IDLE(Python) and I want to enter Arabic string and get the stemming for it but actually it doesn't work

">>>从nltk.stem.isri导入ISRIStemmer

">>> from nltk.stem.isri import ISRIStemmer

">>> st = ISRIStemmer()

">>> w ='حركات'

">>> w= 'حركات'

">>> join = w.decode('Windows-1256')

">>>打印st.stem(join).encode('Windows-1256').decode('utf-8')

">>> print st.stem(join).encode('Windows-1256').decode('utf-8')

运行它的结果是w中的相同文本，即'حركات'而不是词干

The result of running it is the same text in w which is 'حركات' which is not the stem

但是何时执行以下操作:

but when do the following:

">>>打印st.stem(u'اعلاميون')

">>> print st.stem(u'اعلاميون')

结果成功并返回'علم'

the result succeeded and returns the stem which is 'علم'

为什么将变量传递给stem()函数不会返回主干.

why passing variable to stem() function doesn't return the stem.