Split text into sentences


Problem Description

I wish to split text into sentences. Can anyone help me?

I also need to handle abbreviations. However, my plan is to replace these at an earlier stage: Mr. -> Mister
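That earlier replacement stage could be sketched with a simple lookup table (the table below is a hypothetical example, not part of the question):

```python
# Hypothetical abbreviation table; extend as needed.
ABBREVIATIONS = {"Mr.": "Mister", "Dr.": "Doctor", "Prof.": "Professor"}

def expand_abbreviations(text):
    """Replace dotted abbreviations before sentence splitting."""
    for abbr, expansion in ABBREVIATIONS.items():
        text = text.replace(abbr, expansion)
    return text

print(expand_abbreviations("Mr. Smith saw Dr. Jones."))
# Mister Smith saw Doctor Jones.
```

With the dots removed from abbreviations up front, a naive punctuation-based splitter has fewer false sentence boundaries to trip over.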

import re
import unittest


class Sentences:

    def __init__(self, text):
        # Split on ., !, or ? followed by a whitespace character.
        self.sentences = tuple(re.split(r"[.!?]\s", text))


class TestSentences(unittest.TestCase):

    def testFullStop(self):
        self.assertEqual(Sentences("X. X.").sentences, ("X.", "X."))

    def testQuestion(self):
        self.assertEqual(Sentences("X? X?").sentences, ("X?", "X?"))

    def testExclaimation(self):
        self.assertEqual(Sentences("X! X!").sentences, ("X!", "X!"))

    def testMixed(self):
        self.assertEqual(Sentences("X! X? X! X.").sentences, ("X!", "X?", "X!", "X."))


if __name__ == "__main__":
    unittest.main()

Thanks, Barry

To start with, I would be happy to satisfy the four tests I've included above. This would help me understand better how regexes work. For now I can define a sentence as "X." etc., as defined in my tests.
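For what it's worth, the `re.split` pattern above consumes the punctuation it matches, so the four tests fail as written. A lookbehind keeps the punctuation attached to the preceding sentence and satisfies the tests (a minimal sketch; it will still break on dotted abbreviations):

```python
import re

def split_sentences(text):
    # (?<=[.!?]) matches *after* ., !, or ?, so re.split consumes
    # only the whitespace and the punctuation stays with the sentence.
    return tuple(re.split(r"(?<=[.!?])\s+", text))

print(split_sentences("X! X? X! X."))
# ('X!', 'X?', 'X!', 'X.')
```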

Solution

Sentence segmentation can be a very difficult task, especially when the text contains dotted abbreviations. It may require the use of a list of known abbreviations, or training a classifier to recognize them.

I suggest you use NLTK - a suite of open-source Python modules designed for natural language processing.

You can read about sentence segmentation using NLTK here, and decide for yourself whether this tool fits your needs.

EDITED: or, even simpler, here - and here is the source code. This is the Punkt sentence tokenizer, included in NLTK.

