怎么给句子加标点符号? [英] How to add punctuation marks for the sentences?

查看:136
本文介绍了怎么给句子加标点符号?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

如何解决构建标点预测器的问题?

可以在此链接中找到该问题的工作演示.

The working demo for the question can be found in this link.

输入文本如下:

"its   been   a   little   while   Kirk   tells   me its   actually   been
three   weeks   now   that Ive   been   using   this   device   right   here
that   is   of   course   the   Galaxy   S   ten   I mean   Ive   just   been
living   with   this phone   this   has   been   my   phone   has   the   SIM
card   in   it   I   took   photos I   lived   live   I   sent   tweets whatsapp
slack   email   whatever   other   app   this   was my   smart phone"

推荐答案

预测文本的标点符号(尤其是语音转录)是一个众所周知的问题.

Predicting punctuation for text (in particular for speech transcriptions) is a well-known problem.

您可以尝试将 Punctuator2 与提供的模型一起使用,或者通过训练来自文本的新模型你的域.查看 README 底部的一些相关项目的指针.

You could try using Punctuator2, either with the provided models or by training new models for text from your domain. Look at the bottom of the README for pointers to some related projects.

Grammarly 开发了一种更简单的方法,仅在连续句子之间插入句点,描述如下:

Grammarly developed a simpler approach for only inserting periods between run-on sentences, described here:

https://www.grammarly.com/blog/nlp-连续句/

他们对真实与人工训练数据做了一些很好的实验,这很有用,因为可以很容易地从你知道在句子边界有可靠标点符号的文本中生成训练数据,比如报纸文本.

They did some nice experiments with real vs. artificial training data, which is useful because it's easy to generate training data from texts that you know have reliable punctuation at sentence boundaries, like newspaper text.

这篇关于怎么给句子加标点符号?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆