Chunking English words into graphemes corresponding to distinct sounds


Question

How can I convert an English input word into its sequence of graphemes? Is there a library or function that does the job?

What I'm looking for is an algorithm or implementation that splits orthographic words into segments that map to phonemes. That is, the sequence of letters in a word should be broken between distinct sounds.

To my mind, this would look something like the following:

physically --> ph-y-s-i-c-a-ll-y
psychology --> ps-y-ch-o-l-o-g-y
thrush --> th-r-u-sh
bought --> b-ough-t
chew --> ch-ew
palm --> p-al-m

Recommended answer

Googling for "split english words into graphemes", the first result appears to be a paper about mapping English orthography onto a phonemic representation using a machine learning approach. That paper appears to be doing the kind of thing you're looking for.
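For comparison, here is a minimal rule-based sketch of the segmentation described in the question. It is not the method from the paper: it is a plain greedy longest-match over a small, hand-picked list of multi-letter graphemes. The names `MULTIGRAPHS` and `split_graphemes` are hypothetical, and the inventory is an assumption chosen only to cover the question's examples.

```python
# Hypothetical, hand-picked inventory of multi-letter graphemes.
# It is far from exhaustive; a real inventory would be much larger.
MULTIGRAPHS = {"ough", "ch", "sh", "th", "ph", "ps", "ll", "ew", "al"}


def split_graphemes(word, multigraphs=MULTIGRAPHS):
    """Split a word greedily, preferring the longest known grapheme."""
    word = word.lower()
    max_len = max(map(len, multigraphs), default=1)
    pieces = []
    i = 0
    while i < len(word):
        # Try the longest candidate first; a single letter always matches.
        for size in range(min(max_len, len(word) - i), 0, -1):
            chunk = word[i:i + size]
            if size == 1 or chunk in multigraphs:
                pieces.append(chunk)
                i += size
                break
    return pieces


if __name__ == "__main__":
    for w in ["physically", "psychology", "thrush", "bought", "chew", "palm"]:
        print(w, "-->", "-".join(split_graphemes(w)))
```

With this inventory, five of the six examples from the question come out as requested, but "physically" is split as ph-y-s-i-c-al-l-y rather than ph-y-s-i-c-a-ll-y, because the matcher takes "al" before it can see "ll". Whether a letter group acts as one grapheme depends on the surrounding word, which a fixed list cannot capture; that is presumably why the paper uses a learned model.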
