如何从句子中分离单词 [英] how to separate words from a sentence

查看:108
本文介绍了如何从句子中分离单词的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

假设我有一个语音信号(可以说持续5秒).单词被隔离,即单词被短暂的停顿分开,但是单词总数在语音信号中是未知的.我该如何区分这些词?为了论证,让我们说信号中有5个单词,如何分离这些单词并将它们存储在5个不同的变量中?
在此先谢谢您.

suppose I have a speech signal (lets say, of 5 seconds duration). The words are isolated i.e. words are separated by brief pauses, but total number of words are unknown in the speech signal. How can I separate these words? For the argument sake, lets say there are 5 words in the signal, how can I separate these words as well as store them in 5 different variables?
Thanks in advance.

推荐答案

会员7702812,

我想我确实理解您的问题:)

有趣的话题和很多解决方案... :(

答案是:
将假想指针设置为初始样本的开头.从N个样本的初始集合中获取n个样本(其中n远小于N),并为其求幂(fe汇总abs值).然后将假想指针偏移某个偏移量并取n个样本并进行相同的操作...依此类推...
单词的结尾将位于先前功率相对于信号当前功率将增大某个阈值的位置."

我认为这个主意很清楚.如果没有-我可以通过代码段为您提供...

真是太幸运了!
Hi Member 7702812,

I think I do understand your question:)

Funny topic and so much solutions...:(

The answer is:
"Set imaginary pointer to the beginning of the initial samples. Take n samples from initial set of N (where n much less than N) samples and find the power for it (f.e. summarize abs values). Then shift your imaginary pointer to some shift and also take n samples and do the same... and so on...
The end of the word will be in the place where previous power will be greater by some threshold relatively to the current power of the signal."

I think the idea is clear to you. If no - I can provide you by code snippet...

God luck!


我进入Google搜索并做了类似的讨论 [ MATLAB拆分 [
I went into Google search and did this[^] and got the first link as:
Similar discussion[^]

Second link as: MATLAB Split[^]
I guess, you would be able to move ahead now.


这篇关于如何从句子中分离单词的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆