How to start a voice quality PESQ test?
Question
I'm trying to run a voice quality test (PESQ), but I don't understand how to get started. I tried to compile the public source code (http://www.itu.int/itu-t/recommendations/index.aspx?ser=P (P.862)), but I can't start a test. Has anyone worked with this?
Recommended answer
You will need a C compiler. The ITU PESQ reference implementation is plain C, so you don't need a C++ compiler, although either should work fine.
For instance, on Linux, you would enter the source directory and compile with gcc:
$ cd Software/P862_annex_A_2005_CD/source
$ gcc -o PESQ *.c -lm
This compiles the files dsp.c, pesqdsp.c, pesqio.c, pesqmain.c and pesqmod.c into a binary named PESQ, which you can then run with ./PESQ:
$ ./PESQ
Perceptual Evaluation of Speech Quality (PESQ)
Reference implementation for ITU-T Recommendations P.862, P.862.1 and P.862.2.
Version 2.0 October 2005.
<snip long unenlightening IP notice>
Usage:
PESQ HELP Displays this text
PESQ [options] ref deg
Run model on reference ref and degraded deg
Options: +8000 +16000 +swap +wb
Sample rate - No default. Must select either +8000 or +16000.
Swap byte order - machine native format by default. Select +swap for byteswap.
Default mode of operation is P.862 (narrowband handset listening). Select +wb
to use P.862.2 wideband extension (headphone listening).
File names may not begin with a + character.
Files with names ending .wav or .WAV are assumed to have a 44-byte header, which is automatically skipped. All other file types are assumed to have no header.
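Since the usage text says the sample rate has no default and must be +8000 or +16000, it can help to check a file before handing it to PESQ. The following is a minimal sketch in Python using the standard `wave` module; the function name `check_pesq_input` is hypothetical, and the mono and 16-bit checks are assumptions about what the reference implementation expects rather than something the usage text states.

```python
import wave


def check_pesq_input(path):
    """Return the PESQ sample-rate flag ("+8000" or "+16000") for a WAV file.

    Raises ValueError when the file does not match the expected input.
    The mono and 16-bit checks are assumptions, not taken from the usage text.
    """
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
        sample_width = wav.getsampwidth()  # bytes per sample

    if rate not in (8000, 16000):
        raise ValueError(
            f"{path}: sample rate is {rate} Hz, but PESQ accepts only 8000 or 16000"
        )
    if channels != 1:
        raise ValueError(f"{path}: expected mono audio, found {channels} channels")
    if sample_width != 2:
        raise ValueError(
            f"{path}: expected 16-bit samples, found {8 * sample_width}-bit"
        )
    return f"+{rate}"
```

Note that this does not verify the 44-byte header assumption mentioned above. A .wav file carrying extra metadata chunks has a longer header, and this sketch would not detect that.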
To run this binary and test your algorithm, you need a "reference" .wav file (the clean, original speech) and a "degraded" .wav file (the output of your algorithm). Simply pass both to PESQ, and it will print the result of the test. An example run on two .wav files included in the ITU source distribution:
$ cd Software/P862_annex_A_2005_CD/conform
$ ../source/PESQ +8000 or105.wav dg105.wav
Perceptual Evaluation of Speech Quality (PESQ)
Reference implementation for ITU-T Recommendations P.862, P.862.1 and P.862.2.
Version 2.0 October 2005.
<snip IP notice>
Reading reference file or105.wav...done.
Reading degraded file dg105.wav...done.
Level normalization...
IRS filtering...
Variable delay compensation...
Acoustic model processing...
P.862 Prediction (Raw MOS, MOS-LQO): = 2.237 1.844
Here the +8000 parameter indicates that the .wav files are sampled at 8000 Hz.
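If you want to collect scores from many runs, the final line of the output can be parsed programmatically. Below is a minimal Python sketch; the function names are hypothetical, and the regular expression targets only the narrowband result line shown in the example run (the +wb output is not covered). The second function applies the raw-score to MOS-LQO mapping from ITU-T P.862.1, which reproduces the second number on that line.

```python
import math
import re

# Matches the narrowband result line, e.g.
# "P.862 Prediction (Raw MOS, MOS-LQO): = 2.237 1.844"
_RESULT = re.compile(
    r"Prediction \(Raw MOS, MOS-LQO\):\s*=\s*(-?\d+\.\d+)\s+(-?\d+\.\d+)"
)


def parse_pesq_scores(output):
    """Extract (raw_mos, mos_lqo) from the text printed by a PESQ run."""
    match = _RESULT.search(output)
    if match is None:
        raise ValueError("no PESQ prediction line found in output")
    return float(match.group(1)), float(match.group(2))


def raw_to_mos_lqo(raw_mos):
    """Map a raw P.862 score to MOS-LQO using the P.862.1 formula (narrowband)."""
    return 0.999 + 4.0 / (1.0 + math.exp(-1.4945 * raw_mos + 4.6607))


sample = "P.862 Prediction (Raw MOS, MOS-LQO):  = 2.237 1.844"
raw, lqo = parse_pesq_scores(sample)
print(raw, lqo)                       # the two scores from the sample line
print(round(raw_to_mos_lqo(raw), 3))  # recomputed MOS-LQO
```

In practice the `output` string could come from something like `subprocess.run(["../source/PESQ", "+8000", "or105.wav", "dg105.wav"], capture_output=True, text=True).stdout`, using the paths from the example run.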