检测Speech
检测音频信号中语音的边界
句法
Description
检测Speech(___)
没有输出参数将显示输入信号中检测到的语音区域的图。
Examples
输入参数
输出参数
算法
这检测Speech
算法基于[[1],,,,although modified so that the statistics to threshold are short-term energy and spectral spread, instead of short-term energy and spectral centroid. The diagram and steps provide a high-level overview of the algorithm. For details, see[[1]。
这audio signal is converted to a time-frequency representation using the specified
窗户
和OverlapLength
。这short-term energy and spectral spread is calculated for each frame. The spectral spread is calculated according to
Spectralspread
。为短期能量和光谱扩散分布创建直方图。
对于每个直方图,根据 ,,,,wherem1和m2分别是第一本和第二个本地最大值。wis set to
5
。Both the spectral spread and the short-term energy are smoothed across time by passing through successive five-element moving median filters.
通过比较短期能量和光谱扩散与各自的阈值来创建面具。要将框架声明为包含语音,必须高于其阈值。
面具合并。为了将框架宣布为语音,短期能量和光谱传播都必须超过其各自的阈值。
如果它们之间的距离小于
合并
。
References
[1] Theodoros的Giannakopoulos。“一种在MATLAB中实施的语音信号的沉默和分割的方法”(雅典,雅典,2009年)。