Speaker recognition

Ivenky
Ivenky on 14 Oct 2011
Answered: Brian Hemmat on 20 Mar 2020
I am going to do a project on speaker recognition (not speech recognition). I looked at many files on the internet and came across many methods.
In the first part of the project, the MFCCs are computed; after that we have to do pattern recognition.
Some of the pattern recognition algorithms that I came across: 1) VQ algorithm followed by LBG algorithm for clustering. 2) K-means algorithm.
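As a rough illustration of the VQ idea listed above (a sketch under assumptions, not code from any toolbox): one codebook is trained per speaker by clustering that speaker's feature vectors, and a test utterance is assigned to the speaker whose codebook gives the lowest average distortion. The Python below uses plain k-means; the LBG splitting step is not implemented. The 2-D toy vectors stand in for MFCC frames, and `train_codebook`, `distortion`, `identify`, and the speaker names are hypothetical.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def train_codebook(frames, k, iters=20, seed=0):
    """Plain k-means: returns k centroids (one speaker's codebook)."""
    rng = random.Random(seed)
    codebook = rng.sample(frames, k)  # initialise from k random frames
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for f in frames:
            nearest = min(range(k), key=lambda j: sq_dist(f, codebook[j]))
            clusters[nearest].append(f)
        for j, members in enumerate(clusters):
            if members:  # an empty cluster keeps its previous centroid
                dim = len(members[0])
                codebook[j] = [sum(m[d] for m in members) / len(members)
                               for d in range(dim)]
    return codebook

def distortion(frames, codebook):
    """Average squared distance from each frame to its nearest codeword."""
    return sum(min(sq_dist(f, c) for c in codebook) for f in frames) / len(frames)

def identify(frames, codebooks):
    """Name of the speaker whose codebook has the lowest distortion."""
    return min(codebooks, key=lambda name: distortion(frames, codebooks[name]))

# Toy data: 2-D points standing in for MFCC frames (made up for illustration).
data_rng = random.Random(1)

def toy_frames(cx, cy, n):
    return [[cx + data_rng.gauss(0, 0.5), cy + data_rng.gauss(0, 0.5)]
            for _ in range(n)]

codebooks = {
    "speaker_a": train_codebook(toy_frames(0.0, 0.0, 50), k=4),
    "speaker_b": train_codebook(toy_frames(5.0, 5.0, 50), k=4),
}
test_frames = toy_frames(5.0, 5.0, 20)
print(identify(test_frames, codebooks))
```

With real data, the frames would be the MFCC vectors of each utterance, and the codebook size would need tuning.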
I have another idea. I know neural networks for pattern recognition in image processing. Will that work with speaker recognition?
Is there any code on MATLAB Central for speaker recognition? If you have done this project before, please tell me the method that you followed.
Thanks in advance.
Eagerly awaiting your response.
2 Comments
Image Analyst
Image Analyst on 15 Oct 2011
From the subject line I thought he was talking about speaker identification (recognizing a particular speaker and extracting his speech), like the "cocktail party problem" http://research.ics.tkk.fi/ica/cocktail/cocktail_en.cgi but I've heard of that being solved with ICA, not the acronyms he listed.


Answers (8)

William
William on 14 Oct 2011
Many use a Gaussian Mixture Model (GMM) after computing the MFCCs. There is a really good toolbox for these operations called "voicebox". It is a collection of functions that allow you to extract and classify data from speech loaded via wavread().
1 Comment
Ivenky
Ivenky on 14 Oct 2011
You mean the voicebox file uploaded on the internet?
Is GMM better than the other methods? If you have done speaker recognition before, please tell me how you did it.



Ivenky
Ivenky on 14 Oct 2011
William, I am waiting for your response.

William
William on 14 Oct 2011
Look over this website. I had to do this a year ago for a class, and this is exactly what I followed.
Here is the link to voicebox.
2 Comments
William
William on 14 Oct 2011
It isn't terrible. If you collect a lot of data using MFCC, it may be more accurate than a model, but the processing time will slow things down.



Ivenky
Ivenky on 14 Oct 2011
Do I have to use the gaussmix function of voicebox?
Thanks in advance.

William
William on 14 Oct 2011
There are numerous GMM algorithms that could be used to do this. Find one that you understand, so that if it ever stops working you can figure out why.
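For anyone who wants to see what such an algorithm does internally, here is a minimal sketch of EM fitting for a one-dimensional Gaussian mixture, written in plain Python. It is not the voicebox implementation; `fit_gmm_1d`, the initial means, and the toy data are assumptions made for illustration. Real MFCC features are multi-dimensional, so a practical version would need vector means and (usually diagonal) covariances.

```python
import math
import random

def normal_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2 * math.pi * var)

def fit_gmm_1d(data, means, iters=50, var_floor=1e-6):
    """EM for a 1-D Gaussian mixture; `means` holds the initial guesses."""
    k = len(means)
    means = list(means)
    variances = [1.0] * k
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: responsibility of each component for each sample.
        resp = []
        for x in data:
            p = [weights[j] * normal_pdf(x, means[j], variances[j])
                 for j in range(k)]
            total = sum(p) or 1e-300  # guard against numerical underflow
            resp.append([pj / total for pj in p])
        # M-step: re-estimate weight, mean, and variance of each component.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj == 0:
                continue  # component received no mass; leave it unchanged
            weights[j] = nj / len(data)
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            variances[j] = max(
                var_floor,
                sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj,
            )
    return weights, means, variances

# Toy data: two well-separated clusters (made up for illustration).
rng = random.Random(0)
data = ([rng.gauss(0.0, 1.0) for _ in range(200)]
        + [rng.gauss(10.0, 1.0) for _ in range(200)])
weights, means, variances = fit_gmm_1d(data, means=[1.0, 8.0])
print(weights, means, variances)
```

The variance floor is one of the details worth understanding: without it, a component that collapses onto a single point can produce a division by zero.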
2 Comments
KML
KML on 22 Dec 2011
Did you finish the program?



Ivenky
Ivenky on 14 Oct 2011
What do you think about neural networks? They worked well when I did image processing. Would they be suitable for this?
2 Comments
Ivenky
Ivenky on 15 Oct 2011
Hello Greg Heath. I am not very familiar with the nn toolbox. I used the GUI for image processing. I usually use the neural network pattern recognition tool with a two-layer feed-forward network. I tried the same for voice recognition and it didn't work, so I think I should go for some other nn method.
In the nn toolbox, how would you select MLP and RBF?
Thanks in advance.



William
William on 14 Oct 2011
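I don't have much experience with neural networks. If you understand them, give it a try and see what happens. Worst case, you lose a few hours of your time.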
1 Comment
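Ivenky
Ivenky on 15 Oct 2011
Hello William. I found the mean of the log likelihood for the given input. After that, how do you find the best match? Each model has about 12 means (12 components in the GMM). In that case, how do I compare the models based on the mean of the log likelihood? (If each model had only one mean, I would find the maximum of the log likelihood and choose the best match, but here I have 12 means, so I am confused.)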
Thanks in advance.
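One common way to handle this, sketched in Python under assumptions: the 12 components are not compared one by one. For each frame x, the likelihood under a speaker's model is the weighted sum over all of its components, p(x) = sum_j w_j N(x; mu_j, var_j). Taking the log and averaging over all frames gives a single score per speaker model, and the model with the largest score is the best match. The sketch assumes diagonal covariances, uses two components per model instead of 12 to stay short, and all names and parameter values are made up for illustration.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a Gaussian with diagonal covariance."""
    return -0.5 * sum(math.log(2 * math.pi * v) + (xi - m) ** 2 / v
                      for xi, m, v in zip(x, mean, var))

def frame_loglik(x, gmm):
    """log( sum_j w_j N(x; mu_j, var_j) ), computed with log-sum-exp."""
    terms = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
    top = max(terms)
    return top + math.log(sum(math.exp(t - top) for t in terms))

def avg_loglik(frames, gmm):
    """One score per model: mean log likelihood over all frames."""
    return sum(frame_loglik(x, gmm) for x in frames) / len(frames)

def best_speaker(frames, models):
    """Return the best-scoring speaker name and all scores."""
    scores = {name: avg_loglik(frames, gmm) for name, gmm in models.items()}
    return max(scores, key=scores.get), scores

# Each model is a list of (weight, mean, variance) components.
models = {
    "speaker_a": [(0.5, [0.0, 0.0], [1.0, 1.0]),
                  (0.5, [2.0, 2.0], [1.0, 1.0])],
    "speaker_b": [(0.3, [6.0, 6.0], [1.0, 1.0]),
                  (0.7, [8.0, 5.0], [1.0, 1.0])],
}
frames = [[7.5, 5.2], [6.1, 5.9], [8.2, 4.7]]  # stand-ins for MFCC frames
name, scores = best_speaker(frames, models)
print(name, scores)
```

The log-sum-exp step matters with real MFCC data, because per-component densities in 12 or more dimensions can underflow if they are summed directly.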



Brian Hemmat
Brian Hemmat on 20 Mar 2020
