Main Content

Audio Processing Using Deep Learning

通过音频和语音处理应用扩展深度学习工作流程

Apply deep learning to audio and speech processing applications by using Deep Learning Toolbox™ together with Audio Toolbox™. For signal processing applications, seeSignal Processing Using Deep Learning。For applications in wireless communications, see使用深度学习的无线通信

应用

信号标签 Label signal attributes, regions, and points of interest, and extract features

功能

expand all

audiodatastore 用于收集音频文件的数据存储
Audiodataaugmenter Augment audio data
audioFeatureExtractor Streamline audio feature extraction
Openl3embeddings Extract OpenL3 feature embeddings
pitchnn 通过深度学习神经网络估计音调
vggishEmbeddings Extract VGGish feature embeddings
分类 Classify sounds in audio signal
crepe 可丽饼神经网络
Crepepreprecess 可丽饼深度学习网络的预处理音频
crepePostprocess Postprocess output of CREPE deep learning network
OpenL3 OpenL3 neural network
Openl3embeddings Extract OpenL3 feature embeddings
OpenL3Preprocess Preprocess audio for OpenL3 feature extraction
pitchnn 通过深度学习神经网络估计音调
vggish VGGish neural network
vggishEmbeddings Extract VGGish feature embeddings
VGGISHPRECESS VGGISH功能提取的预处理音频
yamnet Yamnet神经网络
yamnetGraph Yamnet Audioset本体论图
yamnetPreprocess Preprocess audio for YAMNet classification

Blocks

VGGish VGGish embeddings extraction network
vggish嵌入 提取vggish嵌入
Yamnet Yamnetsound classification network
Sound Classifier Classify sounds in audio signal

话题