Pretrained Models

Transfer learning, sound classification, feature embeddings, pretrained audio deep learning networks

Audio Toolbox™ provides MATLAB® and Simulink® support for pretrained audio deep learning networks. Locate and classify sounds with YAMNet and estimate pitch with CREPE. Extract VGGish or OpenL3 feature embeddings to use as input to machine learning and deep learning systems. Use i-vector systems to produce compact representations of audio signals for applications such as speaker recognition, verification, identification, and diarization. Use the pretrained voice activity detection (VAD) network to detect speech in audio signals.

Using pretrained deep learning networks requires Deep Learning Toolbox™. The Audio Toolbox pretrained networks are available in Deep Network Designer (Deep Learning Toolbox).
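As a rough illustration of the sound classification and pitch estimation workflows mentioned above, the following sketch runs both on a synthetic tone. `classifySound` appears in the function listing on this page; `pitchnn` is assumed here to be the CREPE-based pitch function that the listing describes without naming. The test signal and variable names are illustrative only, and both calls assume the pretrained models are already installed.

```matlab
% Sketch: classify sounds and estimate pitch in a synthetic signal.
% Assumes Audio Toolbox, Deep Learning Toolbox, and the pretrained
% YAMNet and CREPE model files are available on this machine.
fs = 16e3;                          % sample rate in Hz
t = (0:1/fs:2-1/fs)';               % two seconds of time samples (column)
audioIn = 0.5*sin(2*pi*220*t);      % 220 Hz test tone (illustrative input)

% Detected sound classes, returned as a string array (may be empty).
sounds = classifySound(audioIn, fs);
disp(sounds)

% Pitch estimates in Hz, one per analysis frame. Frames whose confidence
% falls below the function's threshold can be NaN, so ignore NaN values.
% pitchnn is an assumed function name (not shown in the listing).
f0 = pitchnn(audioIn, fs);
fprintf('Median pitch estimate: %.1f Hz\n', median(f0, 'omitnan'));
```

Both functions take the raw signal and its sample rate, so no manual preprocessing is needed in this high-level path.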

Functions

VGGish

	Extract VGGish feature embeddings
`vggish`	VGGish neural network
	Preprocess audio for VGGish feature extraction
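
A minimal sketch of extracting VGGish embeddings for use as features in a downstream model. The function name `vggishEmbeddings` does not appear in the listing above and is assumed here to be the embedding-extraction function it describes; the noise input is a stand-in for real audio.

```matlab
% Sketch: extract VGGish feature embeddings from an audio signal.
% Assumes the pretrained VGGish model is installed.
fs = 16e3;                          % sample rate in Hz
audioIn = 0.1*randn(2*fs, 1);       % two seconds of noise (placeholder audio)

% vggishEmbeddings is an assumed function name (not shown in the listing).
% Each row of the result is one 128-element embedding for one analysis frame.
embeddings = vggishEmbeddings(audioIn, fs);
disp(size(embeddings))
```

The rows can be passed as feature vectors to a classifier, or kept as a sequence for a recurrent network.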

YAMNet

`classifySound`	Classify sounds in audio signal
`yamnet`	YAMNet neural network
	Graph of YAMNet AudioSet ontology
	Preprocess audio for YAMNet classification

OpenL3

	Extract OpenL3 feature embeddings
	OpenL3 neural network
	Preprocess audio for OpenL3 feature extraction

CREPE

	Estimate pitch with deep learning neural network
	CREPE neural network
	Preprocess audio for CREPE deep learning network
	Postprocess output of CREPE deep learning network

i-vectors

	Pretrained speaker recognition system
`ivectorSystem`	Create i-vector system
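
The sketch below shows one way the pretrained speaker recognition system might be used to obtain a compact representation of a recording. `speakerRecognition` is assumed to be the function behind the "pretrained speaker recognition system" entry, and `ivector` is assumed to be the extraction method of the returned `ivectorSystem` object. The audio file is a sample recording assumed to ship with Audio Toolbox.

```matlab
% Sketch: extract an i-vector from a speech recording.
% Assumes the pretrained speaker recognition model is installed.
% The file name below is an assumption (a sample file on the MATLAB path).
[audioIn, fs] = audioread("Counting-16-44p1-mono-15secs.wav");

% speakerRecognition is an assumed function name (not shown in the listing).
ivs = speakerRecognition;

% Match the sample rate the system expects before extracting the i-vector.
audioIn = resample(audioIn, ivs.SampleRate, fs);

% One compact vector representing the whole recording.
w = ivector(ivs, audioIn);
disp(size(w))
```

Vectors obtained this way can be compared across recordings for verification or identification, or clustered for diarization.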

VAD

	Detect boundaries of speech in audio signal using AI
	Voice activity detection (VAD) neural network
	Preprocess audio for voice activity detection (VAD) network
	Postprocess frame-based VAD probabilities
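
A short sketch of the high-level VAD workflow. `detectspeechnn` is assumed to be the function behind the "detect boundaries of speech" entry, since the listing does not show its name. The audio file is a sample recording assumed to ship with Audio Toolbox.

```matlab
% Sketch: find speech regions in a recording with the pretrained VAD network.
% Assumes the pretrained VAD model is installed.
% The file name below is an assumption (a sample file on the MATLAB path).
[audioIn, fs] = audioread("Counting-16-44p1-mono-15secs.wav");

% detectspeechnn is an assumed function name (not shown in the listing).
% Each row of roi holds the first and last sample index of one speech region.
roi = detectspeechnn(audioIn, fs);

% Convert sample indices to seconds and print one line per region.
roiSeconds = (roi - 1)/fs;
fprintf('Speech from %.2f s to %.2f s\n', roiSeconds.');
```

The preprocess and postprocess entries in this group suggest a lower-level path (features, network, then frame probabilities) for cases where this single call is not flexible enough.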

Blocks

VGGish

	Extract VGGish embeddings
	Preprocess audio for VGGish feature extraction
	VGGish embeddings extraction network

YAMNet

	Classify sounds in audio signal
	YAMNet sound classification network
	Preprocess audio for YAMNet classification

OpenL3

	Extract OpenL3 embeddings
	Preprocess audio for OpenL3 embeddings extraction
	OpenL3 embeddings extraction network

CREPE

	Estimate pitch with CREPE deep learning neural network
	CREPE deep pitch estimation neural network
	Preprocess audio for CREPE deep pitch estimation
	Postprocess output of CREPE pitch estimation network

Apps

Deep Network Designer

Design, visualize, and train deep learning networks

Topics

Configure an experiment that compares the performance of multiple pretrained networks applied to a speech command recognition task using transfer learning.
(Simulink Support Package for Android Devices)
This example shows how to use the Simulink® Support Package for Android™ Devices and a pretrained YAMNet network to classify human voices.
