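Feature Extraction

Mel spectrogram, MFCC, pitch, spectral descriptors

Extract features from audio signals for use as input to machine learning or deep learning systems. Use individual functions, such as melSpectrogram, mfcc, pitch, and spectralCentroid, or use the audioFeatureExtractor object to create a feature extraction pipeline that minimizes redundant computations. To extract features from audio signals in Simulink®, use blocks such as Mel Spectrogram and MFCC. In a live script, use the Extract Audio Features task to graphically select the features to extract.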
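
The two approaches described above can be sketched as follows. This is a minimal illustration, not a recommended configuration: the synthetic test tone, the 16 kHz sample rate, and the variable names are assumptions made for the example, and all window and overlap settings are left at their defaults.

```matlab
% Assumed input: one second of a 220 Hz tone with a little noise.
fs = 16000;                                   % sample rate in Hz (assumption)
t = (0:fs-1).' / fs;                          % time vector as a column
audioIn = sin(2*pi*220*t) + 0.01*randn(fs,1);

% Approach 1: call individual feature functions.
coeffs   = mfcc(audioIn, fs);                 % cepstral coefficients per frame
f0       = pitch(audioIn, fs);                % fundamental frequency estimates
centroid = spectralCentroid(audioIn, fs);     % spectral centroid per frame

% Approach 2: build one pipeline that computes the enabled features together.
aFE = audioFeatureExtractor( ...
    "SampleRate", fs, ...
    "mfcc", true, ...
    "pitch", true, ...
    "spectralCentroid", true);

features = extract(aFE, audioIn);             % rows are frames, columns are features
idx = info(aFE);                              % maps feature names to column indices

mfccFromPipeline  = features(:, idx.mfcc);
pitchFromPipeline = features(:, idx.pitch);
```

The individual functions and the extractor object each apply their own default analysis settings, so the frame counts and values from the two approaches are not guaranteed to match unless the settings are aligned explicitly.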

Objects

audioFeatureExtractor - Streamline audio feature extraction
ivectorSystem - Create i-vector system (since R2021a)

Live Editor Tasks

Extract Audio Features - Streamline audio feature extraction in the Live Editor

Functions

audioDelta - Compute delta features
designAuditoryFilterBank - Design auditory filter bank
melSpectrogram - Mel spectrogram
cepstralCoefficients - Extract cepstral coefficients
gtcc - Extract gammatone cepstral coefficients, log-energy, delta, and delta-delta
mfcc - Extract MFCC, log-energy, delta, and delta-delta of audio signal
openl3Embeddings - Extract OpenL3 feature embeddings (since R2022a)
vggishEmbeddings - Extract VGGish feature embeddings (since R2022a)
speakerEmbeddings - Extract speaker embeddings from speech (since R2024b)
harmonicRatio - Harmonic ratio
pitch - Estimate fundamental frequency of audio signal
pitchnn - Estimate pitch with deep learning neural network (since R2021a)
spectralCentroid - Spectral centroid for audio signals and auditory spectrograms
spectralCrest - Spectral crest for signals and spectrograms
spectralDecrease - Spectral decrease for audio signals and auditory spectrograms
spectralEntropy - Spectral entropy for signals and spectrograms
spectralFlatness - Spectral flatness for signals and spectrograms
spectralFlux - Spectral flux for audio signals and auditory spectrograms
spectralKurtosis - Spectral kurtosis for signals and spectrograms
spectralRolloffPoint - Spectral rolloff point for audio signals and auditory spectrograms
spectralSkewness - Spectral skewness for signals and spectrograms
spectralSlope - Spectral slope for audio signals and auditory spectrograms
spectralSpread - Spectral spread for audio signals and auditory spectrograms
erb2hz - Convert from equivalent rectangular bandwidth (ERB) scale to hertz
bark2hz - Convert from Bark scale to hertz
mel2hz - Convert from mel scale to hertz
hz2erb - Convert from hertz to equivalent rectangular bandwidth (ERB) scale
hz2bark - Convert from hertz to Bark scale
hz2mel - Convert from hertz to mel scale
phon2sone - Convert from phon to sone
sone2phon - Convert from sone to phon

Blocks

Audio Delta - Compute delta features (since R2022b)
Auditory Spectrogram - Extract mel, Bark, or ERB spectrogram from audio (since R2022a)
Cepstral Coefficients - Extract cepstral coefficients from spectrogram (since R2022b)
Design Auditory Filter Bank - Design frequency-domain auditory filter bank (since R2022a)
Design Mel Filter Bank - Design frequency-domain mel filter bank (since R2022a)
Mel Spectrogram - Extract mel spectrogram from audio (since R2022a)
MFCC - Extract mel-frequency cepstral coefficients from audio (since R2022b)
