Stacked auto-encoders for audio classification

버전 1.0.0 (3.02 KB) 작성자: Arvind

We evaluates the performance of stacked auto-encoders for designing a speech/music classifier on S&S and GTZAN dataset

https://doi.org/10.1016/j.eswa.2022.118041

팔로우

0.0

(0)

다운로드 수: 39

업데이트 날짜: 2023/7/26

라이선스 보기

This work evaluates the performance of stacked auto-encoders based deep neural network for designing a speech/music classifier on S&S and GTZAN dataset using visual features. The hidden layers of the neural network are initially trained in unsupervised manner using auto-encoders and are stacked with the final softmax layer. Different experiments were conducted on time-frequency features derived from Spectrogram and Chromagram. Performances of the combination of stacked auto-encoder and softmax classifier was further compared with traditional classifiers and different deep learning techniques. Best classification accuracy of 93.05% and 94.73% is observed for fused features for S&S and GTZAN datasets respectively.

인용 양식

Arvind (2025). Stacked auto-encoders for audio classification (https://kr.mathworks.com/matlabcentral/fileexchange/132752-stacked-auto-encoders-for-audio-classification), MATLAB Central File Exchange. 검색 날짜: 2025/11/8.

Kumar, Arvind, et al. “Stacked Auto-Encoders Based Visual Features for Speech/Music Classification.” Expert Systems with Applications, vol. 208, Elsevier BV, Dec. 2022, p. 118041, doi:10.1016/j.eswa.2022.118041.

양식 더 보기

MATLAB 릴리스 호환 정보

개발 환경: R2021a

모든 릴리스와 호환

플랫폼 호환성

Windows macOS Linux

태그 태그 추가

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

code.m

버전	게시됨	릴리스 정보
1.0.0	2023/7/26		다운로드

MLA	Kumar, Arvind, et al. “Stacked Auto-Encoders Based Visual Features for Speech/Music Classification.” Expert Systems with Applications, vol. 208, Elsevier BV, Dec. 2022, p. 118041, doi:10.1016/j.eswa.2022.118041.
APA	Kumar, A., Solanki, S. S., & Chandra, M. (2022). Stacked auto-encoders based visual features for speech/music classification. Expert Systems with Applications, 208, 118041. Elsevier BV. Retrieved from https://doi.org/10.1016%2Fj.eswa.2022.118041
BibTeX	@article{Kumar_2022, doi = {10.1016/j.eswa.2022.118041}, url = {https://doi.org/10.1016%2Fj.eswa.2022.118041}, year = 2022, month = {dec}, publisher = {Elsevier {BV}}, volume = {208}, pages = {118041}, author = {Arvind Kumar and Sandeep Singh Solanki and Mahesh Chandra}, title = {Stacked auto-encoders based visual features for speech/music classification}, journal = {Expert Systems with Applications} }