Analysis of cnn-based speech recognition system using raw speech as input. Proceedings of Interspeech, 2015. [PUMA: Automatic CNN, TIMIT raw recognition, signal, speech]
Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition. 2012 IEEE international conference on Acoustics, speech and signal processing (ICASSP), 4277--4280, 2012. [PUMA: CNN, NN-HMM hybrid recognition; speech]
Learning salient features for speech emotion recognition using convolutional neural networks. IEEE Transactions on Multimedia, (16)8:2203--2213, IEEE, 2014. [PUMA: CNN, DES, Emo-DB, MES, SAVEE, analysis, classification discriminative emotion feature learning, recognition, salient speech]
Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition.. Interspeech, 1248--1252, 2013. [PUMA: CNN, adaptation, recognition speaker speech]
Speaker Adaptation of Convolutional Neural Network using Speaker Specific Subspace Vectors of SGMM. 2015. [PUMA: CNN, DNN, SGMM, adaptation fMLLR, recognition, speaker speech subspace vectors,]
Learning spectro-temporal features with 3D CNNs for speech emotion recognition. arXiv preprint arXiv:1708.05071, 2017. [PUMA: 3D CNN, IEMOCAP, Recola, Semaine, convolution corpora, emotion multilingual, multiple recognition, spectrogram, speech]