Publications

Dimitri Palaz, Ronan Collobert, and others. Analysis of cnn-based speech recognition system using raw speech as input. Proceedings of Interspeech, 2015. [PUMA: Automatic CNN, TIMIT raw recognition, signal, speech]

Ossama Abdel-Hamid, Abdel-rahman Mohamed, Hui Jiang, and Gerald Penn. Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition. 2012 IEEE international conference on Acoustics, speech and signal processing (ICASSP), 4277--4280, 2012. [PUMA: CNN, NN-HMM hybrid recognition; speech]

Qirong Mao, Ming Dong, Zhengwei Huang, and Yongzhao Zhan. Learning salient features for speech emotion recognition using convolutional neural networks. IEEE Transactions on Multimedia, (16)8:2203--2213, IEEE, 2014. [PUMA: CNN, DES, Emo-DB, MES, SAVEE, analysis, classification discriminative emotion feature learning, recognition, salient speech]

Ossama Abdel-Hamid, and Hui Jiang. Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition.. Interspeech, 1248--1252, 2013. [PUMA: CNN, adaptation, recognition speaker speech]

B Murali Karthick, Prateek Kolhar, and S Umesh. Speaker Adaptation of Convolutional Neural Network using Speaker Specific Subspace Vectors of SGMM. 2015. [PUMA: CNN, DNN, SGMM, adaptation fMLLR, recognition, speaker speech subspace vectors,]

Jaebok Kim, Khiet P Truong, Gwenn Englebienne, and Vanessa Evers. Learning spectro-temporal features with 3D CNNs for speech emotion recognition. arXiv preprint arXiv:1708.05071, 2017. [PUMA: 3D CNN, IEMOCAP, Recola, Semaine, convolution corpora, emotion multilingual, multiple recognition, spectrogram, speech]