Towards End-To-End Speech Recognition with Recurrent Neural Networks.. ICML, (14):1764--1772, 2014. [PUMA: RNN end-to-end learning, recognition, speech]
Learning salient features for speech emotion recognition using convolutional neural networks. IEEE Transactions on Multimedia, (16)8:2203--2213, IEEE, 2014. [PUMA: CNN, DES, Emo-DB, MES, SAVEE, analysis, classification discriminative emotion feature learning, recognition, salient speech]
Deep convolutional neural networks for large-scale speech tasks. Neural Networks, (64):39--48, Elsevier, 2015. [PUMA: LVCSR deep learning, nets, neural recognition, speech]
Variational Autoencoders for Learning Latent Representations of Speech Emotion. arXiv preprint arXiv:1712.08708, 2017. [PUMA: Emotion IEMOCAP Recognition, auto-encoder, autoencoder, learning, representation variational]