Misc,

Raw audio samples of the CNVVE dataset

R. Hedeshy, R. Menges, and S. Staab.
Dataset, (2024)Related to: CNVVE: Dataset and Benchmark for Classifying Non-verbal Voice Expressions.R. Hedeshy, R. Menges, and S. Staab. Interspeech 2023, August 20-24, 2023. Dublin, Ireland, (2023). doi: 10.21437/Interspeech.2023-201.
DOI: 10.18419/darus-3897

Abstract

This CNVVE Dataset contains raw audio samples encompassing six distinct classes of voice expressions, namely “Uh-huh” or “mm-hmm”, “Uh-uh” or“mm-mm”, “Hush” or “Shh”, “Psst”, “Ahem”, and Continuous humming, e.g., “hmmm.” Audio samples of each class are found in the respective folders. The samples are recorded through a dedicated website for data collection that defines the purpose and type of voice data by providing example recordings to participants as well as the expressions’ written equivalent, e.g., “Uh-huh”. Audio recordings were automatically saved in the .wav format and kept anonymous, with a sampling rate of 48 kHz and a bit depth of 32 bits.This dataset contains a raw version of the samples. A cleaned version of these samples can be found on https://doi.org/10.18419/darus-3898. For more info, please check the paper or feel free to contact the authors for any inquiries.

BibTeX key: hedeshy2024audio
entry type: misc
year: 2024
howpublished: Dataset
affiliation: Hedeshy, Ramin/Universität Stuttgart, Menges, Raphael/Semanux, Staab, Steffen/Universität Stuttgart
orcid-numbers: Hedeshy, Ramin/0000-0001-5854-4033, Menges, Raphael/0000-0002-2112-7065, Staab, Steffen/0000-0002-0780-4154
DOI: 10.18419/darus-3897
note: Related to: CNVVE: Dataset and Benchmark for Classifying Non-verbal Voice Expressions.R. Hedeshy, R. Menges, and S. Staab. Interspeech 2023, August 20-24, 2023. Dublin, Ireland, (2023). doi: 10.21437/Interspeech.2023-201

PUMA

Raw audio samples of the CNVVE dataset

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on