@unibiblio

Raw audio samples of the CNVVE dataset

, , and . Dataset, (2024)Related to: CNVVE: Dataset and Benchmark for Classifying Non-verbal Voice Expressions.R. Hedeshy, R. Menges, and S. Staab. Interspeech 2023, August 20-24, 2023. Dublin, Ireland, (2023). doi: 10.21437/Interspeech.2023-201.
DOI: 10.18419/darus-3897

Abstract

This CNVVE Dataset contains raw audio samples encompassing six distinct classes of voice expressions, namely “Uh-huh” or “mm-hmm”, “Uh-uh” or“mm-mm”, “Hush” or “Shh”, “Psst”, “Ahem”, and Continuous humming, e.g., “hmmm.” Audio samples of each class are found in the respective folders. The samples are recorded through a dedicated website for data collection that defines the purpose and type of voice data by providing example recordings to participants as well as the expressions’ written equivalent, e.g., “Uh-huh”. Audio recordings were automatically saved in the .wav format and kept anonymous, with a sampling rate of 48 kHz and a bit depth of 32 bits.This dataset contains a raw version of the samples. A cleaned version of these samples can be found on https://doi.org/10.18419/darus-3898. For more info, please check the paper or feel free to contact the authors for any inquiries.

Links and resources

Tags