Voice Commands in EU Spanish (Youth)
Interested in this data set?
Overview
Title
Voice Commands in EU Spanish (Youth)
Categories
Speech, Voice Commands
Published Date
September 2019
Description
This data set contains recordings of voice commands without a wake word in Spanish (es_ES) of 51 participants of age 6-14 (e.g., ""Eh, Alexa, cuéntame un chiste.""). The participants recorded their voices remotely using our in-house platform solution, Robson. Each participant has recorded on average 68 utterances (minimum 50, maximum 74). This data set contains the voice command only. The data set of wake words is available in a separate data package. Recordings are grouped by participant IDs and can be mapped to the phrase text. The data set comes with a complete phrase list and speaker metadata.
Data Set Details
Audio Sampling Rate
The number of times the audio is sampled per second, usually measured in kilohertz.
44.1 kHz
Bit Rate
706 kb/s (constant)
Bit Depth
16 bits
Format
WAV
Encoding
pcm_s16le
Channels
1
Data Set Demographics
Country
The country in which the data was collected.
Spain
Language
The language the participant uses in the data set.
Spanish
Region/Dialect
The regional accent that is used in the data set.
Comunidad de Madrid, Galicia, Andalucía, La Rioja, Región de Murcia Cataluña, Comunidad Valenciana, Cantabria, Asturias, País Vasco Navarra, Castilla León, Castilla La Mancha, Islas Baleares
Gender
The gender breakdown of data set participants.
41.2 % Female, 58.8 % Male
Age
The age distribution of data set participants.
6-14 years old, average 11 years old, median 11 years old
Number of Speakers
The total number of different voices in the data set.
51
Audio Sample
Available soon