Voice Commands in Mexican Spanish (Youth)
Interested in this data set?
Overview
Title
Voice Commands in Mexican Spanish (Youth)
Categories
Speech, Voice Commands
Published Date
September 2019
Description
This data set contains recordings of voice commands without a wake word in Mexican Spanish (es_MX) of 51 participants of age 6-14 (e.g., "Alexa, pon temas de CNCO en Spotify"). The participants recorded their voices remotely using our in-house platform solution, Robson. Each participant has recorded on average 66 utterances (minimum 51, maximum 74). This data set contains the voice command only. The data set of wake words is available in a separate data package. Recordings are grouped by participant IDs and can be mapped to the phrase text. The data set comes with a complete phrase list and speaker metadata.
Data Set Details
Audio Sampling Rate
The number of times the audio is sampled per second, usually measured in kilohertz.
44.1 kHz
Bit Rate
706 kb/s (constant)
Bit Depth
16 bits
Format
WAV
Encoding
pcm_s16le
Channels
1
Data Set Demographics
Country
The country in which the data was collected.
Mexico
Language
The language the participant uses in the data set.
Spanish
Region/Dialect
The regional accent that is used in the data set.
Centro, Norte, Sur, Costa
Gender
The gender breakdown of data set participants.
51.0 % Female, 49.0 % Male
Age
The age distribution of data set participants.
6-14 years old, average 10 years old, median 9 years old
Number of Speakers
The total number of different voices in the data set.
51
Audio Sample
Available soon