Data Sets by Language
Browse our off-the-shelf data sets by language. If you don’t see data in the language you’re looking for, contact us now for a quote.
Speech Data
This data set contains recordings of call center conversations in Japanese (ja_JP).
Speech Data
This data set contains up to 1,000 hours of recorded call center conversations in US English (en_US).
Speech Data
Google wake words in US English (en_US) from 103 participants aged 19-68.
Speech Data
Siri wake words and voice commands in US English (en_US) from 103 participants aged 19-68.
Speech Data
Alexa wake words in Mexican Spanish (es_MX) from 106 participants aged 16-65.
Speech Data
500 hours of phone conversations in Japanese (ja_JP).
Speech Data
Alexa wake words in Spanish (es_ES) from 104 participants aged 15-60.
Speech Data
500 hours of phone conversations in Irish English (en_IE).
Speech Data
US English (en_US) wake words using "Siri" from 103 participants aged 19-68.
Speech Data
US English (en_US) voice commands including the wake word "OK Google" from 103 participants aged 19-68.
Speech Data
Voice commands in Canadian French (fr_CA) from 50 participants aged 6-14.
Speech Data
Alexa wake words and voice commands in Canadian French (fr_CA) from 100 participants aged 15-65.
Speech Data
Voice commands in Canadian French (fr_CA) from 100 participants aged 15-65.
Speech Data
Alexa wake words and voice commands in Canadian French (fr_CA) from 50 participants aged 6-14.
Speech Data
Voice commands in Italian (it_IT) from 65 participants aged 6-14.
Speech Data
Voice commands in Italian (it_IT) from 135 participants aged 15-65.
Speech Data
Alexa wake words in Italian (it_IT) from 65 participants aged 6-14.
Speech Data
Alexa wake words in Italian (it_IT) from 135 participants aged 15-65.
Speech Data
Voice commands in Mexican Spanish (es_MX) from 106 participants aged 16-65.
Speech Data
Voice commands in Mexican Spanish (es_MX) from 51 participants aged 6-14.
Speech Data
Alexa wake words in Mexican Spanish (es_MX) from 51 participants aged 6-14.
Speech Data
Alexa wake words in Spanish (es_ES) from 51 participants aged 6-14.
Speech Data
Voice commands in Spanish (es_ES) from 104 participants aged 15-60.
Speech Data
Voice commands in Spanish (es_ES) from 51 participants aged 6-14.
Speech Data
50 hours of phone conversations in Dutch (nl_NL).
Speech Data
Wake word "Alexa" in US English (en_US) from 103 participants aged 19-68.