Alexa Wake Words in Canadian French (Adults)
Interested in this data set?
Overview
Title
Alexa Wake Words in Canadian French (Adults)
Categories
Speech, Wake Words, Alexa
Published Date
September 2019
Description
This data set contains recordings of the wake word "Alexa" in Canadian French (fr_CA) of 100 participants of age 15-65 used in voice commands (e.g., "Alexa, raconte-moi une blague."). The participants recorded their voices remotely using our in-house platform solution, Robson. Each participant has recorded on average 71 utterances (minimum 53, maximum 75) of "Alexa". Recordings are grouped by participant IDs and can be mapped to the phrase text. The data set comes with a complete phrase list and speaker metadata.
Data Set Details
Audio Sampling Rate
The number of times the audio is sampled per second, usually measured in kilohertz.
44.1 kHz
Bit Rate
706 kb/s (constant)
Bit Depth
16 bits
Format
WAV
Encoding
pcm_s16le
Channels
1
Data Set Demographics
Country
The country in which the data was collected.
Canada
Language
The language the participant uses in the data set.
French
Region/Dialect
The regional accent that is used in the data set.
Français québécois - dialectes de lest (région de Québec) et du nord Français québécois - dialectes de louest (Montréal, Sherbrooke, Trois-Rivières) Western provinces French - Saskatchewan, Alberta, British Columbia, Northwest Territories Quebec French - Western Central Dialect (Montreal, Sherbrooke, Trois-Rivières...) Quebec French - Western Central Dialect (Montreal, Sherbrooke, Trois-Rivières...) Quebec French - Quebec City/ Capital Dialect, Northern Dialect, Eastern Dialect Ontario French - North, South Français des provinces de louest - Saskatchewan, Alberta, Colombie-Britannique, Territoires du Nord-Ouest Français ontarien - Nord et sud
Gender
The gender breakdown of data set participants.
65.0 % Female, 35.0 % Male
Age
The age distribution of data set participants.
15-65 years old, average 35 years old, median 33 years old
Number of Speakers
The total number of different voices in the data set.
100
Audio Sample
Available soon