Siri Wake Words and Voice Commands in US English
Interested in this data set?
Overview
Title
Siri Wake Words and Voice Commands in US English
Categories
Speech, Wake Words, Voice Commands, Siri
Published Date
September 2019
Description
This data set contains recordings of voice commands including the wake word "Siri" in US English (en_US) of 103 participants of age 19-68 (e.g., "Hey Siri, turn the volume down"). The participants recorded their voices remotely using our in-house mobile data collection app, Robson. Each participant has recorded 10 utterances. This data set contains the voice command with the wake word "Hey Siri".
Data Set Details
Audio Sampling Rate
The number of times the audio is sampled per second, usually measured in kilohertz.
16kHz
Bit Rate
256 kb/s (constant)
Bit Depth
16 bits
Format
WAV
Encoding
pcm_s16le
Channels
1
Data Set Demographics
Country
The country in which the data was collected.
USA
Language
The language the participant uses in the data set.
English
Region/Dialect
The regional accent that is used in the data set.
US Midwest (25), US Northeast (27), US South (26), US West (25)
Gender
The gender breakdown of data set participants.
51.5 % Female, 48.5 % Male
Age
The age distribution of data set participants.
19-68 average 33.2 years old, at the median 30 years old
Number of Speakers
The total number of different voices in the data set.
103
Audio Sample
Available soon