Siri Wake Words in US English

Interested in this data set?

Overview

Title

Siri Wake Words in US English

Categories

Wake Words

Description

This data set contains recordings of the wake word "Hey Siri" in US English (en_US) of 103 participants of age 19-68.  The participants recorded their voices remotely using our in-house mobile data collection app, Robson. Each participant has recorded 10 utterances of "Hey Siri".

Data Set Details

Audio Sampling Rate

The number of times the audio is sampled per second, usually measured in kilohertz.

16kHz

Bit Rate

256 kb/s (constant)

Bit Depth

16 bits

Format

WAV

Encoding

pcm_s16le

Channels

1

Data Set Demographics

Country

The country in which the data was collected.

USA

Language

The language the participant uses in the data set.

English

Region/Dialect

The regional accent that is used in the data set.

US Midwest (25), US Northeast (27), US South (26), US West (25)

Gender

The gender breakdown of data set participants.

51.5 % Female, 48.5 % Male

Age

The age distribution of data set participants.

19-68 average 33.2 years old, at the median 30 years old

Number of Speakers

The total number of different voices in the data set.

103

Audio Sample

Available soon

If you would like to purchase this data set for yourself, do not hesitate to contact us below.

Request a quote.

    Summa Linguae uses cookies to allow us to better understand how the site is used. By continuing to use this site, you consent to this policy.

    Learn More