Building a contact center AI? Summa Linguae has a catalog of off-the-shelf call center data sets available in a variety of languages. And if we don’t already have the data you need, we can collect it for you.
Call Center Data Collection Services
Summa Linguae Technologies offers pre-packaged or custom-collected call center data to help power your contact center interfaces.
We offer phone conversations, text chat transcripts, or any other unique scenario you may require. And we do more than collection, we can also provide full annotation, classification, and labeling services.
For more information about our full capabilities, explore our Data Solutions.
hours of data
from unstructured call center conversations
languages & dialects
of pre-collected data sets, with the capability to collect more
scrubbed for personally identifiable information, with transcription services available
The following call center data sets are readily available for purchase. Personally identifiable information (PII) has been redacted from all recordings, so these data sets are ready to use for your solution.
Interested in one of the data sets? Just fill out the contact form below and we’ll be in touch for a quotation.
Afrikaans - South Africa
af_ZA / 8 kHz / 400 hours / 1787 files
English - New Zealand
en_NZ / 8 kHz / 464 hours / 1700 files
English - United States
en_US / 8 kHz / 15 hours / 89 files / Topic: Retail
Italian - Italy
it_IT / 8 kHz / 10 hours / 262 files
Spanish - Argentina
es_AR / 8 kHz / 420 hours / 3570 files
Polish - Poland
pl_PL / 8 kHz / 172 hours / 3799 files / Topic: Medical bookings
Polish - Poland
pl_PL / 8 kHz / 265 hours / 16763 files / Topic: Outbound B2B sales calls
Portuguese - Brazilian
pt_BR / 8 kHz / 431 hours / 2748 files
Summa Linguae Technologies is a trusted partner for custom speech data collection. We’ve helped many of the world’s top voice developers create voice assistants, chatbots, in-car speech recognition technology, and more.
Manager of Research Data, Nuance Communications
Summa Linguae Technologies has provided exceptional services to the Data Collection team at Nuance Communications, Inc. They have supervised large scale data collection simultaneously in three different countries, consistently delivering quality data on or ahead of schedule. And this was done twice in short order – in Europe and in Asia. Our continuing relationship with Summa Linguae is a great asset to the company.
Contact us for a quote
Tell us about your call center data needs and we’ll get back to you shortly for a consultation.