Speech datasets tailored to your model requirements

Tell us the languages, accents, speaker profiles, recording setup, speech type, transcript format, and metadata you need. We recruit, record, QA, transcribe, and deliver the dataset in the structure your team needs.

5.0 on Datarade Trustpilot SKI Verified Supplier

Most teams do not just need more audio. They need the right speakers, the right speech, and the right structure.

Spirelight builds that dataset.

Recruitment and recording workflow

Recruit the right speakers and collect recordings in one controlled flow.

We source contributors by language, dialect, age, gender, region, device, or other project criteria. Speakers record through browser-based tools with prompts, consent, audio checks, and project guidelines built into the workflow.

Live project tracking

Track collected hours, QA status, and delivery batches while the project runs.

Review progress during production and receive validated batches with audio, transcripts, metadata, and manifests. Your team can test early batches and adjust the collection before the full dataset is finished.

What we do

Speech datasets tailored to your requirements

Need Danish conversations, German call-center speech, French dialect coverage, wake words, commands, or audio plus video recordings? We design the collection and annotation workflow around your data spec, from speaker recruitment to final delivery.

spirelight · session
Live session · Iberian Spanish

Scripted monologues, dialect-tagged

Recording
  • MRMaria · Madrid · 32Done
  • JPJavier · Sevilla · 41Recording
  • ALAna · Bilbao · 27Queued
WAV · 48 kHz · stereo Prompt set 02 / 12
spirelight · transcript
en-IE_002_dialogue_03.json QA · 2 reviewers
  1. 00:00.42 S1 Could you walk me through the booking flow you used last Tuesday?
  2. 00:03.10 S2 Sure, I opened the app, tapped the search bar, then… flagged
  3. 00:06.94 S1 Got it. Any pauses or hesitations there?
  4. 00:09.38 S2 Yeah, [pause 1.2s] I had to scroll to find the right date.
Word-level timestamps · Speaker-aware · Diarized
manifest.json
{
"project": { 3 fields }, {
"id": "sl-9241",
"language": "es-ES",
"hours": 3000
},
"audio": { 3 fields }, {
"format": "wav",
"sample_rate": 48000,
"channels": 2
},
"transcripts": { click to expand }, {
"format": "jsonl",
"timestamps": "word",
"diarized": true
},
"delivery": { click to expand } {
"channel": "s3-bucket",
"checksums": "sha256",
"batches": true
}
}
01

Speech collection

We collect monologues, dialogues, wake words, commands, scripted prompts, roleplays, and natural conversations with speakers matched to your language and profile requirements.

  • Remote or on-site recording
  • Monologues, dialogues, commands, and roleplays
  • Audio-only or synchronized audio plus video
02

Transcription and annotation

We deliver machine-assisted or human-validated transcripts with timestamps, speaker labels, domain terminology, and annotation rules matched to your model training needs.

  • Word-level or segment-level timestamps
  • Speaker labels and dialogue structure
  • Human review based on your QA criteria
03

Dataset delivery

We package audio, transcripts, metadata, consent references, QA notes, and manifests in the format your engineering team needs.

  • WAV, JSON, JSONL, CSV, or custom formats
  • Metadata schemas matched to your spec
  • Bucket transfer, API handoff, or batch delivery
Why Spirelight

Collect the speech data your model is missing

Four reasons teams use Spirelight for custom speech data collection.

Spirelight

Studio recording session

2 hours 500 EUR Madrid Customer hardware For native Iberian Spanish speakers
Your kit, our team.

We can run the session on your hardware so the data matches the production room your product ships into.

Before you arrive
  • Native Iberian Spanish speaker, recruited locally
  • 14 sentence prompts, ~30 minutes recording
  • Recorded with customer-supplied hardware
  • Calibrated to your reference setup
  • 500 EUR after completion
1. Choose a location
Calle de Alcalá 123 Open in Maps ↗
2. Pick a date
April 2026
SuMoTuWeThFrSa 293031 1234 567891011 12131415161718 19202122232425 2627282930
LIVE Iberian Spanish · Day 7 of 14
Auto-adjust on
Hours collected 1,203/3,000
Dialect test accuracy 88.6% ▲ 11.3% since adjustment
Eval accuracy over project days
Recent adjustments
  • Day 4 · 10:42 Gap detected: Jutlandic dialect underrepresented
  • Day 4 · 11:15 +247 prompts deployed to Jutlandic speakers
  • Day 6 · 14:30 Accuracy ↑ 4.2% on dialect test set
01 / 04

Recruit speakers by language, dialect, and profile.

Define the speakers you need by language, dialect, region, age, gender, device, environment, or other project criteria.

Contributor recruitment across 50+ languages and 30+ markets.

Controlled recording setups when quality matters.

We can run remote, on-site, or studio-style sessions using defined microphones, devices, rooms, scripts, and acoustic requirements.

From single-speaker sessions to multi-day collection projects.

Review early batches and adjust the project as it runs.

Your team can test early deliveries, identify gaps, and update speaker targets, prompts, or guidelines before the full dataset is complete.

Mid-project adjustments without restarting production.

Strong coverage in Nordic and harder-to-source European languages.

We regularly recruit and review speakers in markets where off-the-shelf datasets are limited, including Nordic languages, regional dialects, and smaller European language varieties.

Recruiters and reviewers across Denmark, Sweden, Norway, Finland, and Iceland.

Your speech data partner for

fine-tuning and language expansion.

We combine contributor recruitment, recording workflows, transcription, QA, and delivery so your team can expand into new languages without building local operations from scratch.

Use cases

Speech data for common voice AI use cases

The workflow is similar across projects, but the speakers, prompts, recording conditions, annotations, and deliverables change with each use case.

Wake words and commands

Wake words and voice commands.

Collect commands, activation phrases, device instructions, and short utterances across languages, accents, microphones, and environments.

Multilingual

Multilingual ASR and TTS expansion.

Build language, accent, and dialect coverage for speech recognition, synthetic voice, and speech evaluation datasets.

Voice agents and emotion

Voice agents and emotion-aware AI.

Collect conversations, roleplays, customer service scenarios, emotional speech, and domain-specific interactions for more natural voice systems.

Team

The team running your data collection project

Spirelight combines commercial project design, recruitment operations, platform engineering, transcription workflows, QA, and delivery management in one team.

01 / 06

Andreas Kromann

CEO · Commercial lead

Andreas works with clients to turn model requirements into concrete data collection projects.

He defines the project scope, speaker targets, recruitment approach, and delivery expectations before production starts.

Emil Thorsson

CFO · Operations

Emil supports operations, documentation, compliance coordination, and project delivery.

He helps structure the process so recruitment, consent, production, and handoff stay aligned.

Gustav Aggeboe

CTO · Platform architecture

Gustav leads the platform architecture behind Spirelight.

He builds the systems used to manage recording, transcription, QA, metadata, contributor workflows, and dataset delivery.

Joyi Ulfat

Senior Project Manager

Joyi manages project execution across contributors, reviewers, and delivery teams.

She keeps production moving, follows up on daily progress, and helps ensure each project meets its agreed requirements.

Mateo Thelen

Project Manager

Mateo coordinates contributors, recording workflows, and production tasks.

He helps translate project requirements into daily execution and keeps the different parts of the workflow aligned.

Pekka Larjovuori

Crowd Source Expert

Pekka supports recruitment strategy and contributor operations.

He helps source speakers for projects with specific language, dialect, regional, or profile requirements.

Get started

Tell us what training data you need

Tell us the languages, speech type, speakers, recording setup, transcript format, and metadata you need. We return within 48 hours with an initial workflow and data plan.

10,000+
contributors in our recruitment network
50+
languages and dialects recruited for
QA
workflows on every project
48h
target response for project briefs