Speech Data Company

Custom speech data for teams building voice products

We run speech recording, transcription, and dataset delivery workflows tailored to your technical requirements, from targeted recruitment to structured output.

Book a Call · See Capabilities

Most voice products serve fewer than 30 of the world's 7,000+ languages.

Spirelight closes that gap.

Spirelight now

Danish recording proposal · 14 sentence prompts · 30 minutes · paid

Spirelight 2m ago

You've been invited to record Jutlandic dialect · in-browser session

All in one platform

The contributors and software you need, all in one place.

Vetted speakers in the right languages, dialects, and age bands record straight from our browser-based capture tools. No extra apps, no third-party software, no fragmented workflow.

Project · live · Iberian Spanish · Scripted monologues

Human validated

Hours delivered 0 / 3,000 hrs

WAV 16 kHz · 14 dialects · 64 contributors · JSONL transcripts

Batch 04 ready · 312 hrs · validated · 4.2 GB

Live progress, batch downloads

Live progress, validated formats, and batch downloads as you go.

Watch every hour land, then pull validated batches straight into your pipeline whenever you need them. No waiting for final delivery to start training.

spirelight · session

Live session · Iberian Spanish

Scripted monologues, dialect-tagged

Recording

Maria · Madrid · 32 · Done
Javier · Sevilla · 41 · Recording
Ana · Bilbao · 27 · Queued

WAV · 48 kHz · stereo · Prompt set 02 / 12

spirelight · transcript

en-IE_002_dialogue_03.json · QA · 2 reviewers

00:00.42 S1 Could you walk me through the booking flow you used last Tuesday?
00:03.10 S2 Sure, I opened the app, tapped the search bar, then… [flagged]
00:06.94 S1 Got it. Any pauses or hesitations there?
00:09.38 S2 Yeah, [pause 1.2s] I had to scroll to find the right date.

Word-level timestamps · Speaker-aware · Diarised
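As a rough illustration of how the transcript view above could be turned into structured records, here is a minimal Python sketch. It assumes the on-screen line format shown above (`MM:SS.ss`, a speaker label such as `S1`, then the text, with pauses marked as `[pause 1.2s]`); the delivered JSONL schema is not specified here, and the function name and output keys are hypothetical.

```python
import re

# Pattern for the display format shown above: "MM:SS.ss S<n> text".
LINE_RE = re.compile(r"^(\d{2}):(\d{2}\.\d{2})\s+(S\d+)\s+(.*)$")
# Inline pause annotations such as "[pause 1.2s]".
PAUSE_RE = re.compile(r"\[pause (\d+(?:\.\d+)?)s\]")


def parse_transcript_line(line):
    """Parse one display line into a dict (hypothetical keys), or None if it does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    minutes, seconds, speaker, text = match.groups()
    return {
        "start": int(minutes) * 60 + float(seconds),  # start time in seconds
        "speaker": speaker,
        "text": text,
        "pauses": [float(p) for p in PAUSE_RE.findall(text)],  # pause lengths in seconds
    }


record = parse_transcript_line(
    "00:09.38 S2 Yeah, [pause 1.2s] I had to scroll to find the right date."
)
```

A parser like this is only useful for quick inspection; for training pipelines the word-level timestamps in the delivered files would be the better source.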

manifest.json

{
  "project": {
    "id": "sl-9241",
    "language": "es-ES",
    "hours": 3000
  },
  "audio": {
    "format": "wav",
    "sample_rate": 48000,
    "channels": 2
  },
  "transcripts": {
    "format": "jsonl",
    "timestamps": "word",
    "diarised": true
  },
  "delivery": {
    "channel": "s3-bucket",
    "checksums": "sha256",
    "batches": true
  }
}
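The `audio` fields in the manifest above lend themselves to a simple intake check on the receiving side. The following Python sketch compares a WAV file header with those fields using the standard-library `wave` module. The function name `check_wav` and the idea of running this check on ingest are assumptions for illustration, not a description of Spirelight's tooling.

```python
import wave

# Audio section copied from the manifest.json example above.
MANIFEST_AUDIO = {"format": "wav", "sample_rate": 48000, "channels": 2}


def check_wav(path, audio_spec=MANIFEST_AUDIO):
    """Compare a WAV header with the expected spec and return a list of mismatches."""
    problems = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
    if rate != audio_spec["sample_rate"]:
        problems.append(f"sample_rate: expected {audio_spec['sample_rate']}, got {rate}")
    if channels != audio_spec["channels"]:
        problems.append(f"channels: expected {audio_spec['channels']}, got {channels}")
    return problems
```

An empty list means the file matches the manifest; anything else can be logged or used to reject the file before it reaches training.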

01

Speech collection

Remote or on-site recording projects with targeted contributors, configurable prompts, and controlled capture flows.

On-site or remote capture
Dual-channel and dialogue setups
Audio plus video when needed

02

Transcription

Manual, machine-assisted, or hybrid transcription with review layers, timestamps, and speaker-aware structure.

Word-level timestamps
Reviewer sampling during production
Domain terminology handling

03

Dataset delivery

Audio, transcripts, metadata, and manifests packaged to match your pipeline.

JSON and manifest delivery
Custom metadata schemas
Bucket transfer or API handoff

Why Spirelight

More specific than off-the-shelf data

Targeted Crowd Recruitment

Source speakers by language, dialect, location, age, gender, or project-specific criteria.

Flexible Production Setup

Configure monologues, dialogues, dual-channel capture, audio plus video, or structured prompt flows depending on the task.

Collection & Transcription Together

Avoid workflow fragmentation by handling recording, transcription, QA, and packaging in one production chain.

In-Production Quality Control

Review files while the project is running and catch issues before they become delivery problems.

Fast Scaling When Needed

Built for projects that need to move fast without becoming generic.

European Language Strength

A strong fit for multilingual and dialect-sensitive projects across European markets.

01 · Flexible Collection: Remote & on-site recording workflows
02 · Audio Engineering: Technical capture & dual-channel
03 · Quality Assurance: Human-in-the-loop review layers
04 · Technical Delivery: Structured output & manifest packaging

Speech Collection Workflow

Workflow: Collection

01

Remote & On-site Collection

We source speakers by language, dialect, location, age, and gender. Whether it's controlled recording on-site or distributed remote capture, we handle the recruitment and execution.

› Remote recording workflows
› On-site recording setups
› Monologues and dialogues
› Custom prompts and scenarios

Audio Engineering

Mode: Audio Capture

02

Technical Audio Engineering

Built for technical speech requirements. We configure monologues, dialogues, dual-channel capture, and noise environment checks before any recording begins.

› Dual-channel capture
› Speaker-separated recordings
› Audio plus video capture
› Hardware checks before recording

Quality Assurance

Process: Verification

03

Human-in-the-Loop QA

Manual, machine-assisted, or hybrid transcription with multiple review layers. We catch issues while the project is running, not at the end.

› Human review layers
› Machine-assisted transcription
› Word-level timestamps
› Domain terminology handling

Structured Delivery

Format: Deployment

04

Audio, transcripts, and metadata packaged to match your pipeline. We deliver via bucket transfer or direct API handoff with full manifest validation.

› Manifest and checksum packaging
› Bucket delivery or API handoff
› Custom metadata schemas
› JSON and manifest delivery
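On the receiving end, checksum packaging can be verified with a few lines of Python. The sketch below assumes SHA-256 digests, as named in the manifest example, and a hypothetical mapping of relative file paths to hex digests; the actual layout of a delivered checksum list is not specified here, and both function names are illustrative.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large audio files do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_batch(batch_dir, expected):
    """Check files against expected digests.

    `expected` maps relative path -> hex digest (hypothetical layout).
    Returns a list of (path, reason) tuples; an empty list means all files passed.
    """
    failures = []
    for rel_path, want in expected.items():
        file_path = Path(batch_dir) / rel_path
        if not file_path.is_file():
            failures.append((rel_path, "missing"))
        elif sha256_of(file_path) != want.lower():
            failures.append((rel_path, "checksum mismatch"))
    return failures
```

Running a check like this on each batch as it is downloaded catches truncated or corrupted transfers before the data enters a training run.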

Use Cases

Designed around your actual requirements

Automotive Voice

Command phrases, in-car scenarios, multilingual prompt sets, and structured dialogue data for voice interfaces across regions and accents.

Wake Word Datasets

Trigger word collection across demographic groups with controlled recording conditions and environmental variation.

Multilingual Assistants

Cross-language training data for virtual assistants covering multiple European languages and regional variants.

Call Simulation

Dialogue capture with role-play scenarios, separated speakers, and real conversational variation for customer interaction systems.

Accessibility Research

Combined audio and video capture with controlled consent flows and structured research delivery for assistive technology.

STT Evaluation

Domain-specific audio with timestamped transcription output for speech-to-text system testing and improvement.

TTS Datasets

Controlled scripts, expressive prompts, higher fidelity requirements, and linked speaker metadata for text-to-speech training.

Dialect Coverage

Language coverage projects targeting specific dialect regions with metadata-rich speaker profiles and geographic targeting.

Track Record

Projects we have delivered

Multilingual Production

Large multilingual dialogue collection

Type: Dialogue recording
Complexity: 5+ languages, tight timeline
Handled: Recruitment, recording, transcription, QA
Delivery: Structured JSON + audio bundles

Controlled Recording

On-site controlled recording workflow

Type: On-site capture
Complexity: Hardware control, environment specs
Handled: Setup, capture, quality gates, packaging
Delivery: Dual-channel WAV + metadata CSV

High-Volume Pipeline

Transcription and QA at scale

Type: Transcription pipeline
Complexity: Multi-reviewer, domain-specific
Handled: Transcription, review layers, consistency
Delivery: Timestamped JSON + manifests

Team

The team behind the projects

Andreas Kromann

CEO

Commercial lead and project design

Emil Thorsson

CFO

Operations, compliance coordination, and delivery support

Gustav Aggeboe

CTO

Platform architecture and technical implementation

Joyi Ulfat

Senior Project Manager

Production oversight and project execution

Mateo Thelen

Project Manager

Coordination of contributors, workflows, and delivery steps

Pekka Larjovuori

Crowd Source Expert

Recruitment strategy and crowd operations

Get Started

Need a speech dataset that matches your real requirements?

Tell us what you need to collect, how it should be structured, and where the difficult parts are. We will help scope the workflow.

Book a Call · Send Project Brief

Or email us directly at hello@spirelight.net

10,000+

Verified contributors across Europe

50+

Languages & dialects captured

100%

Human QA on every project

48h

Average response time to a project brief