A speech data company built around custom collection

We design custom speech datasets for voice AI teams whose products have to handle languages, dialects, speakers, and recording conditions that standard datasets do not cover.

Why we exist

Speech data projects break down in predictable places.

Recruitment that does not match the spec. Capture quality that varies too much. Workflows that fragment across vendors. Delivery structures that do not fit the client's pipeline.

Spirelight is built around fixing those problems. Not as a general annotation company, but as a focused partner for teams that need custom, harder-to-source speech datasets with controlled recording, transcription, QA, and delivery.

The deliverable is not volume. It is the right speakers, the right speech, the right structure, and a single workflow that runs from brief to delivery.

How we work

Operational philosophy

Three principles that shape every project we run, from first brief to final delivery.

01

Specificity over scale

We would rather deliver exactly what you need than overwhelm you with volume that does not match the spec. Every project is scoped to your real requirements.

02

Catch problems early

Quality control runs during production, not only after delivery. Reviewers sample files while recording is live. Issues are flagged before they multiply.

03

One workflow, not three vendors

Collection, transcription, QA, and packaging happen in one coordinated chain. No handoff gaps. No format translation between vendors.

Team

The team running your data collection project

Spirelight combines commercial project design, recruitment operations, platform engineering, transcription workflows, QA, and delivery management in one team.

01 / 06

Andreas Kromann

CEO · Commercial lead

Andreas works with clients to turn model requirements into concrete data collection projects.

He defines the project scope, speaker targets, recruitment approach, and delivery expectations before production starts.

Emil Thorsson

CFO · Operations

Emil supports operations, documentation, compliance coordination, and project delivery.

He helps structure the process so recruitment, consent, production, and handoff stay aligned.

Gustav Aggeboe

CTO · Platform architecture

Gustav leads the platform architecture behind Spirelight.

He builds the systems used to manage recording, transcription, QA, metadata, contributor workflows, and dataset delivery.

Joyi Ulfat

Senior Project Manager

Joyi manages project execution across contributors, reviewers, and delivery teams.

She keeps production moving, follows up on daily progress, and helps ensure each project meets its agreed requirements.

Mateo Thelen

Project Manager

Mateo coordinates contributors, recording workflows, and production tasks.

He helps translate project requirements into daily execution and keeps the different parts of the workflow aligned.

Pekka Larjovuori

Crowd Source Expert

Pekka supports recruitment strategy and contributor operations.

He helps source speakers for projects with specific language, dialect, regional, or profile requirements.

Get started

Tell us what training data you need

Tell us the languages, speech type, speakers, recording setup, transcript format, and metadata you need. We return within 48 hours with an initial workflow and data plan.

10,000+
contributors in our recruitment network
50+
languages and dialects recruited for
6
in-house roles in one workflow
48h
target response for project briefs