[ LocaleNLP / ORAOX Platform ]

The community that trains the models.

ORAOX is the data engine behind LocaleNLP: a gamified platform where native speakers across Africa and the Arab world ingest, validate, and certify language data. Contributors earn XP, climb leaderboards, and build the training corpora that power AfriLION models.

1,200+ Contributors
9 Countries
CC-BY-SA License

How it works

Three steps. Infinite data.

01
Ingest
Every utterance matters.

Contributors upload audio clips, text snippets, or code-switched utterances in their native language. ORAOX accepts voice recordings from any device — smartphone, browser microphone, or file upload. Each submission is timestamped, geotagged by region (not GPS), and language-labelled by the contributor.

  • Audio, text, and code-switch submissions
  • Mobile-first upload — no app required
  • Contributor self-labels language & dialect
  • Automatic quality pre-screening (SNR, duration)
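
The pre-screening step above could look roughly like the following sketch. The thresholds, the function names, and the frame-based SNR estimate are all assumptions made for illustration; the page only states that SNR and duration are checked, not how.

```python
import math
from typing import List, Sequence

# Hypothetical thresholds: ORAOX's real limits are not published on this page.
MIN_DURATION_S = 1.0
MAX_DURATION_S = 30.0
MIN_SNR_DB = 15.0


def rms(samples: Sequence[float]) -> float:
    """Root-mean-square amplitude of a block of samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def estimate_snr_db(samples: Sequence[float], sample_rate: int, frame_ms: int = 20) -> float:
    """Crude SNR estimate: loudest 10% of frames vs. quietest 10% of frames."""
    frame_len = max(1, sample_rate * frame_ms // 1000)
    frames = [samples[i:i + frame_len] for i in range(0, len(samples), frame_len)]
    # Ignore a trailing partial frame so every level covers the same time span.
    levels = sorted(rms(f) for f in frames if len(f) == frame_len)
    if not levels:
        return 0.0
    k = max(1, len(levels) // 10)
    noise = sum(levels[:k]) / k
    signal = sum(levels[-k:]) / k
    if noise == 0.0:
        return math.inf
    return 20.0 * math.log10(signal / noise)


def prescreen(samples: Sequence[float], sample_rate: int) -> List[str]:
    """Return a list of rejection reasons; an empty list means the clip passes."""
    reasons = []
    duration_s = len(samples) / sample_rate
    if duration_s < MIN_DURATION_S:
        reasons.append("too_short")
    elif duration_s > MAX_DURATION_S:
        reasons.append("too_long")
    if estimate_snr_db(samples, sample_rate) < MIN_SNR_DB:
        reasons.append("low_snr")
    return reasons
```

A clip that passes returns an empty list and would move on to the validation queue. Returning the reasons rather than a bare boolean lets the upload screen tell the contributor what to fix.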
02
Validate
Human-in-the-loop at every step.

Each submission enters a validation queue where other contributors — fluent in the same language — review it: approve if the transcription is correct, flag if the audio is noisy or wrong, or suggest a correction. Three independent approvals are required before a clip is certified for training.

  • 3-approval threshold before certification
  • Flag → review → reject pipeline
  • Expert linguist spot-check layer (1 in 50)
  • Full consent and deletion rights for submitters
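
As a minimal sketch, the certification rule above can be modelled as a small state machine. The three-approval threshold and the 1-in-50 spot-check rate come from the description above. Everything else is an assumption for illustration: the names, the rule that a single flag sends a clip to review, the ban on self-validation, and the hash-based sampling.

```python
import zlib
from dataclasses import dataclass, field
from typing import Set

APPROVALS_REQUIRED = 3   # stated above: three independent approvals
SPOT_CHECK_ONE_IN = 50   # stated above: expert spot-check, 1 in 50


@dataclass
class Submission:
    submission_id: str
    submitter_id: str
    approvers: Set[str] = field(default_factory=set)
    flaggers: Set[str] = field(default_factory=set)
    status: str = "pending"  # pending -> certified, or pending -> in_review


def record_vote(sub: Submission, validator_id: str, action: str) -> str:
    """Apply one validator action and return the submission's new status."""
    if sub.status != "pending":
        return sub.status
    # Assumption: submitters may not validate their own clips.
    if validator_id == sub.submitter_id:
        raise ValueError("submitters cannot validate their own clips")
    if action == "approve":
        # A set keeps approvals independent: repeat votes count once.
        sub.approvers.add(validator_id)
    elif action == "flag":
        # Assumption: a single flag is enough to trigger review.
        sub.flaggers.add(validator_id)
        sub.status = "in_review"
    else:
        raise ValueError(f"unknown action: {action}")
    if sub.status == "pending" and len(sub.approvers) >= APPROVALS_REQUIRED:
        sub.status = "certified"
    return sub.status


def needs_spot_check(submission_id: str) -> bool:
    """Deterministically select roughly 1 in 50 clips for expert review."""
    return zlib.crc32(submission_id.encode("utf-8")) % SPOT_CHECK_ONE_IN == 0
```

Hashing the submission ID instead of drawing a random number means the same clip always gets the same answer, which makes the spot-check selection auditable after the fact.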
03
Gamify
Infrastructure built on intrinsic motivation.

Contributors earn XP for every approved submission and validation action. Validators whose decisions consistently agree with the expert layer earn bonus multipliers. The leaderboard resets monthly, which keeps competition fresh and discourages XP farming. Top contributors are credited in dataset releases.

  • XP for submissions, validations, and streaks
  • Expert-alignment bonus multiplier
  • Monthly leaderboard resets
  • Named credit in CC-BY-SA dataset releases
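
One way to picture the monthly reset is to bucket XP by calendar month, so a new month starts from an empty board without deleting earlier results. This is only a sketch under that assumption. The class name, method names, and tie-breaking rule are hypothetical, not ORAOX's actual implementation.

```python
from collections import defaultdict
from datetime import date
from typing import Dict, List, Tuple


class Leaderboard:
    """XP totals keyed by (year, month); each month has its own board."""

    def __init__(self) -> None:
        self._xp: Dict[Tuple[int, int], Dict[str, int]] = defaultdict(
            lambda: defaultdict(int)
        )

    def award(self, contributor: str, xp: int, day: date) -> None:
        """Credit XP to the board for the month in which it was earned."""
        self._xp[(day.year, day.month)][contributor] += xp

    def top(self, year: int, month: int, n: int = 10) -> List[Tuple[str, int]]:
        """Top-n contributors for one month, highest XP first.

        Ties are broken alphabetically (an assumption) so output is stable.
        """
        board = self._xp.get((year, month), {})
        return sorted(board.items(), key=lambda kv: (-kv[1], kv[0]))[:n]
```

Because past months stay in place, the top-10 list for a finished month can still be read back when the dataset release notes are written.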
Community

The contributors building the stack.

Rankings reset monthly. Top contributors are named in dataset release notes.

Monthly Leaderboard: March 2026

XP Earning Table

Action                             Reward
Submit audio clip (accepted)       +50 XP
Approve a clip (expert-aligned)    +20 XP
Flag a clip (expert-agreed)        +15 XP
7-day submission streak            +200 XP
Expert-alignment bonus (×1.5)      multiplier
Monthly top-10 placement           Named credit
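
The XP values above can be combined into a monthly total along these lines. The point values and the ×1.5 multiplier are taken from the table. How the multiplier is applied is not stated, so this sketch assumes it scales per-action XP only (not the streak bonus) and rounds down. The function and key names are hypothetical.

```python
import math
from typing import Iterable

# Point values copied from the XP Earning Table.
XP_PER_ACTION = {
    "submit_accepted": 50,
    "approve_aligned": 20,
    "flag_agreed": 15,
}
STREAK_BONUS_XP = 200      # per completed 7-day submission streak
EXPERT_MULTIPLIER = 1.5    # expert-alignment bonus


def monthly_xp(
    actions: Iterable[str],
    streaks_completed: int = 0,
    expert_bonus: bool = False,
) -> int:
    """Total XP for one contributor in one month."""
    action_xp = sum(XP_PER_ACTION[a] for a in actions)
    if expert_bonus:
        # Assumption: the multiplier applies to action XP only, rounded down.
        action_xp = math.floor(action_xp * EXPERT_MULTIPLIER)
    return action_xp + streaks_completed * STREAK_BONUS_XP
```

For example, two accepted clips and one expert-aligned approval total 120 XP. With the expert-alignment bonus that becomes 180 XP, and one completed streak brings it to 380 XP.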
Beta Access

ORAOX is in invite-only beta. Priority access goes to native speakers of Hausa, Wolof, Darija, Amharic, Swahili, Yoruba, Igbo, and Arabic dialect variants.

Request early access →
[ ORAOX / Status: Beta ]

Speak a low-resource language?

Your voice is infrastructure. Every clip you validate closes the gap between your language and the AI models that will serve your community for decades.