Ruka hadi maudhui makuu
Community handbook

Waraka Playbook

A community handbook for building African-language datasets.

A practical, open guide to collecting, annotating, and releasing high-quality data for African languages — across text, speech, and vision — written and maintained by the community that speaks them.

3
modalities — text, speech, vision
6
languages
Open
source & community-owned
Open & community-owned

Read it, use it, help build it.

The Waraka Playbook is free to read online and open to contributions. Fixing an error, translating a page, or sharing what worked on a real project all count.