A curated crowdsourced dataset of Luganda and Swahili speech for text-to-speech synthesis
Mendeley Data

This data article describes a curated, crowdsourced speech dataset in Luganda and Kiswahili, created to support text-to-speech (TTS) development in low-resource settings. The dataset is derived from Mozilla’s Common Voice corpus and includes only validated utterances from female speakers. A multi-st...

Bibliographic Details
Main Authors:	Andrew Katumba, Sulaiman Kagumire, Joyce Nakatumba-Nabende, John Quinn, Sudi Murindanyi
Format:	Article
Language:	English
Published:	Elsevier 2025-10-01
Series:	Data in Brief
Subjects:	Speech dataset; Text-to-speech; Low-resource languages; Luganda; Kiswahili
Online Access:	http://www.sciencedirect.com/science/article/pii/S2352340925006390
