Crowdsourced high-quality Sinhala [si-lk] multi-speaker speech dataset


This dataset was collected for speech technology research from native Sinhala speakers who volunteered to supply the data. The audio is high quality (48kHz, 16 bit, mono, Wave audio), recorded in a quiet environment.

Some quality checks have been done on the data, but there might still be mistranscriptions or artifacts in the audio.

License - CC BY 4.0

Authors - Unknown

Language - Sinhala

Reference- https://research.google/tools/datasets/sinhala-tts/

http://openslr.org/30/