{% extends "app/base_generic.html" %} {% load static %} {% block title %}TTS Datasets for African languages{% endblock %} {% block content %}

The AfricanVoices corpus is a speech corpus containing datasets of aligned sentences and audio for 11 languages. We have uploaded data for {{ num_languages }} different languages in this website so far. We obtain the datasets in three ways:

Datasets

{# #} {# #} {% for dataset in datasets %} {% if not dataset.lang.lang_wikipedia_url %} {% else %} {% endif %} {% if dataset.pass1_mcd <= 6 %} {% elif dataset.pass1_mcd <= 7 %} {% elif dataset.pass1_mcd <= 8 %} {% else %} {% endif %} {% if not dataset.data_location %} {% else %} {% endif %} {% endfor %}
Data_id Lang code Language Source Speaker No. of sentences Hrs MCD* Quality rfs #utt rfs #mcd Download
{{ dataset.data_id }} {{ dataset.lang.lang_code_639_2 }}{{ dataset.lang.lang_name }} {{ dataset.lang.lang_name }} {{ dataset.source }} {{ dataset.speaker_gender }} {{ dataset.pass1_utt }} {{ dataset.duration|floatformat:2 }} {{ dataset.pass1_mcd|floatformat:2 }} Good Okay Bad}Something is wrong **UnavailableAvailable
* "MCD is a distortion measure, comparing synthesized examples with originals. Smaller is better. For TTS, less than 5 is probably good, less than 6 is probably fine, greater than 6 is possibly bad (but still statistically useful), greater than 7 probably indicates something is wrong. For alignment, MCDs seem to be about 1 larger than for TTS (TTS only uses the best examples, and uses a much more complex prediction model)." Alan W Black
For data from Open.Bible and Bible.is, the MCD reported is for alignment while for the rest the MCD reported is for TTS.

** Suba data is unavailable because the license doesn't allow us to redistribute. You can download it from the Faith Comes by Hearing website for your personal use.

{% endblock %}