Source code and adittional results of INTERSPEECH 2025 paper 'A Dataset for Automatic Assessment of TTS Quality in Spanish'*

*(link to paper coming soon)

This repository stores the code related to the development of INTERSPEECH 2025 paper 'A Dataset for Automatic Assessment of TTS Quality in Spanish'. The most relevant file can be found at analysis/subjective_results_corrected.csv. This file contains the results of the subjective evaluations of text-to-speech systems—a total of 4,326 unique audio ratings provided by 92 evaluators. The structure of this file is as follows:

participant_id: Unique identifier for the evaluator.
age, gender_participant, country, province: Sociodemographic data of the evaluator.
education: Level of familiarity with synthetic voices.
stimuli: Code of the evaluated audio.
stimuli_service: Service that generated the audio.
gender_stimuli: Gender of the synthetic voice.
dialect: Dialect of the synthetic voice.

Repository Structure

The repository is divided into the following main folders:

analysis: Contains preprocessing and statistical analysis scripts for the data collected in the subjective test.
backend: Contains scripts for the database and web server.
classifier: Contains training and evaluation scripts for the DenseMOS classifiers developed in the thesis, including the training, validation, and test splits used.
embeddings: Contains scripts for extracting embeddings from the evaluated audios.
frontend: Contains scripts for the web interface.
NISQA: Link to the NISQA repository.
NISQA_analysis: Contains fine-tuning and evaluation scripts for NISQA models.
write-up: Contains reference files and figures used in the thesis, as well as a .zip file with the LaTeX code of the thesis.

Evaluated Audios Audio dataset will be made available shortly

Name		Name	Last commit message	Last commit date
Latest commit History 106 Commits
NISQA		NISQA
NISQA_analysis		NISQA_analysis
UTMOS		UTMOS
analysis		analysis
backend		backend
classifier		classifier
defensa		defensa
embeddings/wav2vec/code		embeddings/wav2vec/code
frontend		frontend
write-up		write-up
.gitignore		.gitignore
Desarrollo_de_una_base_de_datos_para_evaluación_automática_de_sistemas_de_texto_a_habla.pdf		Desarrollo_de_una_base_de_datos_para_evaluación_automática_de_sistemas_de_texto_a_habla.pdf
README.md		README.md
label_distribution.png		label_distribution.png
notes.txt		notes.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Source code and adittional results of INTERSPEECH 2025 paper 'A Dataset for Automatic Assessment of TTS Quality in Spanish'*

About

Uh oh!

Releases

Packages

Uh oh!

Languages

asosawelford/DenseMOS

Folders and files

Latest commit

History

Repository files navigation

Source code and adittional results of INTERSPEECH 2025 paper 'A Dataset for Automatic Assessment of TTS Quality in Spanish'*

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages