Speaker: Doroteo Torre Toledano. Abstract: In this talk I will review the most recent trajectory of the AUDIAS group in the field of Sound Event Detection (SED), highlighting our participations in DCASE evaluations (Task 4) from 2020 to 2023. Then,…
Large Language Models: From Theory to Practice in Text Classification
Speaker: Miguel Ángel Martínez Pay. Abstract: This work presents a comprehensive overview of Large Language Models (LLMs), from their theoretical framework to practical applications in text classification. It compares the effectiveness of two key approaches: fine-tuning embeddings of smaller models…
Integration of Emotional Information in Speaker Recognition Systems
Speaker: Arturo Domínguez Santos. Abstract: This Master’s thesis addresses the challenge of investigating how emotions affect speaker verification and proposes a system that integrates this emotional variability to try to improve accuracy. The focus is on the speaker’s emotions, which has traditionally…
Exploring Speech Foundation Models for End-to-End Speaker Diarization
Speaker: Laura Herrera Alarcón. Abstract: In this Master’s Thesis the use of pre-trained models for the diarization task has been studied in order to exploit their ability to extract robust and discriminative features. In particular, the WavLM model has been combined with…