Speaker: Clara Adsuar Ávila. Abstract: In this project, we address the importance of enhancing the accessibility and usefulness of Deep Learning technologies for non-standard speakers. From a linguistic perspective, rural Spanish areas are rich in dialectal variety. However, most technology… Read More
Emotion recognition in Spanish audio
Speaker: Manuel Otero González. Abstract: En esta charla se explicará la tarea de reconocimiento de emociones en audios en español, presentando los enfoques más avanzados del estado del arte, como Wav2Vec2 y W2V-Bert. Además, se introducirá el reto EmoSPeech, cuyo… Read More
State of the Art in Sound Event Detection and DCASE Evaluations
Speaker: Doroteo Torre Toledano. Abstract: In this talk I will review the most recent trajectory of the AUDIAS group in the field of Sound Event Detection (SED), highlighting our participations in DCASE evaluations (Task 4) from 2020 to 2023. Then,… Read More
Large Language Models: From Theory to Practice in Text Classification
Speaker: Miguel Ángel Martínez Pay. Abstract: This work presents a comprehensive overview of Large Language Models (LLMs), from their theoretical framework to practical applications in text classification. It compares the effectiveness of two key approaches: fine-tuning embeddings of smaller models… Read More
Integration of Emotional Information in Speaker Recognition Systems
Speaker: Arturo Domínguez Santos. Abstract: This Master’s thesis addresses the challenge of investigating how emotions affect speakerverification and proposes a system that integrates this emotional variability to try toimprove accuracy. The focus is on the speaker’s emotions, which has traditionally… Read More
Exploring Speech Foundation Models for End-to-End Speaker Diarization
Speaker: Laura Herrera Alarcón. Abstract: In this Master’s Thesis the use of pre-trained models for the diarization task has beenstudied in order to exploit their ability to extract robust and discriminative features.In particular, the WavLM model has been combined with… Read More
Interpretation of fingerprint evidence with likelihood ratios (LRs – Likelihood ratios)
Speaker: Joaquín González Rodríguez. Abstract: The forensic fingerprint identification process based on the ACE-V method, widely implemented, makes absolute identification or exclusion decisions that depend on opinions that vary from expert to expert (for example, whether we consider an observed… Read More
SGE & CCC Architecture – Introduction for Beginners
Speaker: Adrián Aranda Marcos. Abstract: Simple technical introduction for using SGE with AUDIAS servers (Son of Grid Engine) and CCC (UAM’s Central Computing Center).
Study of the predictive capacity of the efficacy of platelet-rich plasma (PRP) treatments in joint injuries
Speaker: Berta Caunedo Castro. Abstract: This Final Degree Project evaluates Platelet-Rich Plasma (PRP) therapy as an alternative to traditional treatments for knee osteoarthritis, a prevalent joint condition. PRP uses regenerative growth factors from the patient’s blood, but its variability complicates… Read More
Leveraging Speaker Embeddings in End-to-End Neural Diarization for Two-Speaker Scenarios
Speaker: Juan Ignacio Álvarez Trejos. Abstract: This presentation covers the work presented at Odyssey 2024, focusing on speaker diarization in two-speaker scenarios. End-to-end neural speaker diarization systems are designed to handle overlapping speech while accurately distinguishing between speakers. In this… Read More