Speaker: Manuel Otero González. Abstract: This talk explains the task of emotion recognition in Spanish-language audio and presents the most advanced state-of-the-art approaches, such as Wav2Vec2 and W2V-Bert. It also introduces the EmoSPeech challenge, whose…
State of the Art in Sound Event Detection and DCASE Evaluations
Speaker: Doroteo Torre Toledano. Abstract: In this talk I will review the most recent trajectory of the AUDIAS group in the field of Sound Event Detection (SED), highlighting our participations in DCASE evaluations (Task 4) from 2020 to 2023. Then,…
Large Language Models: From Theory to Practice in Text Classification
Speaker: Miguel Ángel Martínez Pay. Abstract: This work presents a comprehensive overview of Large Language Models (LLMs), from their theoretical framework to practical applications in text classification. It compares the effectiveness of two key approaches: fine-tuning embeddings of smaller models…
Integration of Emotional Information in Speaker Recognition Systems
Speaker: Arturo Domínguez Santos. Abstract: This Master’s thesis investigates how emotions affect speaker verification and proposes a system that integrates this emotional variability to try to improve accuracy. The focus is on the speaker’s emotions, which has traditionally…
Exploring Speech Foundation Models for End-to-End Speaker Diarization
Speaker: Laura Herrera Alarcón. Abstract: This Master’s thesis studies the use of pre-trained models for the diarization task in order to exploit their ability to extract robust and discriminative features. In particular, the WavLM model has been combined with…
Interpretation of fingerprint evidence with likelihood ratios (LRs)
Speaker: Joaquín González Rodríguez. Abstract: The forensic fingerprint identification process based on the ACE-V method, widely implemented, makes absolute identification or exclusion decisions that depend on opinions that vary from expert to expert (for example, whether we consider an observed…
SGE & CCC Architecture – Introduction for Beginners
Speaker: Adrián Aranda Marcos. Abstract: A simple technical introduction to using SGE (Son of Grid Engine) with the AUDIAS servers and the CCC (UAM’s Central Computing Center).
Stabilising Reinforcement Learning with Past Action-State Representation Learning
Speaker: Tamas Endrei. Abstract: Although deep reinforcement learning (DRL) deals with sequential decision-making problems, temporal information representation is absent from state-of-the-art actor-critic algorithms. The reliance on only the current timestep information causes instability in concurrent actions. Furthermore, the over-reliance on…
Study of the predictive capacity of the efficacy of platelet-rich plasma (PRP) treatments in joint injuries
Speaker: Berta Caunedo Castro. Abstract: This Final Degree Project evaluates Platelet-Rich Plasma (PRP) therapy as an alternative to traditional treatments for knee osteoarthritis, a prevalent joint condition. PRP uses regenerative growth factors from the patient’s blood, but its variability complicates…
Enhancing Sound Event Detection and Speaker Verification employing weak supervision
Speaker: Sara Barahona Quirós. Abstract: In this seminar, we will explore approaches for training acoustic event detection and speaker verification systems with limited labels. Specifically, for the first task, we will explain the optimization process of a system based on…