2024 – Page 2 – AUDIAS-UAM

One model to rule them all? Towards end-to-end joint speaker diarization and speech recognition

October 9, 2024July 9, 2025 Adrián Aranda Márquez

Speaker: Laura Herrera Alarcón. Abstract: This paper presents a novel framework for joint speaker diarization (SD) and automatic speech recognition (ASR), named SLIDAR (sliding-window diarization-augmented recognition). SLIDAR can process arbitrary length inputs and can handle any number of speakers, effectively… Read More

Emotion recognition in Spanish audio

October 2, 2024October 1, 2024 daniel

Speaker: Manuel Otero González. Abstract: En esta charla se explicará la tarea de reconocimiento de emociones en audios en español, presentando los enfoques más avanzados del estado del arte, como Wav2Vec2 y W2V-Bert. Además, se introducirá el reto EmoSPeech, cuyo… Read More

State of the Art in Sound Event Detection and DCASE Evaluations

September 25, 2024September 30, 2024 daniel

Speaker: Doroteo Torre Toledano. Abstract: In this talk I will review the most recent trajectory of the AUDIAS group in the field of Sound Event Detection (SED), highlighting our participations in DCASE evaluations (Task 4) from 2020 to 2023. Then,… Read More

Large Language Models: From Theory to Practice in Text Classification

September 18, 2024September 18, 2024 daniel

Speaker: Miguel Ángel Martínez Pay. Abstract: This work presents a comprehensive overview of Large Language Models (LLMs), from their theoretical framework to practical applications in text classification. It compares the effectiveness of two key approaches: fine-tuning embeddings of smaller models… Read More

Integration of Emotional Information in Speaker Recognition Systems

September 11, 2024September 11, 2024 daniel

Speaker: Arturo Domínguez Santos. Abstract: This Master’s thesis addresses the challenge of investigating how emotions affect speakerverification and proposes a system that integrates this emotional variability to try toimprove accuracy. The focus is on the speaker’s emotions, which has traditionally… Read More

Exploring Speech Foundation Models for End-to-End Speaker Diarization

September 4, 2024September 11, 2024 daniel

Speaker: Laura Herrera Alarcón. Abstract: In this Master’s Thesis the use of pre-trained models for the diarization task has beenstudied in order to exploit their ability to extract robust and discriminative features.In particular, the WavLM model has been combined with… Read More

Interpretation of fingerprint evidence with likelihood ratios (LRs – Likelihood ratios)

July 3, 2024September 13, 2024 daniel

Speaker: Joaquín González Rodríguez. Abstract: The forensic fingerprint identification process based on the ACE-V method, widely implemented, makes absolute identification or exclusion decisions that depend on opinions that vary from expert to expert (for example, whether we consider an observed… Read More

SGE & CCC Architecture – Introduction for Beginners

June 26, 2024February 17, 2026 daniel

Speaker: Adrián Aranda Márquez. Abstract: Simple technical introduction for using SGE with AUDIAS servers (Son of Grid Engine) and CCC (UAM’s Central Computing Center).

Stabilising Reinforcement Learning with Past Action-State Representation Learning

June 18, 2024June 25, 2024 audias

Speaker: Tamas Endrei. Abstract: Although deep reinforcement learning (DRL) deals with sequential decision-making problems, temporal information representation is absent from state-of-the-art actor-critic algorithms. The reliance on only the current timestep information causes instability in concurrent actions. Furthermore, the over-reliance on… Read More

Study of the predictive capacity of the efficacy of platelet-rich plasma (PRP) treatments in joint injuries

June 11, 2024September 18, 2024 daniel

Speaker: Berta Caunedo Castro. Abstract: This Final Degree Project evaluates Platelet-Rich Plasma (PRP) therapy as an alternative to traditional treatments for knee osteoarthritis, a prevalent joint condition. PRP uses regenerative growth factors from the patient’s blood, but its variability complicates… Read More

Year: 2024