Speaker: Arturo Domínguez Santos. Abstract: This Master’s thesis addresses the challenge of investigating how emotions affect speaker verification and proposes a system that integrates this emotional variability to try to improve accuracy. The focus is on the speaker’s emotions, which has traditionally…
Exploring Speech Foundation Models for End-to-End Speaker Diarization
Speaker: Laura Herrera Alarcón. Abstract: In this Master’s Thesis the use of pre-trained models for the diarization task has been studied in order to exploit their ability to extract robust and discriminative features. In particular, the WavLM model has been combined with…
Interpretation of fingerprint evidence with likelihood ratios (LRs – Likelihood ratios)
Speaker: Joaquín González Rodríguez. Abstract: The forensic fingerprint identification process based on the ACE-V method, widely implemented, makes absolute identification or exclusion decisions that depend on opinions that vary from expert to expert (for example, whether we consider an observed…
Stabilising Reinforcement Learning with Past Action-State Representation Learning
Speaker: Tamas Endrei. Abstract: Although deep reinforcement learning (DRL) deals with sequential decision-making problems, temporal information representation is absent from state-of-the-art actor-critic algorithms. The reliance on only the current timestep information causes instability in concurrent actions. Furthermore, the over-reliance on…
Enhancing Sound Event Detection and Speaker Verification employing weak supervision
Speaker: Sara Barahona Quirós. Abstract: In this seminar, we will explore approaches for training acoustic event detection and speaker verification systems employing limited labels. Specifically, for the first task, we will explain the optimization process of a system based on…
Transformers for Binding Prediction of Hypoxia-Induced Factors
Speaker: Manuel Fernando Mollón Laorca. Abstract: Hypoxia-inducible factors (HIFs) are proteins that play a crucial role in the cellular response to low oxygen levels. Accurate prediction of the binding of these factors to their target DNA is essential for understanding…
Whisper‑based spoken term detection systems for search on speech ALBAYZIN evaluation challenge
Speaker: Javier Tejedor Noguerales. Abstract: The vast amount of information stored in audio repositories makes it necessary to develop efficient and automatic methods to search audio content. In that direction, search on speech (SoS) has received much attention in…
Road map for Albayzin Diarization Challenge 2024
Speaker: Jérémie Touati. Abstract: The diarization challenge of the 2024 Albayzin evaluation stands out for its various difficulties. The recordings, which come from databases of Spanish radio and television programs, can last up to several hours and contain an undetermined and…
Introduction to the Language-Based Audio Retrieval task.
Speaker: Manuel Otero. Abstract: Language-Based Audio Retrieval is a task of the DCASE Challenge, which is based on the retrieval of audio information from natural language descriptions. Two of the best-performing approaches in the state of the art will…
Data Augmentation for Respiratory Cycle Classification
Speaker: Miguel Ángel. Abstract: Analysing respiratory audio recordings in order to detect and classify adventitious respiratory sounds is of vital importance for the development of continuous monitoring tools for patients with respiratory diseases. The ICBHI 2017 database is the most widely…