Speaker: Sonia Aoi García Shida. Abstract: Automatic composer classification is a challenging task within the field of Music Information Retrieval, as it requires identifying compositional styles between composers from the same musical period, unlike genre classification where differences between classes… Read More
Detection and Grouping of Accents within Rural Spanish
Speaker: Koral Tubia. Abstract: Rural Spanish preserves a rich dialectal diversity that has received little attention form a computational point of view, partly because most speech processing systems are trained on standard, urban speech. In this work, we use the… Read More
Responsible AI for forensic science with non-human biological findings in the Natural Traces Project
Speaker: Manuel Fernando Mollón Laorca Abstract: The growing adoption of AI in forensic science demands high performance, interpretability, robustness, and transparency. This research, part of the Horizon Europe Natural Traces Project (https://naturaltraces.com) advances responsible AI through two key forensic applications.… Read More
Adapting Speaker Diarization to Code-Switched Medical Conversations: AUDIAS-UAM at the DISPLACE-M Challenge
Speaker: Sara Barahona Quirós. Abstract: Speaker diarization of medical conversations presents challenges including spontaneous speech, uneven turn-taking, and speaking style differences between patients and doctors. Track 1 of the DISPLACE-M Challenge addresses this scenario through a dataset of Hindi–English clinical… Read More
Seeing Sound: From Computer Vision to Sound Event Detection
Speaker: Sergio Segovia González. Abstract: This talk presents the trajectory of my PhD from image- and video-based AI to its later transfer into audio and Sound Event Detection. The central idea is how visual perception methods can inspire audio event… Read More
Large-scale evaluation of P300 BCI systems on BigP3BCI
Speaker: Álvaro Sáiz López. Abstract: P300-based brain-computer interfaces (BCIs) provide a non-muscular communication channel for patients with severe motor impairments. This work leverages BigP3BCI, a recently released dataset unifying 18 studies and ~200 subjects, to systematically compare feature extractors for… Read More
Evaluation of P300-Based Brain-Computer Interfaces in Amyotrophic Lateral Sclerosis
Speaker: Julia Reina Boria. Abstract: In this talk, I will present the evaluation of P300-based brain-computer interfaces as an assistive communication technology for people with amyotrophic lateral sclerosis. The work analyzes EEG signals from a P300 dataset and compares different… Read More
Automatic metal subgenre recognition system
Speaker: Alejandro André Vivas Freitas. Abstract: Music can evoke countless emotions regardless of culture or age, and although the classification of musical genres has existed for centuries, automatic classification is a relatively recent discipline (barely two and a half decades… Read More
Audio Event Processing applied to the detection of different frog species.
Speaker: Nicolás Martín Ansorregui. Abstract: Passive Acoustic Monitoring (PAM) in tropical ecosystems faces significant challenges due to overlapping vocalizations and severe class imbalance, often leading to the ‘algorithmic invisibility’ of rare species. This talk presents a deep learning architecture that… Read More
Analysis of Deepfakes and Anti-spoofing in Speaker Verification
Speaker: Alejandro Delgado Montero Abstract: This presentation explores the critical challenge of detecting audio deepfakes and defending speaker verification systems against voice spoofing attacks, evaluated within the frameworks of the ASVspoof 5 and ESDD2 international challenges. We examine the design… Read More
