daniel – AUDIAS-UAM

Automatic Classification of Classical Music Composers from Audio Signals

July 2, 2026July 1, 2026 daniel

Speaker: Sonia Aoi García Shida. Abstract: Automatic composer classification is a challenging task within the field of Music Information Retrieval, as it requires identifying compositional styles between composers from the same musical period, unlike genre classification where differences between classes… Read More

Detection and Grouping of Accents within Rural Spanish

July 2, 2026July 1, 2026 daniel

Speaker: Koral Tubia. Abstract: Rural Spanish preserves a rich dialectal diversity that has received little attention form a computational point of view, partly because most speech processing systems are trained on standard, urban speech. In this work, we use the… Read More

Responsible AI for forensic science with non-human biological findings in the Natural Traces Project

June 19, 2026June 18, 2026 daniel

Speaker: Manuel Fernando Mollón Laorca Abstract: The growing adoption of AI in forensic science demands high performance, interpretability, robustness, and transparency. This research, part of the Horizon Europe Natural Traces Project (https://naturaltraces.com) advances responsible AI through two key forensic applications.… Read More

Adapting Speaker Diarization to Code-Switched Medical Conversations: AUDIAS-UAM at the DISPLACE-M Challenge

June 19, 2026June 18, 2026 daniel

Speaker: Sara Barahona Quirós. Abstract: Speaker diarization of medical conversations presents challenges including spontaneous speech, uneven turn-taking, and speaking style differences between patients and doctors. Track 1 of the DISPLACE-M Challenge addresses this scenario through a dataset of Hindi–English clinical… Read More

Seeing Sound: From Computer Vision to Sound Event Detection

June 12, 2026June 2, 2026 daniel

Speaker: Sergio Segovia González. Abstract: This talk presents the trajectory of my PhD from image- and video-based AI to its later transfer into audio and Sound Event Detection. The central idea is how visual perception methods can inspire audio event… Read More

Large-scale evaluation of P300 BCI systems on BigP3BCI

June 5, 2026June 2, 2026 daniel

Speaker: Álvaro Sáiz López. Abstract: P300-based brain-computer interfaces (BCIs) provide a non-muscular communication channel for patients with severe motor impairments. This work leverages BigP3BCI, a recently released dataset unifying 18 studies and ~200 subjects, to systematically compare feature extractors for… Read More

Evaluation of P300-Based Brain-Computer Interfaces in Amyotrophic Lateral Sclerosis

June 5, 2026June 2, 2026 daniel

Speaker: Julia Reina Boria. Abstract: In this talk, I will present the evaluation of P300-based brain-computer interfaces as an assistive communication technology for people with amyotrophic lateral sclerosis. The work analyzes EEG signals from a P300 dataset and compares different… Read More

Automatic metal subgenre recognition system

June 2, 2026June 3, 2026 daniel

Speaker: Alejandro André Vivas Freitas. Abstract: Music can evoke countless emotions regardless of culture or age, and although the classification of musical genres has existed for centuries, automatic classification is a relatively recent discipline (barely two and a half decades… Read More

Audio Event Processing applied to the detection of different frog species.

June 1, 2026June 2, 2026 daniel

Speaker: Nicolás Martín Ansorregui. Abstract: Passive Acoustic Monitoring (PAM) in tropical ecosystems faces significant challenges due to overlapping vocalizations and severe class imbalance, often leading to the ‘algorithmic invisibility’ of rare species. This talk presents a deep learning architecture that… Read More

Analysis of Deepfakes and Anti-spoofing in Speaker Verification

June 1, 2026June 2, 2026 daniel

Speaker: Alejandro Delgado Montero Abstract: This presentation explores the critical challenge of detecting audio deepfakes and defending speaker verification systems against voice spoofing attacks, evaluated within the frameworks of the ASVspoof 5 and ESDD2 international challenges. We examine the design… Read More

Author: daniel