November 2024 – AUDIAS-UAM

Fusion-Based Speaker Diarization: Insights from IberSpeech2024

November 27, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Juan Ignacio Álvarez Trejos. Abstract: This talk presents the results of our participation in the speaker diarization challenge at IberSpeech2024. Our approach combines the strengths of three diarization models: a custom-trained Diaper model, Pyannote, and VBx, through an innovative… Read More

Device-robust audio classification

November 20, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Wiliam Fernando López Gavilánez. Abstract: Audio classifiers designed for deployment across diverse devices often face unforeseen conditions during inference, attributable to device-specific characteristics. These challenges stem from variations in microphone transfer functions or on-chip digital signal pre-processing, which result… Read More

Towards Efficient Conformer-based Sound Event Detection

November 6, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Sara Barahona Quirós. Abstract: The Conformer architecture has shown excellent performance in accurately classifying sound events but lacks temporal precision when predicting time boundaries. While increasing the length of the input sequences can mitigate this issue, it also increases… Read More

Analysis of Speaker Label Matching for Diarization of Long Audios on RTVE2022 Dataset

November 6, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Laura Herrera Alarcón. Abstract: This study introduces an algorithm to match predicted speaker labels from short audio segments into a final prediction. This involves extracting an x-vector for each speaker in each segment and applying constrained Agglomerative Clustering to… Read More

Analyzing DiaPer EEND Speaker Diarization Models on the RTVE2022 Dataset

November 6, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Juan Ignacio Álvarez Trejos. Abstract: The task of speaker diarization has lately been successfully tackled with end-to-end neural diarization (EEND) models instead of modular cascaded ones. Among them, the very new EEND Perceiver-based attractors (DiaPer) comes with a light… Read More

Month: November 2024