Skip to content
Main Menu
  • Home
  • News
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact
AUDIAS-UAM
  • Home
  • News
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact

Year: 2024

  • Home
  • 2024

Foundational Models for Self-Supervised Speaker Diarization and Target Speaker ASR

December 18, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Alicia Lozano-Diez. Abstract: In this talk, I will review a few of the last trends in speaker diarization and target speaker ASR. I will present two papers that address these two tasks respectively, and leverage the power of foundational… Read More

AUDIAS Seminars

What can LLMs bring to the field of acoustic event detection?

December 11, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Sergio Segovia González. Abstract: The answer to this question through these two articles, “WILDDESED: An LLM-POWERED dataset for wild domestic environment sound event detection system” and “Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection” has been carried… Read More

AUDIAS Seminars

Exploring Large Protein Language Models in Constrained Data Regimes

December 4, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Manuel Fernando Mollon Laorca. Abstract: In this study, we expand upon the FLIP benchmark—designed for evaluating protein language models (pLMs) in small, specialized prediction tasks—by assessing the performance of state-of-the-art models, including ESM-2, SaProt, and Tranception, on the FLIP… Read More

AUDIAS Seminars

Fusion-Based Speaker Diarization: Insights from IberSpeech2024

November 27, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Juan Ignacio Álvarez Trejos. Abstract: This talk presents the results of our participation in the speaker diarization challenge at IberSpeech2024. Our approach combines the strengths of three diarization models: a custom-trained Diaper model, Pyannote, and VBx, through an innovative… Read More

AUDIAS Seminars

Device-robust audio classification

November 20, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Wiliam Fernando López Gavilánez. Abstract: Audio classifiers designed for deployment across diverse devices often face unforeseen conditions during inference, attributable to device-specific characteristics. These challenges stem from variations in microphone transfer functions or on-chip digital signal pre-processing, which result… Read More

AUDIAS Seminars

Towards Efficient Conformer-based Sound Event Detection

November 6, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Sara Barahona Quirós. Abstract: The Conformer architecture has shown excellent performance in accurately classifying sound events but lacks temporal precision when predicting time boundaries. While increasing the length of the input sequences can mitigate this issue, it also increases… Read More

AUDIAS Seminars

Analysis of Speaker Label Matching for Diarization of Long Audios on RTVE2022 Dataset

November 6, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Laura Herrera Alarcón. Abstract: This study introduces an algorithm to match predicted speaker labels from short audio segments into a final prediction. This involves extracting an x-vector for each speaker in each segment and applying constrained Agglomerative Clustering to… Read More

AUDIAS Seminars

Analyzing DiaPer EEND Speaker Diarization Models on the RTVE2022 Dataset

November 6, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Juan Ignacio Álvarez Trejos. Abstract: The task of speaker diarization has lately been successfully tackled with end-to-end neural diarization (EEND) models instead of modular cascaded ones. Among them, the very new EEND Perceiver-based attractors (DiaPer) comes with a light… Read More

AUDIAS Seminars

Automatic Speech Recognition in Dialectal Data (COSER)

October 30, 2024November 6, 2024 daniel

Speaker: Clara Adsuar Ávila. Abstract: In this project, we address the importance of enhancing the accessibility and usefulness of Deep Learning technologies for non-standard speakers. From a linguistic perspective, rural Spanish areas are rich in dialectal variety. However, most technology… Read More

AUDIAS Seminars

Emotion recognition in Spanish audio

October 2, 2024October 1, 2024 daniel

Speaker: Manuel Otero González. Abstract: En esta charla se explicará la tarea de reconocimiento de emociones en audios en español, presentando los enfoques más avanzados del estado del arte, como Wav2Vec2 y W2V-Bert. Además, se introducirá el reto EmoSPeech, cuyo… Read More

AUDIAS Seminars

Posts navigation

1 2 … 4 Next

AUDIAS Seminars

Joint Automatic Speech Recognition And Structure. Learning For Better Speech Understanding

January 29, 2025

Speaker: María Pilar Fernández Gallego. Abstract: Spoken language understanding (SLU)…

A Whisper-based Query-by-Example Spoken Term Detection approach for search on speech

January 22, 2025

Speaker: Javier Tejedor Noguerales. Abstract: Nowadays, in the digital era,…

News & Events

Alicia Lozano-Diez selected for a MSCA grant for an intership at MIT

April 14, 2023

AUDIAS PhD Students hired!

February 2, 2023

About AUDIAS

AUDIAS is a solid research group addressing challenging problems in speech, audio and temporal signals from deep foundations in machine learning and signal processing.

Quick Links

  • Home
  • News
  • People
  • Publications
  • Industry
  • Evaluations
  • For Students
  • Contact

Highlights

DeepMUSE Research Project granted to AUDIAS

June 24, 2022

Sergio Álvarez Balanya selected for an intership in Amazon

April 23, 2022
Copyright © AUDIAS-UAM All rights reserved.
Education Mind by Axle Themes