Speaker: Juan Maroñas Molano Abstract: Deep Neural Networks (DNN) represent the state of the art in many tasks. However, due to their overparameterization, their generalization capabilities are in doubt and still a field under study. Consequently, DNN can overfit and… Read More
Self-supervised deep learning approaches for speaker recognition
Speaker: Joaquín González Abstract: In this talk I will review the thesis “Self-supervised deep learning approaches for speaker recognition” presented by Umair Khan at the UPC (Universidad Politecnica de Cataluña) in January 2021, directed by Javier Hernando. In this thesis… Read More
Data augmentation for improved robustness against packet losses in ASR
Speaker: María Pilar Fernández Gallego Abstract: Nowadays a large amount of companies record conversations, calls, sales or even meetings, in many cases to comply with the current legislation. Apart from the legal need, these recordings constitute an invaluable source of… Read More
End-to-end Query-by-example Spoken Term Detection
Speaker: Juan Ignacio Álvarez Trejos Abstract: Query-by-example Spoken Term Detection (QbE-STD) is a keytechnology to harness the large amount of audiovisual content that is being stored and generated nowadays. Using audio example queries for STD has several advantages such as… Read More
AUDIAS-UAM System for the Albayzin 2020 Speech to Text Challenge
Speaker: Beltrán Labrador Serrano Abstract: This presentation describes the system submitted by the AUDIAS-UAM team for the Albayzin 2020 Speech to Text Challenge. Our system is an end to end Transformer-based system built using ESPnet Toolkit. The acoustic model is… Read More
Multi-resolution Sound Event Detection
Speaker: Diego de Benito Gorrón Abstract: The Sound Event Detection task aims to determine the temporal locations of acoustic events in audio clips. Over the recent years, this field is holding a rising relevance due to the introduction of datasets… Read More
BUT system for the Short-duration Speaker Verification challenge 2020
Speaker: Alicia Lozano Díez Abstract: In this talk, I present the Brno University of Technology (BUT) system submitted for the text-dependent task of the Short-duration Speaker Verification challenge 2020, which was the best performing system for this task. We explored… Read More
Measuring Calibration in Deep Learning
Speaker: Daniel Ramos Castro Abstract: In this talk, we will present the article Nixon et al. 2020, “Measuring Calibration in Deep Learning”, published in CVPR Workshops 2020. In this paper, the current most popular measure of calibration for deep learning,… Read More
Beltrán Labrador has been awarded a FPI PhD Grant from the Spanish Ministry of Education
Beltran Labrador has been selected as a Pre-doctoral fellow by the Spanish Ministry of Education (FPI PhD Grant), to work with AUDIAS towards his PhD degree.
AUDIAS is organizing ALBAYZIN Search-on-Speech 2020
The AUDIAS research group is organizing the fifth ALBAYZIN Search-on-Speech challenge in collaboration with University CEU San Pablo. The challenge will take place in the context of the next IberSPEECH conference.