Speaker: Joaquín González Abstract: In this talk I will review the thesis “Self-supervised deep learning approaches for speaker recognition” presented by Umair Khan at the UPC (Universidad Politecnica de Cataluña) in January 2021, directed by Javier Hernando. In this thesis… Read More
Data augmentation for improved robustness against packet losses in ASR
Speaker: María Pilar Fernández Gallego Abstract: Nowadays a large amount of companies record conversations, calls, sales or even meetings, in many cases to comply with the current legislation. Apart from the legal need, these recordings constitute an invaluable source of… Read More
End-to-end Query-by-example Spoken Term Detection
Speaker: Juan Ignacio Álvarez Trejos Abstract: Query-by-example Spoken Term Detection (QbE-STD) is a keytechnology to harness the large amount of audiovisual content that is being stored and generated nowadays. Using audio example queries for STD has several advantages such as… Read More
AUDIAS-UAM System for the Albayzin 2020 Speech to Text Challenge
Speaker: Beltrán Labrador Serrano Abstract: This presentation describes the system submitted by the AUDIAS-UAM team for the Albayzin 2020 Speech to Text Challenge. Our system is an end to end Transformer-based system built using ESPnet Toolkit. The acoustic model is… Read More
Multi-resolution Sound Event Detection
Speaker: Diego de Benito Gorrón Abstract: The Sound Event Detection task aims to determine the temporal locations of acoustic events in audio clips. Over the recent years, this field is holding a rising relevance due to the introduction of datasets… Read More