AUDIAS Seminars – Page 10

Training Speaker Recognition Systems with Limited Data

November 4, 2022November 11, 2022 daniel

Speaker: Guillermo Recio. Abstract: Based on paper https://www.isca-speech.org/archive/pdfs/interspeech_2022/vaessen22_interspeech.pdf. This work considers training neural networks for speaker recognition with smaller datasets compared to contemporary work. For this purpose, they propose three subsets of the VoxCeleb2 dataset. Each of these subsets contains… Read More

Exploring sequence-to-sequence transformer-transducer models for keyword spotting

October 14, 2022November 15, 2022 daniel

Speaker: Beltrán Labrador Serrano. Abstract: Beltrán’s final Google research internship presentation. This presentation introduces a transformer-transducer keyword spotting system that simultaneously optimizes ASR and keyword spotting losses using a sequence to sequence RNN-T loss. Each loss is further balanced using… Read More

Perceiver: General Perception with Iterative Attention

October 7, 2022October 20, 2022 daniel

Speaker: Juan Ignacio Álvarez Trejos. Abstract: Biological systems perceive the world by simultaneously processing high-dimensional inputs from modalities as diverse as vision, audition, touch, proprioception, etc. The perception models used in deep learning on the other hand are designed for… Read More

Continual learning for recurrent neural networks

September 30, 2022September 30, 2022 daniel

Speaker: Doroteo Torre Toledano Abstract: The current trend in machine learning assumes that there is a fixed distribution of incoming data, so that a fixed model can be learned to map incoming data to output classes. However, real applications in… Read More

Source Separation for Sound Event Detection in Domestic Environments Using Jointly Trained Models

September 23, 2022September 23, 2022 daniel

Speaker: Diego de Benito Gorrón. Abstract: Sound Event Detection and Source Separation are closely related tasks: whereas the first aims to find the time boundaries of acoustic events inside a recording, the goal of the latter is to isolate each… Read More

Representaciones de audio self-supervised Wav2Vec2 para el reconocimiento de locutor

September 16, 2022September 20, 2022 daniel

Speaker: Laura Herrera. Abstract: In this Final Degree Project, different speech representations, extracted by unsupervised learning, have been used to train a speaker recognition system. In particular, Wav2Vec2.0 and WavLM features have been used as a novelty. The Wav2Vec2.0 features… Read More

End-to-end deep learning models for air traffic control speech recognition

September 13, 2022September 20, 2022 daniel

Speaker: Ana Belén Fernández Cordero. Abstract: For many years, Air Traffic Controllers have had to manually type the information they received and transmitted to pilots into the electronic flight strip systems. This time consuming activity contributed to a significant increase… Read More

Efficient Transformers for End-to-End Neural Speaker Diarization

September 9, 2022September 20, 2022 daniel

Speaker: Sergio Izquierdo. Abstract: The recently proposed End-to-End Neural speaker Diarization framework (EEND) handles speech overlap and speech activity detection natively. While extensions of this work have reported remarkable results in both two-speaker and multi-speaker diarization scenarios, these come at… Read More

Sound Event Detection in a large-scale audio dataset with multi-resolution neural networks

September 9, 2022September 20, 2022 daniel

Speaker: Sara Barahona Quirós. Abstract: Sound event detection is the task that aims to automatize the human’s ability of recognizing sound events in the environment by their particular acoustic information. For this purpose, deep learning techniques are employed to build… Read More

A Speaker Verification Backend with Robust Performance across Conditions

July 14, 2022July 14, 2022 daniel

Speaker: Joaquin Gonzalez-Rodriguez. Abstract: Presentation of the paper in https://arxiv.org/abs/2102.01760: L. Ferrer et al. “A Speaker Verification Backend with Robust Performance across Conditions”, 2021. Abstract of the paper (reproduced from the preprint): In this paper, we address the problem of… Read More

Category: AUDIAS Seminars