AUDIAS Seminars – Page 11

Iterative psuedo-forced alignment tool

February 17, 2023March 3, 2023 daniel

Speaker: W. Fernando López Gavilánez. Abstract: High-quality data labeling from specific domains is costly and human time-consuming. In this work, we propose an iterative pseudo-forced alignment algorithm for long audio files with low-quality transcriptions. The alignments are iteratively done by… Read More

Differentially Private Fine-Tuning for Language Models

February 10, 2023February 10, 2023 daniel

Speaker: Beltrán Labrador Serrano. Abstract: Based on https://arxiv.org/abs/2110.06500. In this talk we will comment the paper Differentially Private Fine-Tuning for Language Models, where the authors give simpler, sparser, and faster algorithms for differentially private fine-tuning of large-scale pre-trained language models,… Read More

Conformer Architecture for Sound Event Detection (DCASE)

February 3, 2023February 3, 2023 daniel

Speaker: Sara Barahona Quirós. Abstract: Sound Event Detection is the task that is focused on automatizing the human’s ability of recognizing sound events in the environment. Over the last years, the creation of evaluations such as the Detection and Classification… Read More

MixMatch: A Holistic Approach to Semi-Supervised Learning

January 20, 2023February 3, 2023 daniel

Speaker: Diego de Benito Gorrón. Abstract: This talk is an overview of a NIPS 2019 paper by David Berthelot et al. (Google Research) that proposes a novel method for Semi-supervised learning: MixMatch. “Semi-supervised learning has proven to be a powerful… Read More

Highly accurate protein structure prediction with AlphaFold

January 13, 2023January 16, 2023 daniel

Speaker: Juan Ignacio Álvarez Trejos. Abstract: Based on https://www.nature.com/articles/s41586-021-03819-2. Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort, the structures of around 100,000 unique proteins have been… Read More

Whisper: Robust Speech Recognition via Large-Scale Weak Supervision

December 2, 2022January 16, 2023 daniel

Speaker: Doroteo Torre Toledano. Abstract: Very recently (in Sept 2022) OpenAI has made freely available a speech recognition neural network called Whisper. One of the main differences with respect to the current state of the art is the use of… Read More

Dynamic Bayesian Networks for Temporal Prediction of Chemical Radioisotope Levels in Nuclear Power Plant Reactors

November 18, 2022January 16, 2023 daniel

Speaker: Daniel Ramos Castro. Abstract: Radiation dose in nuclear power plant reactors is known to be dominated by the presence of radioisotopes in the primary loop of the reactor. In order to strictly control it in normal operation (e.g., cleaning… Read More

Automatic adventitious respiratory sound analysis: A systematic review

November 11, 2022November 16, 2022 daniel

Speaker: Miguel Ángel Martínez Pay. Abstract: Based on https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0177926. Automatic detection or classification of adventitious sounds is useful to assist physicians in diagnosing or monitoring diseases such as asthma, Chronic Obstructive Pulmonary Disease, and pneumonia. This article contains a compilation… Read More

Training Speaker Recognition Systems with Limited Data

November 4, 2022November 11, 2022 daniel

Speaker: Guillermo Recio. Abstract: Based on paper https://www.isca-speech.org/archive/pdfs/interspeech_2022/vaessen22_interspeech.pdf. This work considers training neural networks for speaker recognition with smaller datasets compared to contemporary work. For this purpose, they propose three subsets of the VoxCeleb2 dataset. Each of these subsets contains… Read More

Exploring sequence-to-sequence transformer-transducer models for keyword spotting

October 14, 2022November 15, 2022 daniel

Speaker: Beltrán Labrador Serrano. Abstract: Beltrán’s final Google research internship presentation. This presentation introduces a transformer-transducer keyword spotting system that simultaneously optimizes ASR and keyword spotting losses using a sequence to sequence RNN-T loss. Each loss is further balanced using… Read More

Category: AUDIAS Seminars