2023 – Page 2 – AUDIAS-UAM

Personalized keyword spotting detection: Research internship @ Google

September 25, 2023 | October 9, 2023 | daniel

Speaker: Beltrán Labrador Serrano. Abstract: Keyword spotting systems are used in a variety of applications, such as smart speakers and voice assistants. However, these systems can be challenged by diverse accents, age groups, and speaking conditions. In this talk, I will…

Sound Event Detection with Conformer: the AUDIAS system for DCASE 2023

September 18, 2023 | September 22, 2023 | daniel

Speaker: Sara Barahona Quirós. Abstract: The Conformer architecture has achieved state-of-the-art results in several tasks, including automatic speech recognition and automatic speaker verification. However, its utilization in sound event detection and in particular in the DCASE Challenge Task 4 has…

Deployment of KWS models: audio features optimization and streaming mode

June 16, 2023 | June 19, 2023 | daniel

Speaker: William Fernando López Gavilánez. Abstract: The deployment process of Keyword Spotting (KWS) models depends on the target hardware; it normally includes merging components in a black box, binarization, quantization, and/or mobile optimization. In addition, while processing a continuous stream…

Lines of research in the field of acoustic events detection

June 9, 2023 | June 19, 2023 | daniel

Speaker: Sergio Segovia González. Abstract: As part of a doctoral thesis on acoustic event detection, several lines of research have been pursued, such as using the…

Fairness in the most popular ASR systems

June 1, 2023 | June 6, 2023 | daniel

Speaker: Pilar Fernández Gallego. Abstract: Nowadays, ASR (Automatic Speech Recognition) systems have improved dramatically, due both to advances in deep learning and to the collection of large datasets used to train the systems. However, it has been demonstrated that some…

VoxCeleb-Spain: design, acquisition and evaluation with deep neural networks of a database of Spanish celebrity voices

May 26, 2023 | May 31, 2023 | daniel

Speaker: Manuel Otero González. Abstract: This work presents a new database, VoxCeleb-Spain, captured following protocols analogous to those of the well-known VoxCeleb database, but using YouTube(TM) videos of celebrities of Spanish nationality. The evaluation of the database through various experiments is also…

GuitarSet: A Dataset for Guitar Transcription

May 19, 2023 | May 22, 2023 | daniel

Speaker: Diego de Benito Gorrón. Abstract: Based on https://guitarset.weebly.com/uploads/1/2/1/6/121620128/xi_ismir_2018.pdf. The guitar is a popular instrument for a variety of reasons, including its ability to produce polyphonic sound and its musical versatility. The resulting variability of sounds, however, poses significant challenges…

Representing evidence for Bayesian updating: compositional evidence, privacy and calibration

May 12, 2023 | May 22, 2023 | daniel

Speaker: Paul-Gauthier Noé. Abstract: Attribute privacy in multimedia technology aims to hide only one or a few personal characteristics, or attributes, of an individual rather than the full identity. To give a few examples, these attributes can be the sex,…

Detection of abnormalities in electrocardiograms with 2 sensors using machine learning

May 5, 2023 | May 8, 2023 | daniel

Speaker: Ana Molina Conesa. Abstract: This talk is based on the Physionet Challenge 2021, in which participants aim to design and implement an algorithm capable of automatically identifying any cardiac abnormalities present in electrocardiogram (ECG) recordings with 12, 6, 4,…

Anomaly detection in 12-lead electrocardiograms using machine learning

April 28, 2023 | May 8, 2023 | daniel

Speaker: Miguel González Rodríguez. Abstract: The Physionet Challenge 2021 is presented. The goal is to classify 27 types of cardiac anomalies from electrocardiograms using convolutional neural networks (CNN). The challenge database consists of over 30,000 patient records, making it one…
