daniel – Page 6 – AUDIAS-UAM

Deployment of KWS models: audio features optimization and streaming mode

June 16, 2023 | June 19, 2023 | daniel

Speaker: William Fernando López Gavilánez. Abstract: The deployment process of Keyword Spotting (KWS) models depends on the target hardware; it normally includes merging components into a black box, binarization, quantization, and/or mobile optimization. In addition, while processing a continuous stream…
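
Of the deployment steps named in the abstract, quantization is the easiest to illustrate in isolation. The following is a minimal sketch, assuming symmetric per-tensor int8 quantization in plain Python; the function names `quantize_int8` and `dequantize` are hypothetical and are not taken from the talk, which may rely on a different scheme or toolkit.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization of float weights to the int8 range.

    Assumption: one scale for the whole tensor, mapping the largest
    absolute weight to 127. Not necessarily the scheme used in the talk.
    """
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer and clamp to [-127, 127].
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Map int8 values back to approximate float weights."""
    return [q * scale for q in quantized]


# Illustrative weights only, not values from a real KWS model.
weights = [0.4, -1.0, 0.25, 0.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
```

Each restored weight differs from the original by at most half a quantization step (`scale / 2`), which is the precision traded for the smaller model.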

Lines of research in the field of acoustic events detection

June 9, 2023 | June 19, 2023 | daniel

Speaker: Sergio Segovia González. Abstract: As part of a doctoral thesis on acoustic event detection, several lines of research have been pursued, such as using the…

Fairness in the most popular ASR systems

June 1, 2023 | June 6, 2023 | daniel

Speaker: Pilar Fernández Gallego. Abstract: ASR (Automatic Speech Recognition) systems have improved dramatically, thanks both to advances in deep learning and to the collection of large datasets used to train them. However, it has been demonstrated that some…

VoxCeleb-Spain: design, acquisition and evaluation with deep neural networks of a database of Spanish celebrity voices

May 26, 2023 | May 31, 2023 | daniel

Speaker: Manuel Otero González. Abstract: This work presents a new database, VoxCeleb-Spain, captured following protocols analogous to those of the well-known VoxCeleb database, but using YouTube(TM) videos of celebrities of Spanish nationality. The evaluation of the database through various experiments is also…

GuitarSet: A Dataset for Guitar Transcription

May 19, 2023 | May 22, 2023 | daniel

Speaker: Diego de Benito Gorrón. Abstract: Based on https://guitarset.weebly.com/uploads/1/2/1/6/121620128/xi_ismir_2018.pdf. The guitar is a popular instrument for a variety of reasons, including its ability to produce polyphonic sound and its musical versatility. The resulting variability of sounds, however, poses significant challenges…

Representing evidence for Bayesian updating: compositional evidence, privacy and calibration

May 12, 2023 | May 22, 2023 | daniel

Speaker: Paul-Gauthier Noé. Abstract: Attribute privacy in multimedia technology aims to hide only one or a few personal characteristics, or attributes, of an individual rather than the full identity. To give a few examples, these attributes can be the sex,…

Detection of abnormalities in electrocardiograms with 2 sensors using machine learning

May 5, 2023 | May 8, 2023 | daniel

Speaker: Ana Molina Conesa. Abstract: This talk is based on the Physionet Challenge 2021, in which participants aim to design and implement an algorithm capable of automatically identifying any cardiac abnormalities present in electrocardiogram (ECG) recordings with 12, 6, 4,…

Anomaly detection in 12-lead electrocardiograms using machine learning

April 28, 2023 | May 8, 2023 | daniel

Speaker: Miguel González Rodríguez. Abstract: The Physionet Challenge 2021 is presented. The goal is to classify 27 types of cardiac anomalies from electrocardiograms using convolutional neural networks (CNNs). The challenge database consists of over 30,000 patient records, making it one…

Precomputed Sound Propagation for Virtual Reality & Gaming

April 21, 2023 | May 5, 2023 | daniel

Speaker: Joaquín González Rodríguez. Abstract: This talk is based on: Parametric Wave Field Coding for Precomputed Sound Propagation (ACM Transactions on Graphics, Vol. 33, No. 4, Article 38, July 2014); Parametric Directional Coding for Precomputed Sound Propagation (ACM…

Breath cycle detection in respiratory audios

April 14, 2023 | April 17, 2023 | daniel

Speaker: Miguel Ángel Martínez Pay. Abstract: Neural networks applied to the detection of acoustic events in respiratory audio recordings. Introduction to the ICBHI 2017 database, dedicated to the classification of respiratory cycles into “normal”, “with crackles”, “with wheezes”, and “with both”. Main…
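
The four classes mentioned in the abstract can be read as the combinations of two binary flags per respiratory cycle. The sketch below assumes each cycle is annotated with a start time, an end time, and 0/1 flags for crackles and wheezes; the function name `cycle_label` and the sample rows are hypothetical and are not data from the talk.

```python
def cycle_label(has_crackles: bool, has_wheezes: bool) -> str:
    """Map the two per-cycle flags to one of the four class names."""
    if has_crackles and has_wheezes:
        return "with both"
    if has_crackles:
        return "with crackles"
    if has_wheezes:
        return "with wheezes"
    return "normal"


# Hypothetical annotations: (start_s, end_s, crackles, wheezes).
cycles = [
    (0.0, 2.1, 0, 0),
    (2.1, 4.3, 1, 0),
    (4.3, 6.0, 0, 1),
    (6.0, 8.2, 1, 1),
]
labels = [cycle_label(bool(c), bool(w)) for _, _, c, w in cycles]
```

Treating the task as two independent binary decisions, rather than a single four-way choice, is an alternative design that produces the same four outcomes.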

Author: daniel