2022 – Page 3 – AUDIAS-UAM

Article review: “Objectifying evidence evaluation for gunshot residue comparisons using machine learning on criminal case data”

May 19, 2022May 19, 2022 daniel

Speaker: Daniel Ramos Castro Abstract: Basado en https://doi.org/10.1016/j.forsciint.2022.111293. “Comparative gunshot residue analysis addresses relevant forensic questions such as ‘did suspect X fire shot Y?’. More formally, it weighs the evidence for hypotheses of the form H1: gunshot residue particles found… Read More

Assessing Calibration in the regression setting

May 12, 2022May 19, 2022 daniel

Speaker: Sergio Álvarez Balanya. Abstract: Calibration is a desirable property of pattern recognition systems, especially when their predictions are going to be used to make decisions. In our group, we are used to dealing with calibration in classification tasks such… Read More

Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information

May 5, 2022May 19, 2022 daniel

Speaker: Ana Belén Fernández Cordero. Abstract: Air traffic control (ATC) relies on communication via speech between pilot and air-traffic controller (ATCO). The call-sign, as unique identifier for each flight, is used to address a specific pilot by the ATCO. Extracting… Read More

AVASpeech-SMAD: A speech and music activity detection database with label co-occurrence

April 28, 2022May 19, 2022 daniel

Speaker: Guillermo Recio Martín. Abstract: AVASpeech is a publicly available dataset created in 2018 to contribute to the task of speech activity detection (SAD) task. This dataset contains three different types of audio segments: clean speech, speech co-occuring with music… Read More

Sergio Álvarez Balanya selected for an intership in Amazon

April 23, 2022June 24, 2022 daniel

Sergio Álvarez Balanya has been recently selected for a summer internship at Amazon, Barcelona, Spain. He will be starting the internship in June and returning to Madrid in December.

Conformer-based sound event detection with semi-supervised learning and data augmentation

April 22, 2022May 19, 2022 daniel

Speaker: Sara Barahona Quirós. Abstract: This paper presents a Conformer-based sound event detection (SED) method, which uses semi-supervised learning and data augmentation. The proposed method employs Conformer, a convolution-augmented Transformer that is able to exploit local features of audio data… Read More

Speaker Diarization with Region Proposal Network

April 7, 2022May 19, 2022 daniel

Speaker: Sergio Izquierdo del Álamo. Abstact: Speaker diarization is an important pre-processing step for many speech applications, and it aims to solve the “who spoke when” problem. Although the standard diarization systems can achieve satisfactory results in various scenarios, they… Read More

Conversational Agents for Health Care

March 31, 2022May 19, 2022 daniel

Speaker: Giuliano Lazzara. Abstract: Brief that focuses on people’s perception of Conversational Agents and proposes these technologies as a tool to deal with underestimated mental issues such as depression and anxiety. Referring to experiments done with “Woebot”, an automated conversational… Read More

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

March 24, 2022May 19, 2022 daniel

Speaker: Sergio Segovia. Abstract: The core idea is to predict latent representations of the full input data based on a masked view of the input in a self-distillation setup using a standard Transformer architecture. Instead of predicting modality-specific targets such… Read More

Data Augmentation for Decoupled Calibration of Deep Neural Network Classifiers

March 17, 2022May 19, 2022 daniel

Speaker: Sergio Márquez Carrero. Abstract: Modern Deep Neural Networks (DNN) have significantly outperformed those employed over a decade ago in terms of accuracy. Nonetheless, the outputs generated by these models are poorly calibrated, causing substantial issues in a variety of… Read More

Year: 2022