Skip to content
Main Menu
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact
AUDIAS-UAM
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact

Category: AUDIAS Seminars

  • Home
  • AUDIAS Seminars
  • Page 12

Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information

May 5, 2022May 19, 2022 daniel

Speaker: Ana Belén Fernández Cordero. Abstract: Air traffic control (ATC) relies on communication via speech between pilot and air-traffic controller (ATCO). The call-sign, as unique identifier for each flight, is used to address a specific pilot by the ATCO. Extracting… Read More

AUDIAS Seminars

AVASpeech-SMAD: A speech and music activity detection database with label co-occurrence

April 28, 2022May 19, 2022 daniel

Speaker: Guillermo Recio Martín. Abstract: AVASpeech is a publicly available dataset created in 2018 to contribute to the task of speech activity detection (SAD) task. This dataset contains three different types of audio segments: clean speech, speech co-occuring with music… Read More

AUDIAS Seminars

Conformer-based sound event detection with semi-supervised learning and data augmentation

April 22, 2022May 19, 2022 daniel

Speaker: Sara Barahona Quirós. Abstract: This paper presents a Conformer-based sound event detection (SED) method, which uses semi-supervised learning and data augmentation. The proposed method employs Conformer, a convolution-augmented Transformer that is able to exploit local features of audio data… Read More

AUDIAS Seminars

Speaker Diarization with Region Proposal Network

April 7, 2022May 19, 2022 daniel

Speaker: Sergio Izquierdo del Álamo. Abstact: Speaker diarization is an important pre-processing step for many speech applications, and it aims to solve the “who spoke when” problem. Although the standard diarization systems can achieve satisfactory results in various scenarios, they… Read More

AUDIAS Seminars

Conversational Agents for Health Care

March 31, 2022May 19, 2022 daniel

Speaker: Giuliano Lazzara. Abstract: Brief that focuses on people’s perception of Conversational Agents and proposes these technologies as a tool to deal with underestimated mental issues such as depression and anxiety. Referring to experiments done with “Woebot”, an automated conversational… Read More

AUDIAS Seminars

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

March 24, 2022May 19, 2022 daniel

Speaker: Sergio Segovia. Abstract: The core idea is to predict latent representations of the full input data based on a masked view of the input in a self-distillation setup using a standard Transformer architecture. Instead of predicting modality-specific targets such… Read More

AUDIAS Seminars

Data Augmentation for Decoupled Calibration of Deep Neural Network Classifiers

March 17, 2022May 19, 2022 daniel

Speaker: Sergio Márquez Carrero. Abstract: Modern Deep Neural Networks (DNN) have significantly outperformed those employed over a decade ago in terms of accuracy. Nonetheless, the outputs generated by these models are poorly calibrated, causing substantial issues in a variety of… Read More

AUDIAS Seminars

Connectionist Temporal Classification (CTC) Speech Segmentation

March 10, 2022March 11, 2022 daniel

Speaker: W. Fernando López Gavilanez. Abstract: Motivated by the lack of high-quality labeled data for specific scenarios, such as emergencies in the home environment, we explored a CTC-segmentation method to generate a specific-purpose speech dataset. The project seeks the quality improvement of… Read More

AUDIAS Seminars

BigSSL: Large-Scale Semi-Supervised Learning for ASR

March 3, 2022March 4, 2022 audias

Speaker: Laura Herrera Abstract: This paper deals with results obtained on very large automatic speaker recognition models.A large amount of labelled data is not always available and sometimes they do not generalize enough. Consequently, the authors propose to use pre-trained… Read More

AUDIAS Seminars

Efficient Neural Approaches for Automatic Speech Recognition

February 24, 2022March 4, 2022 audias

Speaker: Doroteo Torre Toledano Abstract: Many different end-to-end neural approaches have been proposed in the last years in the field of automatic speech recognition (ASR). However, most of the research available compares systems only in terms of accuracy (word error… Read More

AUDIAS Seminars

Posts navigation

Previous 1 … 11 12 13 … 15 Next

AUDIAS Seminars

Fitting Protein Language Models (PLMs) for the prediction of protein functionality using zero-shot and few-shot techniques.

September 15, 2025

Speaker: Juan Antonio Gordillo Gayo. Abstract: The unprecedent success of…

Open science in the service of conservation: An accessible, user-friendly machine learning workflow for automated anuran monitoring in complex Neotropical soundscapes

September 10, 2025

Speaker: Gabriel Bidart Abstract: Amphibian populations worldwide are declining, particularly…

Introduction to Protein Language Models: biological concepts and computational tools

July 9, 2025

Speaker: Juan Antonio Gordillo Gayo. Abstract: Proteins are the main…

Optimization of a Deep Learning Model for DNA Analysis under Hypoxemic Conditions

July 2, 2025

Speaker: Paloma Villanueva Fuster. Abstract: This study focuses on predicting…

About AUDIAS

AUDIAS is a solid research group addressing challenging problems in speech, audio and temporal signals from deep foundations in machine learning and signal processing.

Quick Links

  • Home
  • People
  • Publications
  • Industry
  • Evaluations
  • For Students
  • Contact

Highlights

Fitting Protein Language Models (PLMs) for the prediction of protein functionality using zero-shot and few-shot techniques.

September 15, 2025

Open science in the service of conservation: An accessible, user-friendly machine learning workflow for automated anuran monitoring in complex Neotropical soundscapes

September 10, 2025
Copyright © AUDIAS-UAM All rights reserved.
Education Mind by Axle Themes