Skip to content
Main Menu
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact
AUDIAS-UAM
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact

Author: daniel

  • Home
  • daniel
  • Page 9

Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information

May 5, 2022May 19, 2022 daniel

Speaker: Ana Belén Fernández Cordero. Abstract: Air traffic control (ATC) relies on communication via speech between pilot and air-traffic controller (ATCO). The call-sign, as unique identifier for each flight, is used to address a specific pilot by the ATCO. Extracting… Read More

AUDIAS Seminars

AVASpeech-SMAD: A speech and music activity detection database with label co-occurrence

April 28, 2022May 19, 2022 daniel

Speaker: Guillermo Recio Martín. Abstract: AVASpeech is a publicly available dataset created in 2018 to contribute to the task of speech activity detection (SAD) task. This dataset contains three different types of audio segments: clean speech, speech co-occuring with music… Read More

AUDIAS Seminars

Sergio Álvarez Balanya selected for an intership in Amazon

April 23, 2022June 24, 2022 daniel

Sergio Álvarez Balanya has been recently selected for a summer internship at Amazon, Barcelona, Spain. He will be starting the internship in June and returning to Madrid in December.

Highlights, News & Events

Conformer-based sound event detection with semi-supervised learning and data augmentation

April 22, 2022May 19, 2022 daniel

Speaker: Sara Barahona Quirós. Abstract: This paper presents a Conformer-based sound event detection (SED) method, which uses semi-supervised learning and data augmentation. The proposed method employs Conformer, a convolution-augmented Transformer that is able to exploit local features of audio data… Read More

AUDIAS Seminars

Speaker Diarization with Region Proposal Network

April 7, 2022May 19, 2022 daniel

Speaker: Sergio Izquierdo del Álamo. Abstact: Speaker diarization is an important pre-processing step for many speech applications, and it aims to solve the “who spoke when” problem. Although the standard diarization systems can achieve satisfactory results in various scenarios, they… Read More

AUDIAS Seminars

Conversational Agents for Health Care

March 31, 2022May 19, 2022 daniel

Speaker: Giuliano Lazzara. Abstract: Brief that focuses on people’s perception of Conversational Agents and proposes these technologies as a tool to deal with underestimated mental issues such as depression and anxiety. Referring to experiments done with “Woebot”, an automated conversational… Read More

AUDIAS Seminars

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

March 24, 2022May 19, 2022 daniel

Speaker: Sergio Segovia. Abstract: The core idea is to predict latent representations of the full input data based on a masked view of the input in a self-distillation setup using a standard Transformer architecture. Instead of predicting modality-specific targets such… Read More

AUDIAS Seminars

Data Augmentation for Decoupled Calibration of Deep Neural Network Classifiers

March 17, 2022May 19, 2022 daniel

Speaker: Sergio Márquez Carrero. Abstract: Modern Deep Neural Networks (DNN) have significantly outperformed those employed over a decade ago in terms of accuracy. Nonetheless, the outputs generated by these models are poorly calibrated, causing substantial issues in a variety of… Read More

AUDIAS Seminars

Connectionist Temporal Classification (CTC) Speech Segmentation

March 10, 2022March 11, 2022 daniel

Speaker: W. Fernando López Gavilanez. Abstract: Motivated by the lack of high-quality labeled data for specific scenarios, such as emergencies in the home environment, we explored a CTC-segmentation method to generate a specific-purpose speech dataset. The project seeks the quality improvement of… Read More

AUDIAS Seminars

Posts navigation

Previous 1 … 8 9

AUDIAS Seminars

Titans: Learning to Memorize at Test Time

October 23, 2025

Speaker: Adrián Aranda Márquez. Abstract: This presentation provides an in-depth…

Calibration and Fusion of End-to-End Neural Diarization Models: A Comprehensive Framework

October 16, 2025

Speaker: Sergio Álvarez Balanya Abstract: End-to-End Neural Diarization (EEND) systems…

YOLO-based Transfer Learning for Acoustic Event Detection using Visual Object Detection Techniques

October 9, 2025

Speaker: Sergio Segovia González. Abstract: Traditional SED approaches are based…

Auditory General Intelligence (JSALT-2025)

September 25, 2025

Speaker: Laura Herrera Alarcón. Abstract: The emergence of Large Audio…

About AUDIAS

AUDIAS is a solid research group addressing challenging problems in speech, audio and temporal signals from deep foundations in machine learning and signal processing.

Quick Links

  • Home
  • People
  • Publications
  • Industry
  • Evaluations
  • For Students
  • Contact

Highlights

Titans: Learning to Memorize at Test Time

October 23, 2025

Calibration and Fusion of End-to-End Neural Diarization Models: A Comprehensive Framework

October 16, 2025
Copyright © AUDIAS-UAM All rights reserved.
Education Mind by Axle Themes