Skip to content
Main Menu
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact
AUDIAS-UAM
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact

Author: daniel

  • Home
  • daniel
  • Page 10

Conformer-based sound event detection with semi-supervised learning and data augmentation

April 22, 2022May 19, 2022 daniel

Speaker: Sara Barahona Quirós. Abstract: This paper presents a Conformer-based sound event detection (SED) method, which uses semi-supervised learning and data augmentation. The proposed method employs Conformer, a convolution-augmented Transformer that is able to exploit local features of audio data… Read More

AUDIAS Seminars

Speaker Diarization with Region Proposal Network

April 7, 2022May 19, 2022 daniel

Speaker: Sergio Izquierdo del Álamo. Abstact: Speaker diarization is an important pre-processing step for many speech applications, and it aims to solve the “who spoke when” problem. Although the standard diarization systems can achieve satisfactory results in various scenarios, they… Read More

AUDIAS Seminars

Conversational Agents for Health Care

March 31, 2022May 19, 2022 daniel

Speaker: Giuliano Lazzara. Abstract: Brief that focuses on people’s perception of Conversational Agents and proposes these technologies as a tool to deal with underestimated mental issues such as depression and anxiety. Referring to experiments done with “Woebot”, an automated conversational… Read More

AUDIAS Seminars

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

March 24, 2022May 19, 2022 daniel

Speaker: Sergio Segovia. Abstract: The core idea is to predict latent representations of the full input data based on a masked view of the input in a self-distillation setup using a standard Transformer architecture. Instead of predicting modality-specific targets such… Read More

AUDIAS Seminars

Data Augmentation for Decoupled Calibration of Deep Neural Network Classifiers

March 17, 2022May 19, 2022 daniel

Speaker: Sergio Márquez Carrero. Abstract: Modern Deep Neural Networks (DNN) have significantly outperformed those employed over a decade ago in terms of accuracy. Nonetheless, the outputs generated by these models are poorly calibrated, causing substantial issues in a variety of… Read More

AUDIAS Seminars

Connectionist Temporal Classification (CTC) Speech Segmentation

March 10, 2022March 11, 2022 daniel

Speaker: W. Fernando López Gavilanez. Abstract: Motivated by the lack of high-quality labeled data for specific scenarios, such as emergencies in the home environment, we explored a CTC-segmentation method to generate a specific-purpose speech dataset. The project seeks the quality improvement of… Read More

AUDIAS Seminars

Posts navigation

Previous 1 … 9 10

AUDIAS Seminars

Summary of the CIFE 2026 (Congreso Internacional de Fonética Experimental)

January 22, 2026

Speaker: Manuel Otero González. Abstract: This presentation explained the development…

A Perceptually-Optimised Hybrid Acoustic Simulation for Interactive Environments

December 11, 2025

Speaker: Maya Tia Kanani. Abstract: Accurate simulation of sound propagation…

Transformer Embeddings Enable Accurate Forensic Classification of Soil DNA

November 27, 2025

Speaker: Manuel Fernando Mollón Laorca. Abstract: In this study, we…

Automatic Speech Recognition and Speaker Diarization in Spanish: Dialectal Varieties and Rural Speech

November 20, 2025

Speaker: Laura Herrera Alarcón Abstract: Audio databases provide essential linguistic…

About AUDIAS

AUDIAS is a solid research group addressing challenging problems in speech, audio and temporal signals from deep foundations in machine learning and signal processing.

Quick Links

  • Home
  • People
  • Publications
  • Industry
  • Evaluations
  • For Students
  • Contact

Highlights

Summary of the CIFE 2026 (Congreso Internacional de Fonética Experimental)

January 22, 2026

A Perceptually-Optimised Hybrid Acoustic Simulation for Interactive Environments

December 11, 2025
Copyright © AUDIAS-UAM All rights reserved.
Education Mind by Axle Themes