Skip to content
Main Menu
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact
AUDIAS-UAM
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact

Category: AUDIAS Seminars

  • Home
  • AUDIAS Seminars
  • Page 13

Connectionist Temporal Classification (CTC) Speech Segmentation

March 10, 2022March 11, 2022 daniel

Speaker: W. Fernando López Gavilanez. Abstract: Motivated by the lack of high-quality labeled data for specific scenarios, such as emergencies in the home environment, we explored a CTC-segmentation method to generate a specific-purpose speech dataset. The project seeks the quality improvement of… Read More

AUDIAS Seminars

BigSSL: Large-Scale Semi-Supervised Learning for ASR

March 3, 2022March 4, 2022 audias

Speaker: Laura Herrera Abstract: This paper deals with results obtained on very large automatic speaker recognition models.A large amount of labelled data is not always available and sometimes they do not generalize enough. Consequently, the authors propose to use pre-trained… Read More

AUDIAS Seminars

Efficient Neural Approaches for Automatic Speech Recognition

February 24, 2022March 4, 2022 audias

Speaker: Doroteo Torre Toledano Abstract: Many different end-to-end neural approaches have been proposed in the last years in the field of automatic speech recognition (ASR). However, most of the research available compares systems only in terms of accuracy (word error… Read More

AUDIAS Seminars

Structured Output Learning

February 10, 2022March 4, 2022 audias

Speaker: María Pilar Fernández Rodríguez Abstract: Speech applications dealing with conversations require not only recognizing the spoken words, but also determining who spoke when, the language, punctuation, capitalization… To deal with it, it is typically addressed by merging the outputs… Read More

AUDIAS Seminars

Voxceleb Experiment: fairness

January 27, 2022March 4, 2022 audias

Speaker: Almudena Aguilera Abstract: The experiment is based on the dataset from Voxceleb [1], using the two pre-trained models. The main idea of these experiments was to study the fairness problems in different demographic groups present in the data base… Read More

AUDIAS Seminars

Semi-Supervised Music Tagging Transformer

December 16, 2021March 4, 2022 audias

Speaker: David Martín Abstract: Music Tagging Transformer (MTT) was recently released in the latest ISMIR 2021 Conference as one of the most erupting deep learning approaches for Music Information Retrieval. It consists of a semi-supervised approach where the model captures… Read More

AUDIAS Seminars

Encoder-Decoder Based Attractor Calculation for End-to-End Neural Diarization

December 9, 2021March 7, 2022 audias

Speaker: Alicia Lozano Díez Abstract: In this talk, we will deeply review the algorithms behind end-to-end systems for speaker diarization based on neural networks. In particular, we will describe how the encoder-decoder part of the model calculates “attractors” that capture… Read More

AUDIAS Seminars

Unsupervised Sound Separation Using Mixture Invariant Training

November 18, 2021March 7, 2022 audias

Speaker: Diego de Benito Gorrón Abstract: In recent years, rapid progress has been made on the problem of single-channel sound separation using supervised training of deep neural networks. In such supervised approaches, a model is trained to predict the component… Read More

AUDIAS Seminars

relMix: An open source software for DNA mixtures with related contributors

November 11, 2021March 7, 2022 audias

Speaker: Elías Hernández Abstract: La prueba de ADN ha supuesto un gran avance en el contexto judicial y muchas veces es considerada como la prueba definitiva para condenar o absolver a un acusado. Los resultados de una prueba de ADN… Read More

AUDIAS Seminars

Improving Fairness in Speaker Recognition

November 4, 2021March 8, 2022 audias

Speaker: Almudena Aguilera Abstract: Speaker Recognition Systems aim to automatically recognize the identity of an individual from a recording of his/her speech or voice. Despite the progress of these systems in terms of accuracy, we must ask ourselves: “What happen… Read More

AUDIAS Seminars

Posts navigation

Previous 1 … 12 13 14 15 Next

AUDIAS Seminars

Calibration and Fusion of End-to-End Neural Diarization Models: A Comprehensive Framework

October 16, 2025

Speaker: Sergio Álvarez Balanya Abstract: End-to-End Neural Diarization (EEND) systems…

YOLO-based Transfer Learning for Acoustic Event Detection using Visual Object Detection Techniques

October 9, 2025

Speaker: Sergio Segovia González. Abstract: Traditional SED approaches are based…

Auditory General Intelligence (JSALT-2025)

September 25, 2025

Speaker: Laura Herrera Alarcón. Abstract: The emergence of Large Audio…

Fitting Protein Language Models (PLMs) for the prediction of protein functionality using zero-shot and few-shot techniques.

September 15, 2025

Speaker: Juan Antonio Gordillo Gayo. Abstract: The unprecedent success of…

About AUDIAS

AUDIAS is a solid research group addressing challenging problems in speech, audio and temporal signals from deep foundations in machine learning and signal processing.

Quick Links

  • Home
  • People
  • Publications
  • Industry
  • Evaluations
  • For Students
  • Contact

Highlights

Calibration and Fusion of End-to-End Neural Diarization Models: A Comprehensive Framework

October 16, 2025

YOLO-based Transfer Learning for Acoustic Event Detection using Visual Object Detection Techniques

October 9, 2025
Copyright © AUDIAS-UAM All rights reserved.
Education Mind by Axle Themes