Skip to content
Main Menu
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact
AUDIAS-UAM
  • Home
  • Seminars
  • People
    • Staff
    • Ph.D. Students
    • Students
    • AUDIAS Alumni
  • Publications
  • Projects & Industry
    • Projects
    • Industry
  • Evaluations
  • For Students
    • Master’s Thesis Proposals
    • Study Grants and Opportunities
    • Teaching
  • Contact

Year: 2024

  • Home
  • 2024
  • Page 2

One model to rule them all? Towards end-to-end joint speaker diarization and speech recognition

October 9, 2024July 9, 2025 Adrián Aranda Márquez

Speaker: Laura Herrera Alarcón. Abstract: This paper presents a novel framework for joint speaker diarization (SD) and automatic speech recognition (ASR), named SLIDAR (sliding-window diarization-augmented recognition). SLIDAR can process arbitrary length inputs and can handle any number of speakers, effectively… Read More

AUDIAS Seminars

Emotion recognition in Spanish audio

October 2, 2024October 1, 2024 daniel

Speaker: Manuel Otero González. Abstract: En esta charla se explicará la tarea de reconocimiento de emociones en audios en español, presentando los enfoques más avanzados del estado del arte, como Wav2Vec2 y W2V-Bert. Además, se introducirá el reto EmoSPeech, cuyo… Read More

AUDIAS Seminars

State of the Art in Sound Event Detection and DCASE Evaluations

September 25, 2024September 30, 2024 daniel

Speaker: Doroteo Torre Toledano. Abstract: In this talk I will review the most recent trajectory of the AUDIAS group in the field of Sound Event Detection (SED), highlighting our participations in DCASE evaluations (Task 4) from 2020 to 2023. Then,… Read More

AUDIAS Seminars

Large Language Models: From Theory to Practice in Text Classification

September 18, 2024September 18, 2024 daniel

Speaker: Miguel Ángel Martínez Pay. Abstract: This work presents a comprehensive overview of Large Language Models (LLMs), from their theoretical framework to practical applications in text classification. It compares the effectiveness of two key approaches: fine-tuning embeddings of smaller models… Read More

AUDIAS Seminars

Integration of Emotional Information in Speaker Recognition Systems

September 11, 2024September 11, 2024 daniel

Speaker: Arturo Domínguez Santos. Abstract: This Master’s thesis addresses the challenge of investigating how emotions affect speakerverification and proposes a system that integrates this emotional variability to try toimprove accuracy. The focus is on the speaker’s emotions, which has traditionally… Read More

AUDIAS Seminars

Exploring Speech Foundation Models for End-to-End Speaker Diarization

September 4, 2024September 11, 2024 daniel

Speaker: Laura Herrera Alarcón. Abstract: In this Master’s Thesis the use of pre-trained models for the diarization task has beenstudied in order to exploit their ability to extract robust and discriminative features.In particular, the WavLM model has been combined with… Read More

AUDIAS Seminars

Interpretation of fingerprint evidence with likelihood ratios (LRs – Likelihood ratios)

July 3, 2024September 13, 2024 daniel

Speaker: Joaquín González Rodríguez. Abstract: The forensic fingerprint identification process based on the ACE-V method, widely implemented, makes absolute identification or exclusion decisions that depend on opinions that vary from expert to expert (for example, whether we consider an observed… Read More

AUDIAS Seminars

SGE & CCC Architecture – Introduction for Beginners

June 26, 2024September 18, 2024 daniel

Speaker: Adrián Aranda Marcos. Abstract: Simple technical introduction for using SGE with AUDIAS servers (Son of Grid Engine) and CCC (UAM’s Central Computing Center).

AUDIAS Seminars

Stabilising Reinforcement Learning with Past Action-State Representation Learning

June 18, 2024June 25, 2024 audias

Speaker: Tamas Endrei. Abstract: Although deep reinforcement learning (DRL) deals with sequential decision-making problems, temporal information representation is absent from state-of-the-art actor-critic algorithms. The reliance on only the current timestep information causes instability in concurrent actions. Furthermore, the over-reliance on… Read More

AUDIAS Seminars

Study of the predictive capacity of the efficacy of platelet-rich plasma (PRP) treatments in joint injuries

June 11, 2024September 18, 2024 daniel

Speaker: Berta Caunedo Castro. Abstract: This Final Degree Project evaluates Platelet-Rich Plasma (PRP) therapy as an alternative to traditional treatments for knee osteoarthritis, a prevalent joint condition. PRP uses regenerative growth factors from the patient’s blood, but its variability complicates… Read More

AUDIAS Seminars

Posts navigation

Previous 1 2 3 4 Next

AUDIAS Seminars

YOLO-based Transfer Learning for Acoustic Event Detection using Visual Object Detection Techniques

October 9, 2025

Speaker: Sergio Segovia González. Abstract: Traditional SED approaches are based…

Auditory General Intelligence (JSALT-2025)

September 25, 2025

Speaker: Laura Herrera Alarcón. Abstract: The emergence of Large Audio…

Fitting Protein Language Models (PLMs) for the prediction of protein functionality using zero-shot and few-shot techniques.

September 15, 2025

Speaker: Juan Antonio Gordillo Gayo. Abstract: The unprecedent success of…

Open science in the service of conservation: An accessible, user-friendly machine learning workflow for automated anuran monitoring in complex Neotropical soundscapes

September 10, 2025

Speaker: Gabriel Bidart Abstract: Amphibian populations worldwide are declining, particularly…

About AUDIAS

AUDIAS is a solid research group addressing challenging problems in speech, audio and temporal signals from deep foundations in machine learning and signal processing.

Quick Links

  • Home
  • People
  • Publications
  • Industry
  • Evaluations
  • For Students
  • Contact

Highlights

YOLO-based Transfer Learning for Acoustic Event Detection using Visual Object Detection Techniques

October 9, 2025

Auditory General Intelligence (JSALT-2025)

September 25, 2025
Copyright © AUDIAS-UAM All rights reserved.
Education Mind by Axle Themes