AUDIAS Seminars – Page 4

The Expected Cost: One Performance Metric to Rule Them All

March 12, 2025June 11, 2025 Adrián Aranda Márquez

Speaker: Daniel Ramos Castro. Abstract: Based on https://openreview.net/forum?id=3mN9QNWArl. Abstract of original paper: “The expected cost (EC) is one of the main classification metrics introduced in statistical and machine learning books. It is based on the assumption that, for a given… Read More

Cybersecurity Today: Attackers and Defenders

March 5, 2025June 11, 2025 Adrián Aranda Márquez

Speaker: Pablo González Escribano. Abstract: As cyber threats continue to evolve at a rapid pace, understanding the tactics, techniques, and procedures (TTPs) employed by attackers is crucial for enhancing defense strategies. In this session, we explored the current landscape of… Read More

Real-time Detection of Synthetic Speech

February 26, 2025June 12, 2025 Pablo González Escribano

Speaker: William Fernando López Gavilánez Abstract: Advances in speech synthesis technology have facilitated numerous beneficial applications. However, they also pose significant threats, especially in the realm of identity spoofing. The study explores the potential of leveraging complex spectrograms for real-time… Read More

Past, present and ¿future? of Scaling Laws for Neural Language Models

February 19, 2025June 12, 2025 Pablo González Escribano

Speaker: Beltrán Labrador Abstract: This presentation examines the scaling laws for neural networks that were foundational to the development of modern, large-scale language models. It revisits the 2020 OpenAI paper that established a key principle: model performance scales predictably with… Read More

Emotion Recognition Based On Speech Analysis For The EmoSPeech 2024 Challenge

February 12, 2025June 12, 2025 Pablo González Escribano

Speaker: Manuel Otero Abstract: This master’s thesis addresses the analysis and recognition of emotions in speech, within the framework of the EmoSPeech 2024 challenge. Different approaches to the state of the art are presented, from traditional methods to current models… Read More

Deep Learning Insights Inspired by Reinforcement Learning Research

February 5, 2025June 11, 2025 Adrián Aranda Márquez

Speaker: Tamas Endrei. Abstract: Despite deep reinforcement learning being around for more than 10 years, traditional deep learning best practices have largely avoided the field until now. This talk elaborates on deep learning techniques uncovered through RL-motivated research, touching on… Read More

Joint Automatic Speech Recognition And Structure. Learning For Better Speech Understanding

January 29, 2025January 30, 2025 Adrián Aranda Márquez

Speaker: María Pilar Fernández Gallego. Abstract: Spoken language understanding (SLU) is a structure prediction task in the field of speech. Recently, many works on SLU that treat it as a sequence-to-sequence task have achieved great success. However, This method is… Read More

A Whisper-based Query-by-Example Spoken Term Detection approach for search on speech

January 22, 2025January 30, 2025 Adrián Aranda Márquez

Speaker: Javier Tejedor Noguerales. Abstract: Nowadays, in the digital era, the amount of information stored in audio repositories is undoubtedly growing. This makes necessary the development of efficient and automatic methods to search on audio content. To address it, search… Read More

NIST 2024 Speaker Recognition Evaluation

January 15, 2025January 30, 2025 Adrián Aranda Márquez

Speaker: Sara Barahona Quirós. Abstract: In this talk we will present our paritcipation to the NIST 2024 SRE Evaluation in collaboration with Brno University of Technology, Polito, Phonexia, Omilia and CRIM. This evaluation focuses on speaker detection over conversational telephone… Read More

Foundational Models for Self-Supervised Speaker Diarization and Target Speaker ASR

December 18, 2024January 30, 2025 Adrián Aranda Márquez

Speaker: Alicia Lozano-Diez. Abstract: In this talk, I will review a few of the last trends in speaker diarization and target speaker ASR. I will present two papers that address these two tasks respectively, and leverage the power of foundational… Read More

Category: AUDIAS Seminars