Speaker: Manuel Otero Abstract: This master’s thesis addresses the analysis and recognition of emotions in speech, within the framework of the EmoSPeech 2024 challenge. Different approaches to the state of the art are presented, from traditional methods to current models… Read More
Deep Learning Insights Inspired by Reinforcement Learning Research
Speaker: Tamas Endrei. Abstract: Despite deep reinforcement learning being around for more than 10 years, traditional deep learning best practices have largely avoided the field until now. This talk elaborates on deep learning techniques uncovered through RL-motivated research, touching on… Read More
Joint Automatic Speech Recognition And Structure. Learning For Better Speech Understanding
Speaker: María Pilar Fernández Gallego. Abstract: Spoken language understanding (SLU) is a structure prediction task in the field of speech. Recently, many works on SLU that treat it as a sequence-to-sequence task have achieved great success. However, This method is… Read More
A Whisper-based Query-by-Example Spoken Term Detection approach for search on speech
Speaker: Javier Tejedor Noguerales. Abstract: Nowadays, in the digital era, the amount of information stored in audio repositories is undoubtedly growing. This makes necessary the development of efficient and automatic methods to search on audio content. To address it, search… Read More
NIST 2024 Speaker Recognition Evaluation
Speaker: Sara Barahona Quirós. Abstract: In this talk we will present our paritcipation to the NIST 2024 SRE Evaluation in collaboration with Brno University of Technology, Polito, Phonexia, Omilia and CRIM. This evaluation focuses on speaker detection over conversational telephone… Read More