Speaker: Javier Galán Fernández. Abstract: Cardiovascular diseases are the leading cause of death worldwide, accounting for 32% of all deaths recorded each year. The 2016 PhysioNet Challenge aimed to encourage the development of algorithms to classify heart…
How speaker diarization evolved recently: from clustering to end-to-end approaches
Speaker: Alicia Lozano Díez. Abstract: Speaker diarization systems aim to segment a multi-speaker audio recording according to speaker changes, providing the timestamps of each speaker's activity, including segments where nobody speaks and those where more than one…
VoxCeleb-Spain: Design, Acquisition and Preliminary Evaluation
Speaker: Manuel Otero González. Abstract: A description of VoxCeleb and its latest Challenges (2019-2022), the design and collection of an audio database of Spanish celebrities, and a preliminary evaluation of a pre-trained system on the acquired data.
MusicLM: Generating music from text
Speaker: Laura Herrera Alarcón. Abstract: Based on https://arxiv.org/pdf/2301.11325.pdf. This paper presents a new model for generating high-fidelity music from text descriptions. It combines SoundStream, w2v-BERT and MuLan, three models that make it possible to obtain temporal coherence and high-quality audio of…