Speaker: Doroteo Torre Toledanos. Abstract: Language-based audio retrieval is the task of retrieving audio segments containing sound described in a natural language text. This task was first proposed in a DCASE Challenge in 2022 as a subtask of the audio… Read More
Development of a Guardrail System for Bank Movement Assistant
Speaker: Miguel Ángel Martínez Pay. Abstract: This seminar outlines the process of creating a guardrail for a banking transactions assistant. The guardrail acts as a security system that filters user queries, determining which can be processed by the assistant and… Read More
Neural Discrete Representation Learning Revisited: Applications of VQ-VAE
Speaker: Manuel Fernando Mollón Laorca. Abstract: Since the publication of Neural Discrete Representation Learning in 2018, Vector Quantized Variational Autoencoders (VQ-VAEs) have gained significant attention for their ability to bridge continuous and discrete representations. In particular, their integration with transformer… Read More