Speaker: Beltrán Labrador Serrano Abstract: This presentation describes the system submitted by the AUDIAS-UAM team for the Albayzin 2020 Speech to Text Challenge. Our system is an end to end Transformer-based system built using ESPnet Toolkit. The acoustic model is… Read More
Multi-resolution Sound Event Detection
Speaker: Diego de Benito Gorrón Abstract: The Sound Event Detection task aims to determine the temporal locations of acoustic events in audio clips. Over the recent years, this field is holding a rising relevance due to the introduction of datasets… Read More