lorenlugosch / end-to-end-SLUView external linksLinks
PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning
☆230Mar 23, 2021Updated 4 years ago
Alternatives and similar repositories for end-to-end-SLU
Users that are interested in end-to-end-SLU are comparing it to the libraries listed below
Sorting:
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 4 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆942Sep 4, 2024Updated last year
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Problem Agnostic Speech Encoder☆446Jul 6, 2023Updated 2 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Nov 12, 2019Updated 6 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,395Mar 14, 2022Updated 3 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,210Dec 19, 2020Updated 5 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆24Jun 12, 2023Updated 2 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Repository for SLURP paper☆108Apr 20, 2022Updated 3 years ago
- TTS model based on Transformer.☆58Aug 2, 2019Updated 6 years ago
- Sound Related Deep Learning Tasks boosting repository with pytorch☆88Jul 25, 2024Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆246Oct 30, 2019Updated 6 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Aug 8, 2019Updated 6 years ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆520Jul 11, 2023Updated 2 years ago
- ☆50Feb 13, 2022Updated 4 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,232Apr 28, 2021Updated 4 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- CMU Wilderness Multilingual Speech Dataset☆291Apr 20, 2019Updated 6 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆367Oct 12, 2021Updated 4 years ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆499Jun 11, 2021Updated 4 years ago
- ☆17Aug 27, 2025Updated 5 months ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- Demos, pretrained models, and (WIP) code supporting Representation Mixing☆51Dec 18, 2018Updated 7 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆809Apr 6, 2023Updated 2 years ago
- Tools for ASR Corpus Generation from Online Video☆140Feb 10, 2019Updated 7 years ago
- A fast parallel implementation of RNN Transducer.☆314Jun 7, 2023Updated 2 years ago