PyTorch code for end-to-end spoken language understanding (SLU) with ASR-based transfer learning
☆231Mar 23, 2021Updated 5 years ago
Alternatives and similar repositories for end-to-end-SLU
Users that are interested in end-to-end-SLU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆24Jun 12, 2023Updated 2 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- RawNet: Fast End-to-End Neural Vocoder☆43May 29, 2019Updated 7 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆939Sep 4, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 7 years ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Problem Agnostic Speech Encoder☆447Jul 6, 2023Updated 2 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,398Mar 14, 2022Updated 4 years ago
- This setup allows to train end-to-end neural models for spoken language understanding (SLU).☆11Jun 12, 2023Updated 2 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Nov 14, 2020Updated 5 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Repository for SLURP paper☆109Apr 20, 2022Updated 4 years ago
- ☆50Feb 13, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A Pytorch implementation for the ZeroSpeech 2019 challenge.☆112Nov 12, 2019Updated 6 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,215Dec 19, 2020Updated 5 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 3 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,240Apr 28, 2021Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- TTS model based on Transformer.☆58Aug 2, 2019Updated 6 years ago
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆37Aug 29, 2023Updated 2 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆810Apr 6, 2023Updated 3 years ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆501Jun 11, 2021Updated 4 years ago
- Vocode spectrograms to audio with generative adversarial networks☆64Aug 8, 2019Updated 6 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆27Dec 4, 2023Updated 2 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆247Oct 30, 2019Updated 6 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆370Oct 12, 2021Updated 4 years ago
- ☆17Aug 27, 2025Updated 9 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆16Mar 26, 2022Updated 4 years ago
- A test bed for updates and new features | pytorch/audio☆171May 17, 2020Updated 6 years ago
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆524Jul 11, 2023Updated 2 years ago
- CMU Wilderness Multilingual Speech Dataset☆292Apr 20, 2019Updated 7 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- A fast parallel implementation of RNN Transducer.☆314Jun 7, 2023Updated 2 years ago
- An implementation of Tacotron and Tacotron2☆80Aug 4, 2021Updated 4 years ago