A fully convolution-network for speech-to-text, built on pytorch.
☆126May 20, 2020Updated 5 years ago
Alternatives and similar repositories for wav2letter.pytorch
Users that are interested in wav2letter.pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- automatically align transcribed audio and generate a wav2letter training corpus☆36Apr 11, 2023Updated 2 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆87Dec 11, 2018Updated 7 years ago
- Automatic Speech Recognition☆20Aug 24, 2018Updated 7 years ago
- Facebook AI Research Automatic Speech Recognition Toolkit☆23Mar 13, 2021Updated 5 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- speech-to-text in pytorch☆82Mar 14, 2019Updated 7 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- PyTorch end-to-end speech recognition☆49Dec 30, 2020Updated 5 years ago
- Speech-to-text based on wav2letter built for transfer learning☆98Oct 21, 2022Updated 3 years ago
- A PyTorch Implementation of End-to-End Models for Speech-to-Text☆769Jul 6, 2023Updated 2 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)☆32May 14, 2024Updated last year
- Speech Recognition using DeepSpeech2.☆2,140Dec 13, 2022Updated 3 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Jul 6, 2023Updated 2 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Apr 15, 2020Updated 5 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆30Oct 3, 2023Updated 2 years ago
- Attention based aspect extraction via pytorch☆14Jun 8, 2020Updated 5 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- An opensource speech-to-text software written in tensorflow☆160Oct 15, 2022Updated 3 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Collaborative audio annotation tool☆18Sep 16, 2022Updated 3 years ago
- ☆22Aug 29, 2019Updated 6 years ago
- Bias Tests for Voice Technologies (bt4vt)☆11Jun 16, 2024Updated last year
- Listen Attend and Spell (LAS) implement in pytorch☆60Sep 4, 2018Updated 7 years ago
- ASR with PyTorch☆140Mar 10, 2019Updated 7 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 7 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model☆25Feb 27, 2020Updated 6 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15May 19, 2020Updated 5 years ago
- WaveGlow vocoder with VQVAE☆61Jun 18, 2019Updated 6 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆940Sep 4, 2024Updated last year
- Keyword Search Recipe for Subword ASR☆30Jul 12, 2019Updated 6 years ago