Facebook AI Research Automatic Speech Recognition Toolkit
☆23Mar 13, 2021Updated 5 years ago
Alternatives and similar repositories for wav2letter
Users that are interested in wav2letter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- automatically align transcribed audio and generate a wav2letter training corpus☆36Apr 11, 2023Updated 3 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- speech engine training projects☆29Apr 19, 2021Updated 5 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 5 months ago
- https://challenge.enliple.com/☆16Jun 10, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Baidu's DeepSpeech updated for better training☆23Sep 5, 2018Updated 7 years ago
- A fully convolution-network for speech-to-text, built on pytorch.☆126May 20, 2020Updated 5 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- Dereverberation of Speech Signals Using Weighted Prediction Error☆23May 17, 2019Updated 6 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Jul 6, 2023Updated 2 years ago
- TTS model based on Transformer.☆58Aug 2, 2019Updated 6 years ago
- speaker diarization system using an LSTM☆23Jan 4, 2023Updated 3 years ago
- ☆12Jul 15, 2016Updated 9 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The performance of turbo equalizers in both ISI channel and multipath fading channel is evaluated☆11Nov 24, 2020Updated 5 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Control the mouse using a keyboard or speech recognition on Linux☆12Jul 11, 2019Updated 6 years ago
- The History of Speech Recognition to the Year 2030☆13Aug 14, 2021Updated 4 years ago
- Pytorch Text GAN for lyrics generation☆10Apr 13, 2019Updated 7 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- A MATLAB function library containing encoders, decoders and weight enumerators for Reed-Muller codes.☆11Aug 19, 2023Updated 2 years ago
- Yet another dependency parser, integrated with tokenizer, tagger and visualization tool.☆11Mar 18, 2018Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆11Nov 6, 2024Updated last year
- This is the official code used for WAT 2017 Description Paper titled A Bag of Useful Tricks for Practical Neural Machine Translation: Emb…☆12Oct 24, 2017Updated 8 years ago
- ☆14Nov 22, 2022Updated 3 years ago
- experimental naru programming language implementation☆10Jan 3, 2023Updated 3 years ago
- An application of stacked denoising autoencoders to multi-modal (images and audio) abstract feature discovery☆12Oct 23, 2013Updated 12 years ago
- ☆10Aug 3, 2020Updated 5 years ago
- Toolkit for training/adapting CMU Sphinx acoustic models☆17May 25, 2018Updated 7 years ago
- An open-access corpus of conversational bilingual speech in Cantonese and English☆40Apr 28, 2022Updated 3 years ago
- Control your computer by voice!☆13Dec 8, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Oct 12, 2022Updated 3 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- Neural model for prediction of stress position in Russian words☆13Jun 22, 2025Updated 9 months ago
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 6 months ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Nov 25, 2024Updated last year
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated 3 months ago