assafmu / wav2letter_pytorchView external linksLinks
An implementation of the Wav2Letter Speech-to-Text model using PyTorch.
☆14Mar 8, 2023Updated 2 years ago
Alternatives and similar repositories for wav2letter_pytorch
Users that are interested in wav2letter_pytorch are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- ☆16Sep 12, 2019Updated 6 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Paderbox: A collection of utilities for audio / speech processing☆43Jul 21, 2025Updated 6 months ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Apr 8, 2024Updated last year
- ☆17Apr 14, 2023Updated 2 years ago
- A neural language modeling toolkit built on PyTorch☆19Mar 17, 2023Updated 2 years ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆21Jan 26, 2020Updated 6 years ago
- [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords☆18Nov 30, 2024Updated last year
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- Code to reproduce the experiments in the paper "Fast and stable blind source separation with rank-1 updates" presented at ICASSP 2020.☆21Apr 14, 2020Updated 5 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Jul 25, 2024Updated last year
- ☆17Aug 27, 2025Updated 5 months ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆22Dec 8, 2022Updated 3 years ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Oct 6, 2022Updated 3 years ago
- ☆24Mar 13, 2020Updated 5 years ago
- Baseline convolutional ASR system in PyTorch☆21Nov 16, 2023Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…☆28Nov 23, 2023Updated 2 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago
- ☆21Aug 29, 2019Updated 6 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".☆29Sep 20, 2021Updated 4 years ago
- Data Science Utils: Frequently Used Methods for Data Science☆37Updated this week
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- <In Development> Transformers for Keras that support sklearn's .fit .predict .☆30Jun 23, 2020Updated 5 years ago
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 2 years ago
- Da - ECHO - RetrievAl - daTasEt☆34Jul 7, 2024Updated last year
- ☆32Jul 27, 2022Updated 3 years ago
- ☆64Aug 14, 2023Updated 2 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆63Oct 15, 2019Updated 6 years ago