☆11Nov 28, 2025Updated 3 months ago
Alternatives and similar repositories for icefall
Users who are interested in icefall are comparing it to the libraries listed below.
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- Mispronunciation detection code for jingju singing voice☆20Sep 5, 2018Updated 7 years ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Mar 30, 2021Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- A fast parallel implementation of RNN Transducer.☆12Apr 8, 2025Updated 10 months ago
- ☆13Oct 27, 2021Updated 4 years ago
- NMT-based punctuation prediction system using lexical and acoustic features.☆14Mar 30, 2020Updated 5 years ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- ☆16Nov 8, 2020Updated 5 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification"☆34Jun 25, 2021Updated 4 years ago
- ☆15Nov 5, 2021Updated 4 years ago
- Metappearance: Meta-Learning for Visual Appearance Reproduction☆21Sep 19, 2022Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆17Feb 11, 2023Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly.☆16Jun 17, 2022Updated 3 years ago
- Official implementation of A cappella: Audio-visual Singing Voice Separation, from BMVC21☆16May 14, 2022Updated 3 years ago
- Automatic Speech Recognition at the University of Edinburgh.☆16Mar 14, 2021Updated 4 years ago
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Nov 2, 2022Updated 3 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- ☆55Jan 13, 2023Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- ☆25Mar 12, 2022Updated 3 years ago
- Pytorch cpp api examples/practices☆23Mar 2, 2024Updated 2 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Aug 15, 2022Updated 3 years ago
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Aug 2, 2021Updated 4 years ago
- Stochastic gradient descent with model building☆27Feb 15, 2023Updated 3 years ago
- ☆32Jun 26, 2023Updated 2 years ago
- This is an educational repository containing implementation of some search algorithms in Artificial Intelligence.☆26Jul 5, 2019Updated 6 years ago
- A Tiny Project For ASR model training and Deployment☆26Oct 14, 2022Updated 3 years ago