☆11Mar 4, 2026Updated 3 weeks ago
Alternatives and similar repositories for icefall
Users that are interested in icefall are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 5 years ago
- Mispronunciation detection code for jingju singing voice☆20Sep 5, 2018Updated 7 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Myanmar lexicon analyzer - Sorting and Segmentation☆10Aug 11, 2021Updated 4 years ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- 一键将视频转换为优质小红书笔记,自动优化内容和配图;追加了可以读取本地视频的功能☆12Dec 22, 2024Updated last year
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Mar 30, 2021Updated 4 years ago
- Myanmar consonant and vowel audio files that I recorded at University of Computer Studies Banmaw☆11Mar 2, 2019Updated 7 years ago
- A fast parallel implementation of RNN Transducer.☆12Apr 8, 2025Updated 11 months ago
- An implementation of MeloTTS by onnxruntime☆29Oct 27, 2024Updated last year
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆17Feb 11, 2023Updated 3 years ago
- Transformer based ASR Engine.☆13Aug 23, 2021Updated 4 years ago
- Metappearance: Meta-Learning for Visual Appearance Reproduction☆21Sep 19, 2022Updated 3 years ago
- ☆16Nov 8, 2020Updated 5 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- ☆27Mar 9, 2023Updated 3 years ago
- ☆11Oct 24, 2022Updated 3 years ago
- c# library for decoding K2 transducer Models,used in speech recognition (ASR)☆13Aug 20, 2025Updated 7 months ago
- ☆29Aug 8, 2024Updated last year
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- ☆14Jan 31, 2023Updated 3 years ago
- ☆15Nov 5, 2021Updated 4 years ago
- Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21☆16May 14, 2022Updated 3 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Bits and bobs for making and checking Myanmar fonts☆11Feb 2, 2026Updated last month
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…☆11Jul 5, 2023Updated 2 years ago
- c# wrapper for kaldi-native-fbank,used to extract audio features in speech recognition (ASR) task☆10Jul 26, 2025Updated 7 months ago
- A reliable, beautiful and powerful markdown plug-in for WordPress, supporting editing and rendering☆13Apr 29, 2023Updated 2 years ago
- pytorch code for sound event localization and classification☆13Aug 12, 2021Updated 4 years ago
- v4l2 to oepncv mat,surport V4L2_PIX_FMT_YUYV,V4L2_PIX_FMT_MJPEG,V4L2_PIX_FMT_NV12,V4L2_PIX_FMT_YVU420,V4L2_PIX_FMT_YUV420☆31Dec 10, 2019Updated 6 years ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Aug 9, 2024Updated last year
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- Automatic Speech Recognition at the University of Edinburgh.☆16Mar 14, 2021Updated 5 years ago
- ☆55Jan 13, 2023Updated 3 years ago