csukuangfj/icefall

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/csukuangfj/icefall)

csukuangfj / icefall

☆11

Alternatives and similar repositories for icefall

Users that are interested in icefall are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MiuLab / Lattice-Transformer-SLU
View on GitHub
Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"
☆10Jul 8, 2020Updated 6 years ago
ondrejklejch / acoustic_punctuation
View on GitHub
NMT based punctuation prediction system using lexical and acoustic features .
☆14Mar 30, 2020Updated 6 years ago
ronggong / mispronunciation-detection
View on GitHub
Mispronunciation detection code for jingju singing voice
☆19Sep 5, 2018Updated 7 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
EZ-VC / EZ-VC
View on GitHub
[EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion
☆43Sep 9, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
wengzhiwen / xhs_note_generator
View on GitHub
一键将视频转换为优质小红书笔记，自动优化内容和配图；追加了可以读取本地视频的功能
☆12Dec 22, 2024Updated last year
minthanthtoo / myanmar-collation-stats
View on GitHub
Myanmar lexicon analyzer - Sorting and Segmentation
☆10Aug 11, 2021Updated 4 years ago
monatis / asr-annotation-bot
View on GitHub
Simple Telegram bot to annotate and varify automatic speech recognition datasets
☆12Mar 30, 2021Updated 5 years ago
Enescigdem / SignLanguageRecognizer
View on GitHub
☆16Nov 8, 2020Updated 5 years ago
season-studio / MeloTTS-ONNX
View on GitHub
An implementation of MeloTTS by onnxruntime
☆30Oct 27, 2024Updated last year
b-flo / warp-transducer
View on GitHub
A fast parallel implementation of RNN Transducer.
☆12Apr 8, 2025Updated last year
ye-kyaw-thu / Spectrograms-of-Myanmar-Speech
View on GitHub
Myanmar consonant and vowel audio files that I recorded at University of Computer Studies Banmaw
☆11Mar 2, 2019Updated 7 years ago
MiuLab / Lattice-ELMo
View on GitHub
Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"
☆18Feb 11, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
fanlu / wenet
View on GitHub
Transformer based ASR Engine.
☆13Aug 23, 2021Updated 4 years ago
kaistmm / voxceleb-disentangler
View on GitHub
[INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…
☆18Jul 23, 2024Updated 2 years ago
mfischer-ucl / metappearance
View on GitHub
Metappearance: Meta-Learning for Visual Appearance Reproduction
☆22Sep 19, 2022Updated 3 years ago
ShiningLab / POS-Tagger-for-Punctuation-Restoration
View on GitHub
This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…
☆11May 24, 2026Updated 2 months ago
guolele1990 / rknn_FaceRecognization
View on GitHub
☆28Mar 9, 2023Updated 3 years ago
WangHelin1997 / SpecAugment-plus
View on GitHub
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆34Jun 25, 2021Updated 5 years ago
mehedihasanbijoy / DPCSpell
View on GitHub
[Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages
☆14Aug 9, 2024Updated last year
manyeyes / K2TransducerAsr
View on GitHub
c# library for decoding K2 transducer Models，used in speech recognition (ASR)
☆13Aug 20, 2025Updated 11 months ago
catherine-qian / cocosda-SSL
View on GitHub
pytorch code for sound event localization and classification
☆13Aug 12, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
danpovey / conditional-flow-matching
View on GitHub
☆29Aug 8, 2024Updated last year
yangjingyuan / ConstDecoder
View on GitHub
☆11Oct 24, 2022Updated 3 years ago
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
yuhangear / kaldi-android
View on GitHub
☆15Nov 5, 2021Updated 4 years ago
manyeyes / AliCTTransformerPunc
View on GitHub
c# library for decoding CTTransformer punc models, which can add punctuation to Chinese and English texts
☆14Aug 18, 2025Updated 11 months ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
k2-fsa / kaldi-decoder
View on GitHub
Decoders from Kaldi using OpenFst
☆35Apr 10, 2026Updated 3 months ago
JuanFMontesinos / Acappella-YNet
View on GitHub
Official implementation of A cappella: Audio-visual Singing VoiceSeparation, from BMVC21
☆18May 14, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
leto19 / WhiSQA
View on GitHub
Whisper Speech Quality Assessment (WhiSQA)
☆16Apr 14, 2026Updated 3 months ago
rishikksh20 / NU-Wave2-pytorch
View on GitHub
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]
☆25Jul 5, 2022Updated 4 years ago
zhu-han / SpeechLLM
View on GitHub
LLM-based ASR recipe with Zipformer encoder and Qwen LLM
☆35Sep 25, 2025Updated 10 months ago
manyeyes / KaldiNativeFbankSharp
View on GitHub
c# wrapper for kaldi-native-fbank，used to extract audio features in speech recognition (ASR) task
☆10Jul 26, 2025Updated 11 months ago
ohbendy / Myanmar-font-resources
View on GitHub
Bits and bobs for making and checking Myanmar fonts
☆12Feb 2, 2026Updated 5 months ago
jackworkshop / WP-ReliableMD
View on GitHub
A reliable, beautiful and powerful markdown plug-in for WordPress, supporting editing and rendering
☆13Apr 29, 2023Updated 3 years ago
nxp-appcodehub / dm-eiq-genai-flow-demonstrator
View on GitHub
The eIQ GenAI Flow Demonstrator is a Conversational AI Pipeline application designed for NXP i.MX95 devices.
☆21Apr 16, 2026Updated 3 months ago