csukuangfj / kaldi-native-fbankView external linksLinks
Kaldi-compatible online fbank extractor without external dependencies
☆141Oct 9, 2025Updated 4 months ago
Alternatives and similar repositories for kaldi-native-fbank
Users that are interested in kaldi-native-fbank are comparing it to the libraries listed below
Sorting:
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆213Aug 7, 2025Updated 6 months ago
- c# wrapper for kaldi-native-fbank,used to extract audio features in speech recognition (ASR) task☆10Jul 26, 2025Updated 6 months ago
- Colab notebooks for Next-gen Kaldi☆29Oct 12, 2025Updated 4 months ago
- ☆28Oct 7, 2025Updated 4 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆55Sep 1, 2025Updated 5 months ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated 2 weeks ago
- python wrapper for kaldi's native I/O☆27Jan 9, 2025Updated last year
- Memory efficient transducer loss computation☆69Jun 10, 2022Updated 3 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆80Nov 7, 2025Updated 3 months ago
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 5 months ago
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,305Nov 19, 2025Updated 2 months ago
- ONNX Inference of Pyannote Segmentation☆97Dec 23, 2024Updated last year
- Moved to https://github.com/k2-fsa/icefall☆146Oct 13, 2022Updated 3 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Jun 30, 2025Updated 7 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆128Apr 26, 2023Updated 2 years ago
- ☆67Mar 25, 2022Updated 3 years ago
- Balanced Error Rate for Speaker Diarization☆33Feb 28, 2023Updated 2 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆54Dec 6, 2023Updated 2 years ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110May 20, 2025Updated 8 months ago
- List of NN based singal processing papers☆22Jun 5, 2023Updated 2 years ago
- A simple package for Guided source separation (GSS)☆133May 20, 2024Updated last year
- ☆1,363Updated this week
- Production First and Production Ready End-to-End Keyword Spotting Toolkit☆691Sep 17, 2025Updated 5 months ago
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- ☆28Aug 8, 2024Updated last year
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆20Apr 16, 2023Updated 2 years ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆214Sep 10, 2024Updated last year
- Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, …☆1,632Oct 20, 2025Updated 3 months ago
- Production first, nn-based on-device signal processing toolkit.☆65May 30, 2023Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆51Jun 14, 2024Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement☆16Jul 11, 2025Updated 7 months ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 10 months ago
- ☆56Jul 17, 2023Updated 2 years ago
- ☆32Oct 28, 2022Updated 3 years ago