ZhaoZeyu1995 / WaterfallView external linksLinks
An ASR toolkit with the freedom of topology
☆10Dec 18, 2023Updated 2 years ago
Alternatives and similar repositories for Waterfall
Users that are interested in Waterfall are comparing it to the libraries listed below
Sorting:
- c# library for decoding K2 transducer Models,used in speech recognition (ASR)☆13Aug 20, 2025Updated 5 months ago
- ☆28Oct 7, 2025Updated 4 months ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆17Nov 7, 2024Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Jun 30, 2025Updated 7 months ago
- Apply https://github.com/k2-fsa/sherpa-ncnn in live streaming and WebRTC☆20Apr 16, 2023Updated 2 years ago
- Script to perform statistical significance test between ASR hypotheses.☆22Aug 13, 2017Updated 8 years ago
- The RWTH ASR Toolkit.☆58Feb 6, 2026Updated last week
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated 2 weeks ago
- Provide accurate offline voice-to-text services for VR,AR and Android platforms, such as oculus quest1/2/pro or pico3/4☆26May 21, 2024Updated last year
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- Colab notebooks for Next-gen Kaldi☆29Oct 12, 2025Updated 4 months ago
- ☆33Jul 23, 2024Updated last year
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆213Aug 7, 2025Updated 6 months ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated last year
- node-addon-api for HarmonyOS/HarmonyNext☆12Jul 23, 2025Updated 6 months ago
- Python wrapper for kaldi's arpa2fst☆37Aug 27, 2025Updated 5 months ago
- Automatically generate a world map showing where contributions to your repository are coming from.☆12Apr 11, 2024Updated last year
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)☆10Jun 2, 2021Updated 4 years ago
- c# wrapper for kaldi-native-fbank,used to extract audio features in speech recognition (ASR) task☆10Jul 26, 2025Updated 6 months ago
- Shell script templates used to create ament workspaces.☆10Jan 21, 2026Updated 3 weeks ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated last year
- ☆13May 25, 2023Updated 2 years ago
- Personal maintenance add-on projects currently include speech-to-text, RSS, local GPT, cloud backup, and go2rtc☆12Updated this week
- ☆10Aug 18, 2023Updated 2 years ago
- Go module for https://github.com/celo-org/bls-zexe/☆13Feb 2, 2024Updated 2 years ago
- ☆13Sep 25, 2024Updated last year
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 2 years ago
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated last week
- Sample code to record audio and save to Server Side Blazor using MediaRecorder API and Recorder.js☆13Dec 26, 2020Updated 5 years ago
- Draco is a script to convert reddit thread to Org document☆10Aug 9, 2022Updated 3 years ago
- A Gnome Shell Extension to interact with the Home Assistant API☆10Jan 26, 2021Updated 5 years ago
- 🔗 A developer's tool for understanding new codebases☆33Jan 25, 2026Updated 2 weeks ago
- Faros is an open BLE beacon supporting Google's Eddystone open beacon format. Faros includes source code for the Arduino platform and the…☆10May 5, 2016Updated 9 years ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆214Sep 10, 2024Updated last year
- julialang cookbook☆16Oct 3, 2014Updated 11 years ago
- Testing KAN-based text generation GPT models☆18May 6, 2024Updated last year
- Zerospeech Challenge 2021: validation and evaluation software☆12Jun 13, 2022Updated 3 years ago
- WikiQA,复现论文《Multihop Atention Networks for Qestion Answer Matching》☆11Mar 25, 2019Updated 6 years ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year