DeutscheKI / tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
☆412Aug 9, 2022Updated 3 years ago
Alternatives and similar repositories for tevr-asr-tool
Users who are interested in tevr-asr-tool compare it to the libraries listed below.
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆34Mar 31, 2023Updated 2 years ago
- An On-Premises, Streaming Speech Recognition System☆682Nov 28, 2021Updated 4 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)☆293Aug 5, 2021Updated 4 years ago
- PyTorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆57Mar 12, 2024Updated last year
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago
- ☆32Jan 6, 2022Updated 4 years ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆262Nov 15, 2025Updated 3 months ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools☆469Sep 20, 2023Updated 2 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Mar 14, 2023Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆25Jul 5, 2022Updated 3 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated 11 months ago
- ☆13Nov 16, 2022Updated 3 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Proposed splits for the LREC Wikipron paper☆15Apr 7, 2020Updated 5 years ago
- Convert kaldi feature extraction and nnet3 models into Tensorflow Lite models. Currently aimed at converting kaldi's x-vector models and …☆20Oct 6, 2022Updated 3 years ago
- Fleur implements a Bloom Filter library in C that is fully compatible with DCSO's Go and python implementations.☆117Feb 23, 2023Updated 2 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆36Jan 16, 2021Updated 5 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆331Nov 15, 2024Updated last year
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Dec 21, 2023Updated 2 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search☆94Sep 1, 2021Updated 4 years ago
- ☆55Jan 13, 2023Updated 3 years ago
- ICASSP 2023 Accepted☆189May 6, 2024Updated last year
- ☆56Dec 19, 2022Updated 3 years ago
- Gemma 3 pure inference in C☆103Feb 4, 2026Updated last week
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆106Mar 25, 2023Updated 2 years ago
- python wrapper for kaldi's native I/O☆27Jan 9, 2025Updated last year
- ☆42Mar 25, 2022Updated 3 years ago
- PyTorch Implementation of ProDiff (ACM-MM'22) with an Extremely-Fast diffusion speech synthesis pipeline☆432Apr 19, 2023Updated 2 years ago