Sisyphus recipies for ASR
☆19Jun 7, 2026Updated last week
Alternatives and similar repositories for i6_core
Users that are interested in i6_core are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Updated this week
- A Workflow Manager in Python☆50May 29, 2026Updated 2 weeks ago
- ☆13Aug 23, 2024Updated last year
- experiments with RETURNN☆162Updated this week
- ☆30Apr 29, 2026Updated last month
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The RWTH ASR Toolkit.☆58Updated this week
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- The RWTH extensible training framework for universal recurrent neural networks☆375Updated this week
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆13Oct 2, 2025Updated 8 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- ☆17Apr 14, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆41Sep 9, 2025Updated 9 months ago
- Using Heltec ESP32 MCU to control an ADAU1701 audio DSP via I2C.☆12Dec 27, 2022Updated 3 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- ☆25Mar 6, 2024Updated 2 years ago
- ☆22May 27, 2026Updated 2 weeks ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 9 years ago
- Suppress mouse & keyboard events on MacOSX. Baby-proof my Mac!☆14Oct 19, 2023Updated 2 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆37Mar 3, 2026Updated 3 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- LSD-SLAM☆10Jun 6, 2018Updated 8 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆16Jun 13, 2022Updated 4 years ago
- ☆12Dec 9, 2015Updated 10 years ago
- Text-to-Speech Benchmark☆26Apr 2, 2026Updated 2 months ago
- ☆14Nov 11, 2017Updated 8 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- General tools for voice analysis.☆25May 13, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- superfast text to speech in any voice☆62Feb 16, 2026Updated 3 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Port of the OpenFST library to Windows☆83Apr 23, 2024Updated 2 years ago
- ☆18Aug 23, 2025Updated 9 months ago
- ☆21Feb 5, 2018Updated 8 years ago