Sisyphus recipes for ASR
☆19 · Mar 18, 2026 · Updated last week
Alternatives and similar repositories for i6_core
Users that are interested in i6_core are comparing it to the libraries listed below.
- ☆13 · Mar 16, 2026 · Updated last week
- A Workflow Manager in Python ☆49 · Mar 16, 2026 · Updated last week
- ☆13 · Aug 23, 2024 · Updated last year
- experiments with RETURNN ☆161 · Feb 7, 2026 · Updated last month
- ☆30 · Jan 22, 2026 · Updated 2 months ago
- The RWTH ASR Toolkit. ☆58 · Updated this week
- steps to perform text-based speaker diarization with kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- PolEval 2021 Task 1 ☆15 · Jun 28, 2022 · Updated 3 years ago
- A simple command line tool to calculate WER for ASR. ☆14 · Oct 14, 2024 · Updated last year
- The RWTH extensible training framework for universal recurrent neural networks ☆373 · Mar 17, 2026 · Updated last week
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language. ☆13 · Oct 2, 2025 · Updated 5 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆18 · Oct 2, 2024 · Updated last year
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆28 · Jul 6, 2022 · Updated 3 years ago
- ☆23 · Feb 4, 2020 · Updated 6 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆36 · Sep 9, 2025 · Updated 6 months ago
- ☆17 · Apr 14, 2023 · Updated 2 years ago
- Using Heltec ESP32 MCU to control an ADAU1701 audio DSP via I2C. ☆11 · Dec 27, 2022 · Updated 3 years ago
- The project for speech translation ☆12 · Sep 28, 2023 · Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37 · Mar 2, 2022 · Updated 4 years ago
- Code for DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access. ☆11 · Apr 13, 2022 · Updated 3 years ago
- ☆20 · Mar 16, 2026 · Updated last week
- ☆25 · Mar 6, 2024 · Updated 2 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS ☆30 · Mar 3, 2026 · Updated 3 weeks ago
- This repository contains the files used for our Interspeech 2017 paper. ☆16 · May 30, 2017 · Updated 8 years ago
- Suppress mouse & keyboard events on MacOSX. Baby-proof my Mac! ☆14 · Oct 19, 2023 · Updated 2 years ago
- superfast text to speech in any voice ☆61 · Feb 16, 2026 · Updated last month
- Properly handle position-dependent phones in a subword lexicon FST ☆31 · Oct 26, 2020 · Updated 5 years ago
- LSD-SLAM ☆10 · Jun 6, 2018 · Updated 7 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- ☆12 · Dec 9, 2015 · Updated 10 years ago
- ☆16 · Jun 13, 2022 · Updated 3 years ago
- Text-to-Speech Latency Benchmark ☆22 · Updated this week
- ☆14 · Nov 11, 2017 · Updated 8 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch ☆17 · Mar 11, 2022 · Updated 4 years ago
- Github implementation of https://reports.chatclimate.ai/ ☆23 · Jun 16, 2025 · Updated 9 months ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks ☆23 · Feb 27, 2026 · Updated 3 weeks ago
- General tools for voice analysis. ☆25 · Jul 30, 2025 · Updated 7 months ago
- DSTC10 Track 2 - Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations ☆62 · Jul 25, 2023 · Updated 2 years ago
- Unsupervised speech activity detection system. ☆11 · Jul 2, 2018 · Updated 7 years ago