Sisyphus recipes for ASR
☆19 · Feb 25, 2026 · Updated last week
Alternatives and similar repositories for i6_core
Users that are interested in i6_core are comparing it to the libraries listed below
- ☆13 · Updated this week
- A Workflow Manager in Python ☆49 · Updated this week
- ☆13 · Aug 23, 2024 · Updated last year
- ☆30 · Jan 22, 2026 · Updated last month
- PolEval 2021 Task 1 ☆15 · Jun 28, 2022 · Updated 3 years ago
- experiments with RETURNN ☆161 · Feb 7, 2026 · Updated 3 weeks ago
- The RWTH ASR Toolkit. ☆58 · Updated this week
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆28 · Jul 6, 2022 · Updated 3 years ago
- Unsupervised speech activity detection system. ☆11 · Jul 2, 2018 · Updated 7 years ago
- The project for speech translation ☆12 · Sep 28, 2023 · Updated 2 years ago
- ☆18 · Feb 16, 2026 · Updated 2 weeks ago
- steps to perform text-based speaker diarization with kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- This is an extension of the kaldi speech recognition software that allows decoding speech with hybrid word and phoneme graphs.… ☆11 · Feb 4, 2020 · Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language. ☆13 · Oct 2, 2025 · Updated 5 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆18 · Oct 2, 2024 · Updated last year
- A simple command line tool to calculate WER for ASR. ☆14 · Oct 14, 2024 · Updated last year
- This repository contains the files used for our Interspeech 2017 paper. ☆16 · May 30, 2017 · Updated 8 years ago
- ☆14 · Jun 12, 2015 · Updated 10 years ago
- ☆17 · Oct 16, 2018 · Updated 7 years ago
- ☆12 · Jun 10, 2021 · Updated 4 years ago
- ☆12 · Dec 9, 2015 · Updated 10 years ago
- Crawling and creating a German language model resource ☆18 · Aug 23, 2022 · Updated 3 years ago
- Properly handle position-dependent phones in a subword lexicon FST ☆31 · Oct 26, 2020 · Updated 5 years ago
- A fork of Idiap Research Institute's DiarTk diarization toolkit ☆16 · Feb 20, 2016 · Updated 10 years ago
- ☆17 · Jun 30, 2020 · Updated 5 years ago
- ☆15 · Jul 11, 2022 · Updated 3 years ago
- ☆21 · Feb 5, 2018 · Updated 8 years ago
- Example workflow for our data-centric speech benchmark ☆17 · Jul 6, 2023 · Updated 2 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch ☆17 · Mar 11, 2022 · Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer ☆37 · Mar 2, 2022 · Updated 4 years ago
- Python wrapper for kaldi's arpa2fst ☆38 · Aug 27, 2025 · Updated 6 months ago
- ☆17 · Mar 1, 2024 · Updated 2 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS ☆29 · Feb 19, 2026 · Updated last week
- ☆17 · Apr 14, 2023 · Updated 2 years ago
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆41 · Feb 9, 2023 · Updated 3 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆34 · Sep 9, 2025 · Updated 5 months ago
- Text-to-Speech Latency Benchmark ☆22 · Jan 16, 2026 · Updated last month