Sisyphus recipies for ASR
☆19May 11, 2026Updated last week
Alternatives and similar repositories for i6_core
Users that are interested in i6_core are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Updated this week
- A Workflow Manager in Python☆50Apr 23, 2026Updated last month
- ☆13Aug 23, 2024Updated last year
- experiments with RETURNN☆162May 8, 2026Updated 2 weeks ago
- ☆30Apr 29, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The RWTH ASR Toolkit.☆58Updated this week
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- The RWTH extensible training framework for universal recurrent neural networks☆374Updated this week
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆13Oct 2, 2025Updated 7 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- ☆23Feb 4, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆17Apr 14, 2023Updated 3 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆38Sep 9, 2025Updated 8 months ago
- Using Heltec ESP32 MCU to control an ADAU1701 audio DSP via I2C.☆11Dec 27, 2022Updated 3 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- Code for DSTC9 Track 1 - Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access.☆11Apr 13, 2022Updated 4 years ago
- ☆25Mar 6, 2024Updated 2 years ago
- ☆22Apr 26, 2026Updated 3 weeks ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Suppress mouse & keyboard events on MacOSX. Baby-proof my Mac!☆14Oct 19, 2023Updated 2 years ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆37Mar 3, 2026Updated 2 months ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- LSD-SLAM☆10Jun 6, 2018Updated 7 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- ☆12Dec 9, 2015Updated 10 years ago
- Text-to-Speech Benchmark☆26Apr 2, 2026Updated last month
- ☆14Nov 11, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆23Mar 30, 2026Updated last month
- General tools for voice analysis.☆25May 13, 2026Updated last week
- DSTC10 Track 2 - Knowledge-grounded Task-oriented Dialogue Modeling on Spoken Conversations☆62Jul 25, 2023Updated 2 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Github implementation of https://reports.chatclimate.ai/☆24Jun 16, 2025Updated 11 months ago
- superfast text to speech in any voice☆62Feb 16, 2026Updated 3 months ago