Drax: Speech Recognition with Discrete Flow Matching
☆75Oct 15, 2025Updated 8 months ago
Alternatives and similar repositories for drax
Users that are interested in drax are comparing it to the libraries listed below.
- proof of concept conversation orchestrator with a speech-language model☆20Oct 19, 2024Updated last year
- a Frontier Japanese Speech Generation net☆65May 15, 2025Updated last year
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆20Jun 22, 2025Updated last year
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- Neural Homomorphic Vocoder optimized for singing voice synthesis☆38May 2, 2026Updated 2 months ago
- ☆13Nov 30, 2022Updated 3 years ago
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- ☆12Mar 24, 2024Updated 2 years ago
- A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.☆31Jan 4, 2026Updated 6 months ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated last year
- ☆35Oct 23, 2025Updated 8 months ago
- Yeoman generator for webapps utilizing ClojureScript on the front-end and back-end.☆28Mar 19, 2014Updated 12 years ago
- An official implementation of Style-Talker for Spoken Dialogue Generation☆23Jan 12, 2025Updated last year
- A small rust-based data loader☆37Feb 20, 2026Updated 4 months ago
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 6 years ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆26Updated this week
- Official Repository of UltraVoice☆62Oct 28, 2025Updated 8 months ago
- Unofficial PyTorch Implementation of "Were RNNs All We Needed?"☆17Mar 20, 2025Updated last year
- ☆33Oct 28, 2025Updated 8 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Sep 21, 2025Updated 9 months ago
- Knowledge work sdk☆49Feb 23, 2026Updated 4 months ago
- [SIGGRAPH 2025] Official Implementation of "Instant Self-Intersection Repair for 3D Meshes"☆52Mar 26, 2026Updated 3 months ago
- A collection of all our phonemizers for dataset construction and inference☆30Feb 21, 2025Updated last year
- A python package for finding words that sound like other words. Useful for entity resolution and poetry, among other things.☆15Oct 26, 2022Updated 3 years ago
- A modular, RedisTimeSeries-native observability agent. Designed for developers, tinkerers, and infrastructure teams who want full contro…☆27May 30, 2025Updated last year
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated last year
- AgentFM is a peer-to-peer network that turns everyday computers into a decentralized AI supercomputer. AgentFM lets you run massive AI wo…☆125Jun 3, 2026Updated last month
- A collection of audio codecs with a standardized API☆41Apr 15, 2026Updated 2 months ago
- ☆24Mar 13, 2020Updated 6 years ago
- ☆11Feb 5, 2024Updated 2 years ago
- A starter kit for deploying Swift applications to Vercel☆10Apr 6, 2024Updated 2 years ago
- ☆23Jul 8, 2019Updated 6 years ago
- Implementation of adaptive gradient clipping for base pytorch☆21Aug 8, 2024Updated last year
- Jointly encoding word confusion networks (WCNs) and dialogue context with BERT for spoken language understanding (SLU).☆12Jun 12, 2023Updated 3 years ago
- a fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.☆38Apr 7, 2025Updated last year
- fast, rust based epub library for python☆63Jun 12, 2026Updated 3 weeks ago
- A collection of various LLM sampling methods implemented in pure Pytorch☆30Dec 9, 2024Updated last year
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated last year
- Implementing the OPRO paper☆16Sep 18, 2023Updated 2 years ago