yukara-ikemiya / modified-shortcut-models-pytorchView external linksLinks
PyTorch implementation of Shortcut Models [Frans, 2025] with little modification
☆71Jul 11, 2025Updated 7 months ago
Alternatives and similar repositories for modified-shortcut-models-pytorch
Users that are interested in modified-shortcut-models-pytorch are comparing it to the libraries listed below
Sorting:
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- Code for PolyTask: Learning Unified Policies through Behavior Distillation☆12Oct 13, 2023Updated 2 years ago
- ☆22Dec 19, 2025Updated last month
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year
- Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models☆15Sep 10, 2025Updated 5 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆38Feb 10, 2026Updated last week
- [NeurIPS 2024] Official code for "Variational Distillation of Diffusion Policies into Mixture of Experts"☆17Dec 7, 2024Updated last year
- A collection of real-time audio effect algorithms implemented in C++.☆19Jul 16, 2025Updated 7 months ago
- ☆15Jun 13, 2023Updated 2 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated last week
- This repo contains the official PyTorch implementation of "Analyzing Discrete Self Supervised Speech Representation For Spoken Language M…☆20Jan 3, 2023Updated 3 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- [CoRL 2025] Pretraining code for FLOWER VLA on OXE☆29Sep 22, 2025Updated 4 months ago
- The accompanying code for "Exploring the limits of decoder-only models trained on public speech recognition corpora" (Ankit Gupta, George…☆20Oct 11, 2024Updated last year
- llama2 in Julia☆14Jul 24, 2023Updated 2 years ago
- An AR+AR TTS attempt.