Lightning-Fast, On-Device TTS — running natively via ONNX.
☆73May 18, 2026Updated 3 weeks ago
Alternatives and similar repositories for supertonic-py
Users that are interested in supertonic-py are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Playwright Bot to Automate Trading on Deriv. It's a cross-platform Desktop app written in Python, no deployments hustles☆17Apr 8, 2022Updated 4 years ago
- ☆31Nov 24, 2023Updated 2 years ago
- ☆21Jul 6, 2025Updated 11 months ago
- ☆11Feb 26, 2024Updated 2 years ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆66Dec 26, 2025Updated 5 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Application framework for Multimodal Distributed inference & Orchestration.☆172Updated this week
- ☆23Aug 4, 2025Updated 10 months ago
- ☆44Sep 19, 2024Updated last year
- ☆13Aug 14, 2022Updated 3 years ago
- ☆13Aug 13, 2023Updated 2 years ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆19Apr 11, 2025Updated last year
- ☆12Jul 6, 2023Updated 2 years ago
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- An up-to-date & curated list of awesome semi-supervised segmentation papers, methods & resources.☆13Dec 22, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆126Mar 3, 2026Updated 3 months ago
- Official PyTorch implementation of NeurIPS 2022 paper "Invertible Monotone Operators for Normalizing Flows"☆15Nov 28, 2022Updated 3 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated last year
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆27Feb 11, 2026Updated 3 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Jul 24, 2024Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆21Jul 16, 2023Updated 2 years ago
- Real-valued non-volume preserving(RealNVP) implementation with PyTorch☆15May 15, 2019Updated 7 years ago
- ☆25Jun 2, 2026Updated last week
- Tutorial of YART (Yet Another Robotics Tutorial)☆18Mar 8, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆170Sep 19, 2024Updated last year
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation☆18Sep 1, 2024Updated last year
- High-performance, semantic turn detection for conversational AI☆41Oct 1, 2025Updated 8 months ago
- Toward Spatially Unbiased Generative Models (ICCV 2021)☆90Sep 14, 2021Updated 4 years ago
- ☆19Aug 23, 2024Updated last year
- ☆33Oct 28, 2025Updated 7 months ago
- Public dataset developed by KICT_INTFLOW for IITP AI GrandChallenge 2019, Track-3☆13Mar 4, 2020Updated 6 years ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆18May 20, 2025Updated last year
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Easily Credit Contributors in Git Commits☆10May 8, 2023Updated 3 years ago
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes…☆10Feb 20, 2025Updated last year
- [INTERSPEECH 2025] The official implementation of DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for…☆17Sep 7, 2025Updated 9 months ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆36Oct 23, 2025Updated 7 months ago
- ☆13Jan 5, 2022Updated 4 years ago
- This project enhances the ZLT-X28 &PRO 5G router by integrating OpenWrt's LuCI interface without replacing the original vendor firmware. …☆24Sep 4, 2025Updated 9 months ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆19Jun 5, 2023Updated 3 years ago