Lightning-Fast, On-Device TTS — running natively via ONNX.
☆34 · May 15, 2026 · Updated this week
Alternatives and similar repositories for supertonic-py
Users that are interested in supertonic-py are comparing it to the libraries listed below.
- ☆31 · Nov 24, 2023 · Updated 2 years ago
- ☆20 · Jul 6, 2025 · Updated 10 months ago
- ☆11 · Feb 26, 2024 · Updated 2 years ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi… ☆66 · Dec 26, 2025 · Updated 4 months ago
- Application framework for Multimodal Distributed inference & Orchestration. ☆162 · May 12, 2026 · Updated last week
- ☆44 · Sep 19, 2024 · Updated last year
- ☆23 · Aug 4, 2025 · Updated 9 months ago
- ☆13 · Aug 14, 2022 · Updated 3 years ago
- ☆13 · Aug 13, 2023 · Updated 2 years ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted) ☆19 · Apr 11, 2025 · Updated last year
- ☆12 · Jul 6, 2023 · Updated 2 years ago
- PyTorch implementation of StableMask (ICML'24) ☆15 · Jun 27, 2024 · Updated last year
- An up-to-date & curated list of awesome semi-supervised segmentation papers, methods & resources. ☆13 · Dec 22, 2023 · Updated 2 years ago
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti… ☆119 · Mar 3, 2026 · Updated 2 months ago
- Official PyTorch implementation of NeurIPS 2022 paper "Invertible Monotone Operators for Normalizing Flows" ☆15 · Nov 28, 2022 · Updated 3 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset ☆21 · Jun 6, 2025 · Updated 11 months ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis ☆26 · Feb 11, 2026 · Updated 3 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models' ☆20 · Jul 24, 2024 · Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective" ☆22 · Jul 16, 2023 · Updated 2 years ago
- Real-valued non-volume preserving (RealNVP) implementation with PyTorch ☆15 · May 15, 2019 · Updated 7 years ago
- ☆25 · Jul 30, 2025 · Updated 9 months ago
- Tutorial of YART (Yet Another Robotics Tutorial) ☆18 · Mar 8, 2023 · Updated 3 years ago
- ☆169 · Sep 19, 2024 · Updated last year
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation ☆18 · Sep 1, 2024 · Updated last year
- High-performance, semantic turn detection for conversational AI ☆38 · Oct 1, 2025 · Updated 7 months ago
- [INTERSPEECH 2025] The official implementation of DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for… ☆16 · Sep 7, 2025 · Updated 8 months ago
- Toward Spatially Unbiased Generative Models (ICCV 2021) ☆90 · Sep 14, 2021 · Updated 4 years ago
- ☆19 · Aug 23, 2024 · Updated last year
- ☆33 · Oct 28, 2025 · Updated 6 months ago
- Public dataset developed by KICT_INTFLOW for IITP AI GrandChallenge 2019, Track-3 ☆13 · Mar 4, 2020 · Updated 6 years ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆18 · May 20, 2025 · Updated 11 months ago
- Official implementation of BVAE-TTS ☆173 · Sep 26, 2022 · Updated 3 years ago
- Easily Credit Contributors in Git Commits ☆10 · May 8, 2023 · Updated 3 years ago
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes… ☆10 · Feb 20, 2025 · Updated last year
- CNKI crawler that retrieves key fields such as author, abstract, title, and publishing journal ☆39 · Feb 22, 2025 · Updated last year
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec" ☆35 · Oct 23, 2025 · Updated 6 months ago
- ☆13 · Jan 5, 2022 · Updated 4 years ago
- This project enhances the ZLT-X28 &PRO 5G router by integrating OpenWrt's LuCI interface without replacing the original vendor firmware. … ☆21 · Sep 4, 2025 · Updated 8 months ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor ☆19 · Jun 5, 2023 · Updated 2 years ago