Lightning-Fast, On-Device TTS — running natively via ONNX.
☆82May 18, 2026Updated last month
Alternatives and similar repositories for supertonic-py
Users that are interested in supertonic-py are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Playwright Bot to Automate Trading on Deriv. It's a cross-platform Desktop app written in Python, no deployments hustles☆20Apr 8, 2022Updated 4 years ago
- ☆31Nov 24, 2023Updated 2 years ago
- ☆21Jul 6, 2025Updated 11 months ago
- ☆11Feb 26, 2024Updated 2 years ago
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆68Dec 26, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Application framework for Multimodal Distributed inference & Orchestration.☆174Updated this week
- ☆23Aug 4, 2025Updated 10 months ago
- ☆45Sep 19, 2024Updated last year
- ☆13Aug 14, 2022Updated 3 years ago
- ☆13Aug 13, 2023Updated 2 years ago
- Subtask-Aware Visual Reward Learning from Segmented Demonstrations (ICLR 2025 accepted)☆19Apr 11, 2025Updated last year
- An end-to-end framework for multi-speaker transcription that jointly models who spoke, when, and what.☆260Jun 22, 2026Updated last week
- 深度调研报告生成 Skill — 一条命令,十分钟出券商级深度调研报告 / Professional deep research report generation Skill · Supports 19 languages☆408Updated this week
- ☆12Jul 6, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated 2 years ago
- An up-to-date & curated list of awesome semi-supervised segmentation papers, methods & resources.☆13Dec 22, 2023Updated 2 years ago
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆129Mar 3, 2026Updated 3 months ago
- Official PyTorch implementation of NeurIPS 2022 paper "Invertible Monotone Operators for Normalizing Flows"☆15Nov 28, 2022Updated 3 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆22Jun 6, 2025Updated last year
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆30Feb 11, 2026Updated 4 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Jul 24, 2024Updated last year
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆21Jul 16, 2023Updated 2 years ago
- Real-valued non-volume preserving(RealNVP) implementation with PyTorch☆15May 15, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆25Updated this week
- Tutorial of YART (Yet Another Robotics Tutorial)☆18Mar 8, 2023Updated 3 years ago
- ☆171Sep 19, 2024Updated last year
- [CVPR 2024] U-VAP: User-specified Visual Appearance Personalization via Decoupled Self Augmentation☆18Sep 1, 2024Updated last year
- High-performance, semantic turn detection for conversational AI☆42Oct 1, 2025Updated 8 months ago
- Toward Spatially Unbiased Generative Models (ICCV 2021)☆90Sep 14, 2021Updated 4 years ago
- ☆19Aug 23, 2024Updated last year
- ☆33Oct 28, 2025Updated 8 months ago
- Public dataset developed by KICT_INTFLOW for IITP AI GrandChallenge 2019, Track-3☆13Mar 4, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆18May 20, 2025Updated last year
- Official implementation of BVAE-TTS☆173Sep 26, 2022Updated 3 years ago
- Easily Credit Contributors in Git Commits☆10May 8, 2023Updated 3 years ago
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes…☆10Feb 20, 2025Updated last year
- [INTERSPEECH 2025] The official implementation of DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for…☆17Sep 7, 2025Updated 9 months ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆36Oct 23, 2025Updated 8 months ago
- ☆13Jan 5, 2022Updated 4 years ago