VoxCPM2 TTS for ComfyUI. 30 languages, voice design, controllable cloning, 48kHz audio, and LoRA training
☆72Apr 12, 2026Updated this week
Alternatives and similar repositories for ComfyUI-VoxCPM2
Users that are interested in ComfyUI-VoxCPM2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A set of comfyui multi class nodes☆17Sep 3, 2025Updated 7 months ago
- ☆46Apr 8, 2026Updated last week
- ☆26Apr 9, 2026Updated last week
- Portrait Tools: Facial detection cropping, alignment, ID photo, etc☆20Jun 15, 2025Updated 10 months ago
- modelscope-qwen-image-api☆127Jan 4, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Mar 3, 2025Updated last year
- ☆99Apr 6, 2026Updated last week
- A simple 3D model processing tool within ComfyUI☆23Oct 18, 2024Updated last year
- Custom nodes by IAMCCS for ComfyUI — includes WANAnimate LoRA Loader Fix and cinematic extensions.☆82Updated this week
- ☆49Nov 3, 2025Updated 5 months ago
- ☆30Aug 12, 2023Updated 2 years ago
- Amphion-MaskGCT:0-sample voice synthesis and OpenAI-whisper-large-v3:Speech-to-text ComfyUI node packaging☆27Mar 5, 2025Updated last year
- Cochlear implant signal processing☆10Jun 24, 2021Updated 4 years ago
- Provide AX=YB Hand Eye Calibration for da Vinci Robot, using RealSense and Aruco with accuracy of 5mm☆13Oct 5, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Fast kernel library for Diffusion inference with multiple compute backends.☆93Mar 12, 2026Updated last month
- Sign and verify orchestrated HTTP requests☆123Mar 9, 2026Updated last month
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- Yolo26 model supports android deployment.☆35Jan 21, 2026Updated 2 months ago
- SE(n)++: An Efficient Unified Solution to Multiple Pose Estimation Problems☆15Aug 5, 2020Updated 5 years ago
- ☆13Feb 26, 2017Updated 9 years ago
- XViewer is a easy tools for data visualization. You can use it to visualize a data file or a data dir such as EuROC, TUM VIO dataset.☆17Jan 3, 2025Updated last year
- Supplementary materials for "Evaluating generalised additive mixed modelling strategies for dynamic speech analysis"☆10Jan 25, 2021Updated 5 years ago
- Prompt Generator for Video, Audio, Image, and Text. A node for ComfyUI. Including Deepseek, Alibaba Cloud Qwen, Google Gemini, and locall…☆53Jul 11, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- These custom nodes provide a rotation aware face extraction, paste back, and various face related masking options.☆175Oct 8, 2025Updated 6 months ago
- The ListHelper collection provides powerful list manipulation and AI integration for ComfyUI with GGUF/Qwen LLM CLIP☆66Feb 8, 2026Updated 2 months ago
- A lightweight tool that efficiently isolates target speaker data from your datasets.☆19Nov 23, 2024Updated last year
- AuraRing: Precise Electromagnetic Finger Tracking (IMWUT 2019)☆17Jan 6, 2020Updated 6 years ago
- audio/speech feature extraction using parselmouth, librosa, disvoice☆10Jan 28, 2022Updated 4 years ago
- Use accelerometer, magnetometer, gyroscope data, use ESKF to estimate attitude.☆15Jan 1, 2021Updated 5 years ago
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Lightweight and Efficient, 🎧Ultra High-Quality Voice Cloning, Chinese and English.☆212Jun 11, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- boss直聘mcp server☆233Nov 4, 2025Updated 5 months ago
- ForceAlign is a Python library for forced alignment of English text to English audio. You can use ForceAlign to get word or phoneme level…☆27Dec 4, 2024Updated last year
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Nov 29, 2024Updated last year
- SoulX-Podcast: Towards Realistic Long-form Podcasts with Dialectal and Paralinguistic Diversity☆90Oct 31, 2025Updated 5 months ago
- ☆14Apr 1, 2024Updated 2 years ago
- ☆10May 17, 2021Updated 4 years ago