korakoe / VALL-E-X
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
☆16 Apr 18, 2024 Updated last year
Alternatives and similar repositories for VALL-E-X
Users who are interested in VALL-E-X are comparing it to the repositories listed below.
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation ☆24 Jun 24, 2024 Updated last year
- animatediff prompt travel ☆19 Jan 27, 2024 Updated 2 years ago
- ☆16 Apr 23, 2024 Updated last year
- ☆12 Sep 26, 2023 Updated 2 years ago
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu… ☆13 Sep 8, 2023 Updated 2 years ago
- [NeurIPS 2023] Official Code for "Towards Robust and Expressive Whole-body Human Pose and Shape Estimation" ☆50 Updated this week
- BH hackathon ☆14 Apr 4, 2024 Updated last year
- NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction ☆24 Mar 14, 2024 Updated last year
- UnrealBakedSDF is a sample Unreal project for importing and visualizing BakedSDF meshes. ☆15 Jun 14, 2023 Updated 2 years ago
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text ☆38 Oct 17, 2025 Updated 3 months ago
- A TriposR implementation for WebUI ☆60 Mar 13, 2024 Updated last year
- A WebGPU port of Stable Diffusion using tinygrad ☆35 Jan 3, 2025 Updated last year
- The Devil is in the Edges: Monocular Depth Estimation with Edge-aware Consistency Fusion ☆35 Apr 1, 2024 Updated last year
- [ECCVW 2024] Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models ☆35 May 10, 2025 Updated 9 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆32 Jun 15, 2023 Updated 2 years ago
- ☆15 Jan 8, 2024 Updated 2 years ago
- Dungeon procedural generator similar to whatabou's "One Page Dungeon" ☆48 Jan 4, 2026 Updated last month
- ☆14 Oct 16, 2023 Updated 2 years ago
- MMD viewer powered by Babylon.js and babylon-mmd ☆16 Aug 2, 2025 Updated 6 months ago
- ☆76 Dec 14, 2024 Updated last year
- A ComfyUI plugin that provides a user interface of StableStudio ☆22 Aug 15, 2025 Updated 6 months ago
- MJCF Importer Extension ☆18 Jul 24, 2025 Updated 6 months ago
- STDFormer: Spatio Temporal Disentanglement Learning for 3D Human Mesh Recovery from Monocular Videos with Transformer ☆45 Mar 14, 2024 Updated last year
- a naive 3d human pose editor GUI. ☆20 Jul 12, 2023 Updated 2 years ago
- ☆16 Apr 7, 2024 Updated last year
- InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion (CVPR 2024) ☆77 May 3, 2024 Updated last year
- Quick lookup for Instant-angelo (https://github.com/hugoycj/Instant-angelo) results ☆21 Oct 22, 2023 Updated 2 years ago
- Thumbnail maker / image editor using PHP ☆19 Aug 11, 2021 Updated 4 years ago
- [3DV 2024] CloSe: A 3D Clothing Segmentation Dataset and Model ☆79 Jul 1, 2025 Updated 7 months ago
- [TMLR 2025] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields" ☆18 Feb 3, 2025 Updated last year
- ☆24 Sep 5, 2025 Updated 5 months ago
- Port of Facebook's LLaMA model in C/C++ ☆21 Nov 6, 2023 Updated 2 years ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support. ☆16 Feb 4, 2024 Updated 2 years ago
- [TOG 2023] HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field ☆129 Jul 22, 2024 Updated last year
- This project is dedicated to advancing the field of animatronic robots by enabling them to generate lifelike facial expressions, pushing … ☆11 Apr 1, 2024 Updated last year
- ☆19 Jul 11, 2024 Updated last year
- BakedSDF2FBX is a utility script for converting BakedSDF GLB files to FBX, allowing them to be used in real-time 3D tools like Unity and … ☆29 Jun 14, 2023 Updated 2 years ago
- A front-end GUI for interacting with AI Horde's distributed cluster of Stable Diffusion workers ☆23 Jul 4, 2025 Updated 7 months ago
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation ☆33 Oct 17, 2025 Updated 4 months ago