☆95Oct 21, 2025Updated 7 months ago
Alternatives and similar repositories for HunyuanVision
Users that are interested in HunyuanVision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Aug 21, 2025Updated 9 months ago
- Galadriel TEE oracle configuration and verification code [Deprecated]☆16Oct 10, 2024Updated last year
- Stable-DiffCoder is a family of lightweight open-source code DLLMs(diffusion large language models) comprising base and instruct models, …☆84Mar 9, 2026Updated 2 months ago
- An open agentic system built on smolagents, integrating multimodal state-of-the-art music AI models for understanding, generation, and in…☆30Feb 6, 2026Updated 3 months ago
- [ACL2026] Uni-MMMU : A Massive Multi-discipline Multimodal Unified Benchmark☆25Apr 13, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆36Feb 28, 2026Updated 2 months ago
- ☆11Jan 18, 2024Updated 2 years ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Feb 28, 2025Updated last year
- ☆13Nov 2, 2020Updated 5 years ago
- [ICASSP 2025] Official implementation of "ViolinDiff: Enhancing Expressive Violin Synthesis with Pitch Bend Conditioning".☆16Feb 2, 2025Updated last year
- ☆11Dec 29, 2021Updated 4 years ago
- This is the official repository of Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation.☆12Sep 25, 2024Updated last year
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models☆19Jun 12, 2024Updated last year
- A real-time voice conversion model based on VITS.☆17Aug 1, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The official repo of the paper titled DeH4R: A Decoupled and Hybrid Method for Road Network Graph Extraction.☆23Apr 10, 2026Updated last month
- Info for prospective PhD students for Chris Donahue's lab at CMU starting Fall 23.☆12Nov 13, 2022Updated 3 years ago
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆38Feb 25, 2026Updated 3 months ago
- Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas☆114Feb 3, 2026Updated 3 months ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆34Mar 10, 2026Updated 2 months ago
- MIR conference deadline countdowns☆11May 12, 2026Updated 2 weeks ago
- Code for the paper "No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations"☆12Oct 31, 2024Updated last year
- non-rigid registration in NIMBLE: A Non-rigid Hand Model with Bones and Muscles☆11Sep 2, 2022Updated 3 years ago
- ☆12Mar 28, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [3DV 2026] GIGA: Generalizable Sparse Image-driven Gaussian Humans☆17Jan 28, 2026Updated 3 months ago
- This library implements functions and classes for mesh registration, data augmentation, and data normalisation.☆12Oct 7, 2024Updated last year
- [ICCV 2025 Oral] Official implementation of Learning Streaming Video Representation via Multitask Training.☆91Dec 24, 2025Updated 5 months ago
- Official Code for CVPR2025 Paper: LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion☆31May 4, 2026Updated 3 weeks ago
- [ECCV'24] 3D Reconstruction of Objects in Hands without Real World 3D Supervision☆17Feb 3, 2025Updated last year
- [TPAMI 26/ NeurIPS 24] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner…☆75Oct 21, 2025Updated 7 months ago
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20May 22, 2025Updated last year
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆161Sep 25, 2025Updated 8 months ago
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆46May 15, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆20Mar 23, 2025Updated last year
- Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)☆12May 7, 2025Updated last year
- Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views (ICCV2023)☆14Oct 9, 2023Updated 2 years ago
- Official Repository for "Finding NeMO: A Geometry-Aware Representation of Template Views for Few-Shot Perception"☆28Apr 28, 2026Updated 3 weeks ago
- ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation☆29May 27, 2025Updated 11 months ago
- ☆38May 28, 2025Updated 11 months ago
- ☆58Updated this week