Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆652May 12, 2026Updated last week
Alternatives and similar repositories for tuna-2
Users that are interested in tuna-2 are comparing it to the libraries listed below.
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆183Dec 11, 2025Updated 5 months ago
- [CVPR 2026 Highlight] Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision☆146Apr 16, 2026Updated last month
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion☆304Jul 15, 2025Updated 10 months ago
- This repository maintains the code for my master's thesis "learn semantic 3d reconstruction on octree"☆13May 8, 2019Updated 7 years ago
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆65Apr 28, 2026Updated 3 weeks ago
- Code for 'Single-Image 3D Human Reconstruction with 3D-Aware Diffusion Priors and Facial Enhancement [Siggraph Asia 2025]'☆21Feb 1, 2026Updated 3 months ago
- [AAAI 2026] UltraGen☆78Feb 1, 2026Updated 3 months ago
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026)☆48Apr 19, 2026Updated last month
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation'☆39Dec 30, 2025Updated 4 months ago
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆203Apr 13, 2026Updated last month
- [ICML'26] VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆57Updated this week
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning☆83Mar 26, 2026Updated last month
- UniMesh: Unifying 3D Mesh Understanding and Generation☆55May 8, 2026Updated 2 weeks ago
- Resilient multi-LLM orchestration with built-in failure handling, rate limits, retries, and circuit breaker.☆44Mar 23, 2026Updated last month
- [arXiv 2025.12] Animate Any Character in Any World☆97Mar 10, 2026Updated 2 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆73Feb 26, 2026Updated 2 months ago
- [T-RO'25] HiMo: High-Speed Objects Motion Compensation in Point Clouds☆78Jan 5, 2026Updated 4 months ago
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders☆245Feb 13, 2026Updated 3 months ago
- Official Code for MIMETIC^2☆13Nov 19, 2024Updated last year
- Feed-forward model for predicting 3D physics with 3DGS + NeRF☆291Mar 5, 2026Updated 2 months ago
- DreamStyle: A Unified Framework for Video Stylization☆119Jan 7, 2026Updated 4 months ago
- Official Implementation of MultiWorld: Scalable Multi-Agent Multi-View Video World Models☆205May 12, 2026Updated last week
- A clean PyTorch implementation of Mean Flow, with on-the-fly FID evaluation☆58Sep 21, 2025Updated 8 months ago
- ☆227Jul 16, 2025Updated 10 months ago
- [SIGGRAPH 2026 / TOG] Official code of the paper "UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Pr…☆212May 15, 2026Updated last week
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy☆957Feb 27, 2026Updated 2 months ago
- ☆21Jun 3, 2023Updated 2 years ago
- [ICLR 2026] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks☆172Apr 2, 2026Updated last month
- [ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".☆506Mar 19, 2026Updated 2 months ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]☆75Aug 2, 2025Updated 9 months ago
- ☆94Apr 29, 2026Updated 3 weeks ago
- Real-Time Physical Action-Conditioned Video Generation☆198Mar 6, 2026Updated 2 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆230Aug 11, 2025Updated 9 months ago
- [CVPR 2026] Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆177Feb 25, 2026Updated 2 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆82Mar 3, 2026Updated 2 months ago
- ☆54Feb 12, 2026Updated 3 months ago
- [3DV 2026 Oral] VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space☆229Nov 25, 2025Updated 5 months ago
- WorldGrow: Generating Infinite 3D World [AAAI 2026 Oral]☆455Dec 3, 2025Updated 5 months ago
- [ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM☆340Apr 25, 2026Updated 3 weeks ago