Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆725 · Jun 9, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for tuna-2
Users that are interested in tuna-2 are comparing it to the libraries listed below.
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control ☆189 · Dec 11, 2025 · Updated 6 months ago
- [CVPR 2026 Highlight] Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision ☆153 · Apr 16, 2026 · Updated 2 months ago
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion ☆303 · Jul 15, 2025 · Updated 11 months ago
- This repository maintains the code for my master thesis "learn semantic 3d reconstruction on octree" ☆13 · May 8, 2019 · Updated 7 years ago
- A framework for camera-controllable image editing using unified geometric guidance and video models. ☆66 · Jun 25, 2026 · Updated last week
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data ☆79 · Mar 6, 2026 · Updated 3 months ago
- Code for 'Single-Image 3D Human Reconstruction with 3D-Aware Diffusion Priors and Facial Enhancement [Siggraph Asia 2025]' ☆21 · Feb 1, 2026 · Updated 5 months ago
- [AAAI 2026] UltraGen ☆78 · Feb 1, 2026 · Updated 5 months ago
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026) ☆50 · Apr 19, 2026 · Updated 2 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation' ☆40 · Dec 30, 2025 · Updated 6 months ago
- [ICML'26] VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models. ☆67 · Jun 6, 2026 · Updated 3 weeks ago
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing ☆212 · Apr 13, 2026 · Updated 2 months ago
- UniMesh: Unifying 3D Mesh Understanding and Generation ☆58 · May 8, 2026 · Updated last month
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning ☆90 · Mar 26, 2026 · Updated 3 months ago
- Resilient multi-LLM orchestration with built-in failure handling, rate limits, retries, and circuit breaker. ☆48 · Jun 1, 2026 · Updated last month
- [ECCV 2026] CustomX: Unified Character, Action, and Scene Customization in Video World Models ☆96 · Jun 25, 2026 · Updated last week
- [T-RO'25] HiMo: High-Speed Objects Motion Compensation in Point Clouds ☆79 · Jan 5, 2026 · Updated 5 months ago
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders ☆252 · Feb 13, 2026 · Updated 4 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization". ☆179 · Feb 4, 2026 · Updated 4 months ago
- Feed-forward model for predicting 3D physics with 3DGS + NeRF ☆294 · Mar 5, 2026 · Updated 3 months ago
- DreamStyle: A Unified Framework for Video Stylization ☆122 · Jan 7, 2026 · Updated 5 months ago
- A clean PyTorch implementation of Mean Flow, with FID evaluation on the fly ☆58 · Sep 21, 2025 · Updated 9 months ago
- ☆234 · Jul 16, 2025 · Updated 11 months ago
- Official Implementation of MultiWorld: Scalable Multi-Agent Multi-View Video World Models ☆236 · May 12, 2026 · Updated last month
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy ☆977 · Feb 27, 2026 · Updated 4 months ago
- ☆21 · Jun 3, 2023 · Updated 3 years ago
- [ICLR 2026] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks ☆177 · Apr 2, 2026 · Updated 3 months ago
- [SIGGRAPH 2026 / TOG] Official code of the paper "UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Pr… ☆234 · May 15, 2026 · Updated last month
- [ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation". ☆530 · Mar 19, 2026 · Updated 3 months ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)] ☆77 · Aug 2, 2025 · Updated 11 months ago
- ☆94 · Apr 29, 2026 · Updated 2 months ago
- Real-Time Physical Action-Conditioned Video Generation ☆209 · Mar 6, 2026 · Updated 3 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model ☆232 · Aug 11, 2025 · Updated 10 months ago
- [CVPR 2026] Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models ☆178 · Feb 25, 2026 · Updated 4 months ago
- ☆63 · Feb 12, 2026 · Updated 4 months ago
- [3DV 2026 Oral] VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space ☆232 · Nov 25, 2025 · Updated 7 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026] ☆87 · Mar 3, 2026 · Updated 3 months ago
- WorldGrow: Generating Infinite 3D World [AAAI 2026 Oral] ☆462 · Dec 3, 2025 · Updated 6 months ago
- [ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM ☆358 · Jun 1, 2026 · Updated last month