Official implementation of Tuna-2: Pixel Embeddings Beat Vision Encoders for Unified Understanding and Generation
☆703May 19, 2026Updated 3 weeks ago
Alternatives and similar repositories for tuna-2
Users that are interested in tuna-2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 2026] Light-X: Generative 4D Video Rendering with Camera and Illumination Control☆186Dec 11, 2025Updated 6 months ago
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion☆303Jul 15, 2025Updated 10 months ago
- This repository maintains the code for my master thesis "learn semantic 3d reconstruction on octree"☆13May 8, 2019Updated 7 years ago
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆66Apr 28, 2026Updated last month
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data☆78Mar 6, 2026Updated 3 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code for 'Single-Image 3D Human Reconstruction with 3D-Aware Diffusion Priors and Facial Enhancement [Siggraph Asia 2025]'☆21Feb 1, 2026Updated 4 months ago
- [AAAI 2026] UltraGen☆78Feb 1, 2026Updated 4 months ago
- The official repository of EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model (CVPRW 2026)☆49Apr 19, 2026Updated last month
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆40Dec 30, 2025Updated 5 months ago
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆208Apr 13, 2026Updated last month
- [ICML'26] VideoGPA is a self-supervised framework that enhances 3D consistency in Video Diffusion Models.☆64Jun 1, 2026Updated last week
- [Arxiv 2026] ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning☆88Mar 26, 2026Updated 2 months ago
- Resilient multi-LLM orchestration with in-built failure handing, rate limits, retries, and circuit breaker.☆47Jun 1, 2026Updated last week
- [arXiv 2512.17796] Animate Any Character in Any World☆96Mar 10, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆75Feb 26, 2026Updated 3 months ago
- [T-RO'25] HiMo: High-Speed Objects Motion Compensation in Point Clouds☆79Jan 5, 2026Updated 5 months ago
- Official Code for MIMETIC^2☆13Nov 19, 2024Updated last year
- Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders☆250Feb 13, 2026Updated 3 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆178Feb 4, 2026Updated 4 months ago
- Feed-forward model for predicting 3D physics with 3DGS + NeRF☆294Mar 5, 2026Updated 3 months ago
- DreamStyle: A Unified Framework for Video Stylization☆119Jan 7, 2026Updated 5 months ago
- Official Implementation of MultiWorld: Scalable Multi-Agent Multi-View Video World Models☆223May 12, 2026Updated 3 weeks ago
- A clean Pytorch Implementation of Mean Flow, with FID evaluation on the fly☆58Sep 21, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆230Jul 16, 2025Updated 10 months ago
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy☆967Feb 27, 2026Updated 3 months ago
- ☆21Jun 3, 2023Updated 3 years ago
- [ICLR 2026] NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks☆174Apr 2, 2026Updated 2 months ago
- [SIGGRAPH 2026 / TOG] Official code of the paper "UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Pr…☆225May 15, 2026Updated 3 weeks ago
- [ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".☆520Mar 19, 2026Updated 2 months ago
- Code for paper "CLiFT: Compressive Light-Field Tokens for Compute Efficient and Adaptive Neural Rendering" [NeurIPS 2025 (spotlight)]☆76Aug 2, 2025Updated 10 months ago
- ☆94Apr 29, 2026Updated last month
- Real-Time Physical Action-Conditioned Video Generation☆203Mar 6, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [CVPR 2026] Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆178Feb 25, 2026Updated 3 months ago
- ☆62Feb 12, 2026Updated 4 months ago
- [3DV 2026 Oral] VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space☆232Nov 25, 2025Updated 6 months ago
- D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI [ICLR 2026]☆83Mar 3, 2026Updated 3 months ago
- WorldGrow: Generating Infinite 3D World [AAAI 2026 Oral]☆456Dec 3, 2025Updated 6 months ago
- [ICLR 2026 Oral (top 1.2%)] Official implementation of DepthLM☆355Jun 1, 2026Updated last week
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆21Oct 19, 2025Updated 7 months ago