This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026)
☆48Jun 11, 2025Updated last year
Alternatives and similar repositories for kyvo
Users that are interested in kyvo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling (ICCV 2025 Highlight)☆53Jan 22, 2026Updated 4 months ago
- Official implementation of the paper "SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis"☆39Updated this week
- [ICML 2026] 🎨 Occluded 3D Scene Reconstruction from a Single Image.☆85Updated this week
- Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning☆34Mar 15, 2026Updated 2 months ago
- [CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation" and our arxiv 2026 extension☆22Jun 5, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views☆22Jan 14, 2026Updated 5 months ago
- MeshArt: Generating Articulated Meshes with Structure-Guided Transformers (CVPR2025)☆55Jun 9, 2025Updated last year
- A PyTorch Implementation of MLS-MPM (Moving Least Squares Material Point Method)☆25Mar 20, 2025Updated last year
- Professional desktop app for converting text to audiobooks with local TTS☆33Oct 6, 2025Updated 8 months ago
- PyTorch implementation of the paper: CASAGPT: Cuboid Arrangement and Scene Assembly for Interior Design [CVPR 2025]☆15Apr 5, 2025Updated last year
- A 3D mesh viewer for vscode☆75Apr 14, 2026Updated 2 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Vertebral-level CT/X-ray registration through joint 3D Radiative Gaussians (RadGS) reconstruction and 3D/3D registration.☆37Updated this week
- Open-world 3D part segmentation of point clouds☆121Jul 27, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code release of "Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion".☆114Oct 16, 2025Updated 7 months ago
- Code for "ReSpace: Text-Driven Autoregressive 3D Indoor Scene Synthesis and Editing"☆67Apr 1, 2026Updated 2 months ago
- Building an Intelligent AWS Cloud Engineer Agent with Strands Agents SDK☆27Dec 16, 2025Updated 5 months ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 7 months ago
- ☆124Jul 19, 2025Updated 10 months ago
- ☆13Jul 10, 2024Updated last year
- [EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.☆28Nov 18, 2025Updated 6 months ago
- [CVPR 2025] MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention☆41Mar 12, 2025Updated last year
- ☆19Aug 7, 2025Updated 10 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A collection of Claude commands and utilities☆27Jun 2, 2026Updated last week
- [CVPR 2025] TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing☆190May 22, 2025Updated last year
- Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models" (CVPR 2025)☆174Jun 18, 2025Updated 11 months ago
- Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players☆607May 28, 2026Updated 2 weeks ago
- [MM 2024] The official implementation code for "DAT: Dialogue-Aware Transformer with Modality-Group Fusion for Human Engagement Estimatio…☆36Oct 31, 2024Updated last year
- [CVPR 2025] Program synthesis for 3D spatial reasoning☆60Jun 16, 2025Updated 11 months ago
- ☆23Dec 11, 2024Updated last year
- Official Pytorch implementation for SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction (CVPR2025)☆35May 14, 2025Updated last year
- CoPart (ICCV 2025): A part-based 3D generation framework & the first large-scale part-level 3D dataset.☆207Jul 23, 2025Updated 10 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ISPRS 2026] The P3 Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization☆39May 27, 2026Updated 2 weeks ago
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right"☆19May 29, 2025Updated last year
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆15Mar 7, 2026Updated 3 months ago
- Fastest way to scaffold FastHTML applications.☆37Sep 13, 2025Updated 9 months ago
- Official repository of the paper "R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding"☆23Dec 2, 2024Updated last year
- [NeurIPS 2025] InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model☆56May 1, 2026Updated last month
- ☆18May 15, 2025Updated last year