This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026)
★47 · Jun 11, 2025 · Updated 11 months ago
Alternatives and similar repositories for kyvo
Users who are interested in kyvo are comparing it to the libraries listed below.
- Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling (ICCV 2025 Highlight) ★50 · Jan 22, 2026 · Updated 4 months ago
- [arXiv'25] Unseen 3D Geometry Reasoning from a Single Image. ★82 · Jul 10, 2025 · Updated 10 months ago
- Official implementation of the paper "SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis" ★37 · Mar 29, 2026 · Updated last month
- Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning ★34 · Mar 15, 2026 · Updated 2 months ago
- [CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation" ★20 · Apr 18, 2025 · Updated last year
- TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views ★22 · Jan 14, 2026 · Updated 4 months ago
- MeshArt: Generating Articulated Meshes with Structure-Guided Transformers (CVPR2025) ★55 · Jun 9, 2025 · Updated 11 months ago
- A PyTorch Implementation of MLS-MPM (Moving Least Squares Material Point Method) ★25 · Mar 20, 2025 · Updated last year
- Professional desktop app for converting text to audiobooks with local TTS ★33 · Oct 6, 2025 · Updated 7 months ago
- PyTorch implementation of the paper: CASAGPT: Cuboid Arrangement and Scene Assembly for Interior Design [CVPR 2025] ★15 · Apr 5, 2025 · Updated last year
- A 3D mesh viewer for vscode ★75 · Apr 14, 2026 · Updated last month
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ★10 · Jul 19, 2024 · Updated last year
- Vertebral-level CT/X-ray registration through joint 3D Radiative Gaussians (RadGS) reconstruction and 3D/3D registration. ★36 · Oct 18, 2025 · Updated 7 months ago
- Open-world 3D part segmentation of point clouds ★119 · Jul 27, 2025 · Updated 9 months ago
- Code release of "Assembler: Scalable 3D Part Assembly via Anchor Point Diffusion". ★112 · Oct 16, 2025 · Updated 7 months ago
- Code for "ReSpace: Text-Driven Autoregressive 3D Indoor Scene Synthesis and Editing" ★67 · Apr 1, 2026 · Updated last month
- Test-Time Memory Framework: Control Hallucinations in Foundation Models ★11 · Nov 4, 2025 · Updated 6 months ago
- ★124 · Jul 19, 2025 · Updated 10 months ago
- ★13 · Jul 10, 2024 · Updated last year
- [EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models. ★27 · Nov 18, 2025 · Updated 6 months ago
- [CVPR 2025] MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention ★41 · Mar 12, 2025 · Updated last year
- ★19 · Aug 7, 2025 · Updated 9 months ago
- A collection of Claude commands and utilities ★27 · Updated this week
- [CVPR 2025] TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing ★188 · May 22, 2025 · Updated last year
- Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models" (CVPR 2025) ★172 · Jun 18, 2025 · Updated 11 months ago
- [MM 2024] The official implementation code for "DAT: Dialogue-Aware Transformer with Modality-Group Fusion for Human Engagement Estimatio… ★37 · Oct 31, 2024 · Updated last year
- [CVPR 2025] Program synthesis for 3D spatial reasoning ★59 · Jun 16, 2025 · Updated 11 months ago
- ★23 · Dec 11, 2024 · Updated last year
- Official Pytorch implementation for SGCR: Spherical Gaussians for Efficient 3D Curve Reconstruction (CVPR2025) ★35 · May 14, 2025 · Updated last year
- CoPart (ICCV 2025): A part-based 3D generation framework & the first large-scale part-level 3D dataset. ★202 · Jul 23, 2025 · Updated 10 months ago
- [ISPRS 2026] The P3 Dataset: Pixels, Points and Polygons for Multimodal Building Vectorization ★38 · Dec 11, 2025 · Updated 5 months ago
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right" ★19 · May 29, 2025 · Updated 11 months ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation ★14 · Mar 7, 2026 · Updated 2 months ago
- Fastest way to scaffold FastHTML applications. ★37 · Sep 13, 2025 · Updated 8 months ago
- Official repository of the paper "R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding" ★23 · Dec 2, 2024 · Updated last year
- [NeurIPS 2025] InstaInpaint: Instant 3D-Scene Inpainting with Masked Large Reconstruction Model ★55 · May 1, 2026 · Updated 3 weeks ago
- ★18 · May 15, 2025 · Updated last year
- Zsh completion plugin for the LLM CLI tool by Simon Willison ★20 · May 28, 2025 · Updated 11 months ago
- [ICCV 2025] Diffusion Curriculum (DisCL) ★18 · Sep 26, 2025 · Updated 7 months ago