KAIST-Visual-AI-Group/VG-AVS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/KAIST-Visual-AI-Group/VG-AVS)

KAIST-Visual-AI-Group / VG-AVS

Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection

☆24

Alternatives and similar repositories for VG-AVS

Users that are interested in VG-AVS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KAIST-Visual-AI-Group / ORIGEN
View on GitHub
[NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
☆32Oct 17, 2025Updated 9 months ago
KAIST-Visual-AI-Group / BezierFlow
View on GitHub
[ICLR 2026] Official code for BézierFlow: Learning Bézier Stochastic Interpolant Schedulers for Few-Step Generation
☆21Apr 13, 2026Updated 3 months ago
KAIST-Visual-AI-Group / Psi-Sampler
View on GitHub
[NeurIPS 2025, Spotlight] Official code for Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score-Based Genera…
☆18Feb 3, 2026Updated 5 months ago
KAIST-Visual-AI-Group / PairFlow
View on GitHub
[ICLR 2026] Official code for PairFlow: Closed-Form Source-Target Coupling for Few-Step Generation in Discrete Flow Models
☆16Jul 3, 2026Updated 2 weeks ago
KAIST-Visual-AI-Group / MatLat
View on GitHub
[CVPR 2026 Highlight] Official code for MatLat: Material Latent Space for PBR Texture Generation
☆17Updated this week
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
KAIST-Visual-AI-Group / APC-VLM
View on GitHub
[ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
☆66Sep 12, 2025Updated 10 months ago
KAIST-Visual-AI-Group / GrounDiT
View on GitHub
[NeurIPS 2024] Official Implementation of GrounDiT
☆59Dec 12, 2024Updated last year
KAIST-Visual-AI-Group / Token-Warping-MLLM
View on GitHub
☆22Mar 31, 2026Updated 3 months ago
KAIST-Visual-AI-Group / SyncTweedies
View on GitHub
Official implementation of SyncTweedies: A General Generative Framework Based on Synchronized Diffusions (NeurIPS 2024)
☆69Aug 4, 2024Updated last year
mll-lab-nu / ViewAgent
View on GitHub
☆20Jul 3, 2026Updated 2 weeks ago
KAIST-Visual-AI-Group / Flow-Inference-Time-Scaling
View on GitHub
[NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
☆75Oct 12, 2025Updated 9 months ago
kietngt00 / UFC
View on GitHub
[NeurIPS 2025] Universal Few-Shot Spatial Control for Diffusion Models
☆21Sep 18, 2025Updated 10 months ago
mll-lab-nu / Theory-of-Space
View on GitHub
THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…
☆85Feb 27, 2026Updated 4 months ago
KAIST-Visual-AI-Group / PDS
View on GitHub
Official Implementation of Posterior Distillation Sampling
☆94Jul 7, 2025Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
KAIST-Visual-AI-Group / StochSync
View on GitHub
Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…
☆21Jun 24, 2025Updated last year
DoHunLee1 / VideoGuide
View on GitHub
[CVPR2025] Official repository for "VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide"
☆30May 27, 2025Updated last year
irom-princeton / spine
View on GitHub
Geometry Meets Vision: Revisiting Pretrained Semantics in Distilled Fields
☆32Oct 3, 2025Updated 9 months ago
Visual-AI / 3DRS
View on GitHub
[NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
☆158Dec 9, 2025Updated 7 months ago
kaist-cvml / geometric-distillation
View on GitHub
[EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation
☆39Jun 12, 2025Updated last year
SNU-VGILab / improving-editability
View on GitHub
[Official Implementation] Improving Editability in Image Generation with Layer-wise Memory, CVPR 2025
☆38Mar 2, 2026Updated 4 months ago
KAIST-Visual-AI-Group / PartGlot
View on GitHub
Official Implementation of PartGlot (CVPR 2022 Oral)
☆34Sep 23, 2025Updated 9 months ago
tsinghua-fib-lab / CityEQA
View on GitHub
☆29Feb 20, 2025Updated last year
fereenwong / cdViews
View on GitHub
official code for "3D Question Answering via only 2D Vision-Language Models"
☆24Mar 4, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mhsung / libigl-renderer
View on GitHub
☆19Mar 14, 2023Updated 3 years ago
cvlab-kaist / DiffTrack
View on GitHub
[NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"
☆99Dec 3, 2025Updated 7 months ago
SNU-VGILab / InstantDrag
View on GitHub
InstantDrag: Improving Interactivity in Drag-based Image Editing
☆237May 28, 2026Updated last month
cheolhong0916 / contrastive-probing
View on GitHub
☆15Jun 19, 2026Updated last month
KAIST-Visual-AI-Group / SyncDiffusion
View on GitHub
[NeurIPS 2023] Official implementation of SyncDiffusion
☆169Apr 20, 2024Updated 2 years ago
DveloperY0115 / torch-NeRF
View on GitHub
Pytorch implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis (Mildenhall et al., ECCV 2020 Oral, Best…
☆47Mar 7, 2024Updated 2 years ago
gca-spatial-reasoning / gca
View on GitHub
Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"
☆89Apr 7, 2026Updated 3 months ago
deepplants / recursive-deep-spectral-clustering
View on GitHub
[NeurIPS 2024] Unsupervised Hierarchy-Agnostic Segmentation: Parsing Semantic Image Structure
☆12Nov 27, 2025Updated 7 months ago
memory-eqa / MemoryEQA
View on GitHub
MemoryEQA
☆27May 4, 2026Updated 2 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
sled-group / COMFORT
View on GitHub
[ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…
☆22Oct 24, 2024Updated last year
cvlab-kaist / GARD
View on GitHub
Official implementation of "GARD: Geometry-Aware Representation Denoising for Multi-view Image Restoration and 3D Reconstruction"
☆47May 27, 2026Updated last month
JasonQSY / AffordanceLLM
View on GitHub
Code for "AffordanceLLM: Grounding Affordance from Vision Language Models"
☆14Oct 18, 2024Updated last year
hyungjin-chung / VPS
View on GitHub
☆16Sep 11, 2025Updated 10 months ago
1202kbs / DMCMC
View on GitHub
Official PyTorch implementation of "Denoising MCMC for Accelerating Diffusion-Based Generative Models", ICML 2023 Oral Paper
☆31Sep 14, 2023Updated 2 years ago
carpedkm / disentangled-subject-to-vid
View on GitHub
Learning Zero-Shot Subject-Driven Video Generation Using 1% Compute
☆59Jul 9, 2026Updated last week
sjpark5800 / LA-DETR
View on GitHub
[WACV 2026] MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval
☆14Sep 18, 2025Updated 10 months ago