Official Repository of VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents
β101Mar 10, 2026Updated last week
Alternatives and similar repositories for VisGym
Users that are interested in VisGym are comparing it to the libraries listed below
Sorting:
- π₯ [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"β26Feb 9, 2025Updated last year
- π₯ [NeurIPS 2025] Official implementation of "Generate, but Verify: Reducing Visual Hallucination in Vision-Language Models with Retrospeβ¦β54Jan 22, 2026Updated last month
- π₯ [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"β47Jun 16, 2024Updated last year
- paper on dexpilotβ15Oct 14, 2019Updated 6 years ago
- Training recipe for SpatialReasoner [NeurIPS 2025]β41Updated this week
- official code for unigameβ19Nov 26, 2025Updated 3 months ago
- LogiCity@NeurIPS'24, D&B track. A multi-agent inductive learning environment for "abstractions".β27Jun 10, 2025Updated 9 months ago
- Causal Analysis of Agent Behavior for AI Safetyβ20Jun 27, 2023Updated 2 years ago
- Official PyTorch Implementation of "Minority-Focused Text-to-Image Generation via Prompt Optimization" (CVPR 2025 Oral)β27Apr 8, 2025Updated 11 months ago
- β13Jun 22, 2025Updated 8 months ago
- An official implementation of RoDyGS: Robust Dynamic Gaussian Splatting for Casual Videosβ37Dec 11, 2024Updated last year
- VHTestβ16Oct 31, 2024Updated last year
- β35Nov 15, 2025Updated 4 months ago
- The official implementation of the paper "Large Scale Knowledge Washing"β10Jun 12, 2024Updated last year
- β17Dec 11, 2024Updated last year
- β13Jul 26, 2023Updated 2 years ago
- βοΈ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".β53Feb 23, 2026Updated 3 weeks ago
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlapsβ25Jan 22, 2026Updated last month
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memoryβ61Jan 13, 2026Updated 2 months ago
- β34Jun 5, 2025Updated 9 months ago
- β20Nov 13, 2023Updated 2 years ago
- β23Dec 23, 2025Updated 2 months ago
- β33Feb 15, 2026Updated last month
- β23Jun 5, 2025Updated 9 months ago
- A basic repository for a Clang-based tool, with CMake integration.β10Sep 22, 2023Updated 2 years ago
- Official repo for: Epipolar Geometry Improves Video Generation Modelsβ81Oct 28, 2025Updated 4 months ago
- β42Mar 11, 2026Updated last week
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoTβ126Jan 30, 2026Updated last month
- Official repository for βReasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Spaceββ18Jan 27, 2026Updated last month
- Improved Implementation for Training GLIGEN: Open-Set Grounded Text-to-Image Generationβ46Jun 1, 2024Updated last year
- Automated GPU Kernel Generation via Co-Evolving Intrinsic World Modelβ85Mar 2, 2026Updated 2 weeks ago
- Research works from Tencent AI Lab regarding self-evolving agentsβ85Jan 30, 2026Updated last month
- The official repo for LIFT: Language-Image Alignment with Fixed Text Encodersβ42Jun 10, 2025Updated 9 months ago
- a set of tools for computer vision processingβ18Jul 9, 2016Updated 9 years ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoningβ43Dec 9, 2024Updated last year
- β15Feb 21, 2025Updated last year
- A simple SQL parser based on Apache Calcite.β13Jan 17, 2026Updated 2 months ago
- β14Jan 9, 2018Updated 8 years ago
- A lightweight tool for detecting bugs on Graph Database Management Systemsβ15Jan 9, 2024Updated 2 years ago