[CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images
☆56 Jan 23, 2026 Updated last month
Alternatives and similar repositories for CodeVision
Users who are interested in CodeVision are comparing it to the libraries listed below.
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency. ☆12 Oct 12, 2024 Updated last year
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use ☆28 Nov 4, 2025 Updated 3 months ago
- The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learn… ☆44 Jan 5, 2026 Updated last month
- A unified reinforcement learning toolbox for joint RL on language models and diffusion models ☆75 Feb 7, 2026 Updated 3 weeks ago
- Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning ☆45 Jul 2, 2025 Updated 7 months ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas… ☆10 Dec 19, 2024 Updated last year
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence ☆82 Feb 13, 2026 Updated 2 weeks ago
- Doodling our way to AGI ✏️ 🖼️ 🧠 ☆121 May 29, 2025 Updated 9 months ago
- Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs. ☆89 Feb 11, 2026 Updated 2 weeks ago
- ☆68 Nov 5, 2025 Updated 3 months ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop ☆16 Feb 19, 2026 Updated last week
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co… ☆28 Feb 18, 2026 Updated last week
- [ICLR 2026] "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use" ☆156 Feb 7, 2026 Updated 3 weeks ago
- The repository of VG-Refiner paper ☆17 Dec 9, 2025 Updated 2 months ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory. ☆29 Updated this week
- ☆17 May 25, 2025 Updated 9 months ago
- Martingale posterior neural networks for fast sequential decision making @ NeurIPS 2025 ☆23 Nov 13, 2025 Updated 3 months ago
- ☆10 Sep 4, 2025 Updated 5 months ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations. ☆24 Feb 20, 2026 Updated last week
- ☆31 Feb 3, 2026 Updated 3 weeks ago
- MCP server for Grok AI API integration ☆19 Jun 2, 2025 Updated 8 months ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence ☆10 Mar 2, 2025 Updated 11 months ago
- Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO ☆92 Dec 1, 2025 Updated 2 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence. ☆28 Dec 30, 2025 Updated 2 months ago
- Fast, free, easy, and object-agnostic video anonymization ☆11 Dec 12, 2020 Updated 5 years ago
- Combines InstantID🔥 and FouriScale to generate high-resolution images! ☆11 Apr 3, 2024 Updated last year
- ☆13 Oct 21, 2024 Updated last year
- Auction Theory Toolbox – Computer Verified Auctions ☆14 Jul 12, 2016 Updated 9 years ago
- CVE-Factory ☆53 Feb 13, 2026 Updated 2 weeks ago
- ☆13 Nov 5, 2024 Updated last year
- ☆24 Dec 19, 2025 Updated 2 months ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference" ☆11 Sep 3, 2024 Updated last year
- This repository collects awesome representative papers and resources for "From Pre-training to Post-training: A Survey on Time Series Fou… ☆30 Feb 1, 2026 Updated 3 weeks ago
- 💻InfiniteGPU is a platform that enables effortless exchange of compute resources for AI workloads ☆81 Jan 17, 2026 Updated last month
- [ICLR 2026] [NeurIPS 2025] ViPRA: Video Prediction for Robot Actions ☆25 Jan 27, 2026 Updated last month
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference ☆36 Oct 29, 2025 Updated 3 months ago
- DualBind is a 3D structure-based deep learning model with a dual-loss framework for accurate and fast protein-ligand binding affinity pre… ☆14 Oct 21, 2025 Updated 4 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023) ☆14 Aug 6, 2025 Updated 6 months ago
- Fuzzing solmate with medusa ☆10 Aug 14, 2023 Updated 2 years ago