[CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images
☆69Jan 23, 2026Updated 3 months ago
Alternatives and similar repositories for CodeVision
Users that are interested in CodeVision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆72Feb 1, 2026Updated 3 months ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆14Oct 12, 2024Updated last year
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 4 months ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆61Nov 4, 2025Updated 6 months ago
- ☆65Feb 27, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 5 months ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆30Nov 4, 2025Updated 6 months ago
- [ICLR 2026] SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models☆95Mar 9, 2026Updated 2 months ago
- [ICLR 2026] "VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use"☆181Mar 20, 2026Updated last month
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- This is the implementation of the paper "ResWorld: Temporal Residual World Model for End-to-End Autonomous Driving" (ICLR 2026)☆44Feb 5, 2026Updated 3 months ago
- ☆18Jun 3, 2024Updated last year
- ☆14Apr 16, 2024Updated 2 years ago
- ☆33Feb 12, 2026Updated 3 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [CVPR 2026] "GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation"☆81Updated this week
- Official repository for Polarity Sampling, CVPR 2022 ORAL☆13Jul 25, 2022Updated 3 years ago
- [ICCV2023] PyTorch implementation of ''Spatial-Aware Token for Weakly Supervised Object Localization''.☆23Oct 24, 2023Updated 2 years ago
- [IJCV] PyTorch implementation of "Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation"☆19Oct 25, 2023Updated 2 years ago
- [ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.☆18Apr 21, 2025Updated last year
- ☆16Sep 25, 2025Updated 7 months ago
- Multi-step reasoning MLLM☆23Mar 8, 2026Updated 2 months ago
- The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]☆27Dec 28, 2024Updated last year
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆12Apr 4, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆13Feb 15, 2025Updated last year
- [ICLR 2026] NOVA3R: Non-pixel-aligned Visual Transformer for Amodal 3D Reconstruction☆117Apr 11, 2026Updated last month
- [FCS'24] LVLM Safety paper☆19Jan 4, 2025Updated last year
- ☆13May 17, 2025Updated last year
- [CVPR 2026 Main] MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation☆24Mar 26, 2026Updated last month
- OmniGAIA: Towards Native Omni-Modal AI Agents☆127Apr 2, 2026Updated last month
- Doodling our way to AGI ✏️ 🖼️ 🧠☆125May 29, 2025Updated 11 months ago
- apply .cube file on image in python☆16Oct 2, 2021Updated 4 years ago
- ☆13Nov 5, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the paper: "Modular Neural Image Signal Processing". A modular neural ISP with interpretable stages, multi-style rendering, cros…☆39Jan 19, 2026Updated 4 months ago
- Uses a GAN to enhance images of Fingerprints☆13Jan 27, 2026Updated 3 months ago
- ☆38Jan 9, 2026Updated 4 months ago
- Offical respority for Gait Recogniton with Drones: A benchmark (TMM 2023)☆10Feb 2, 2024Updated 2 years ago
- [ICLR'25] Official repository for "AVHBench: A Cross-Modal Hallucination Evaluation for Audio-Visual Large Language Models"☆24Mar 8, 2026Updated 2 months ago
- Why do deep convolutional networks generalize so poorly to small image transformations?☆11Jun 23, 2019Updated 6 years ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆13Nov 1, 2025Updated 6 months ago