UniPat-AI/SWE-Vision

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/UniPat-AI/SWE-Vision)

UniPat-AI / SWE-Vision

☆169

Alternatives and similar repositories for SWE-Vision

Users that are interested in SWE-Vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

UniPat-AI / UniScientist
View on GitHub
UniScientist is designed to advance universal scientific research intelligence through a unified paradigm
☆168Mar 14, 2026Updated 4 months ago
UniPat-AI / BabyVision
View on GitHub
We introduce BabyVision, a benchmark revealing the infancy of AI vision.
☆231Jan 13, 2026Updated 6 months ago
UniPat-AI / SaaS-Bench
View on GitHub
Official repository for SaaS-Bench: realistic, locally deployable SaaS workflows for GUI agent evaluation.
☆87Jun 5, 2026Updated last month
Cominclip / OmniVerifier
View on GitHub
[ICLR 2026 Oral & ICML 2026] Generative Universal Verifier as Multimodal Meta-Reasoner
☆64May 29, 2026Updated last month
kiaia / GIRAFFE
View on GitHub
Extending context length of visual language models
☆12Dec 18, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
FYYDCC / IVT-LR
View on GitHub
Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”
☆18Jan 27, 2026Updated 5 months ago
Alibaba-NLP / MaskSearch
View on GitHub
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
☆155May 27, 2025Updated last year
chenllliang / G1
View on GitHub
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
☆103May 20, 2025Updated last year
ByteDance-Seed / Seed-1.8
View on GitHub
☆219Dec 19, 2025Updated 7 months ago
microsoft / MM-WebAgent
View on GitHub
Build coherent and visually polished multimodal webpages with hierarchical planning, AIGC tools, and iterative reflection.
☆15May 17, 2026Updated 2 months ago
CaraJ7 / DraCo
View on GitHub
Offical Repository for Paper: DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
☆17Dec 7, 2025Updated 7 months ago
micky-li-hd / CoCo
View on GitHub
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation
☆54Apr 9, 2026Updated 3 months ago
vl-rewardbench / VL_RewardBench
View on GitHub
☆29Jul 23, 2025Updated 11 months ago
alohays / openai-tool2mcp
View on GitHub
mcp wrapper for openai built-in tools
☆12Mar 13, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
MoonshotAI / WorldVQA
View on GitHub
☆119Feb 4, 2026Updated 5 months ago
DoubtedSteam / MM-GCoT
View on GitHub
The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"
☆22Jul 21, 2025Updated last year
xinyan-cxy / MINT-CoT
View on GitHub
[NeurIPS 2025] MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning
☆107Sep 19, 2025Updated 10 months ago
TongkunGuan / Qwen-CodePercept
View on GitHub
[CVPR2026] CodePercept: Code-Grounded Visual STEM Perception for MLLM
☆44Updated this week
CriticBench / CriticBench
View on GitHub
[ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning
☆31Mar 5, 2024Updated 2 years ago
Jayce-Ping / AutoGPS
View on GitHub
Code for paper *AutoGPS: Automated Geometry Problem Solving via Multimodal Formalization and Deductive Reasoning*
☆17Jul 19, 2025Updated last year
MMBrowseComp / MM-BrowseComp
View on GitHub
☆70Jan 4, 2026Updated 6 months ago
pipilurj / G-LLaVA
View on GitHub
Official github repo of G-LLaVA
☆154Feb 20, 2025Updated last year
SWE-Perf / SWE-Perf
View on GitHub
☆52Oct 28, 2025Updated 8 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
agimus-project / guided_tamp_benchmark
View on GitHub
This repository contains benchmarking code for the ICRA 2023 submission titled Multi-Contact Task and Motion Planning Guided by Video Dem…
☆14Apr 20, 2025Updated last year
finyorko / longcli-bench
View on GitHub
LongCLI-Bench's official repository
☆44May 25, 2026Updated last month
thomasjoshi / agents-never-forget
View on GitHub
☆18May 18, 2025Updated last year
Claw-Eval-Live / Claw-Eval-Live
View on GitHub
☆43Jun 17, 2026Updated last month
lancopku / clip-openness
View on GitHub
[ACL 2023] Delving into the Openness of CLIP
☆24Jan 11, 2023Updated 3 years ago
HKU-MMLab / Math-VR-CodePlot-CoT
View on GitHub
Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images
☆63Nov 4, 2025Updated 8 months ago
SKYLENAGE-AI / QwenClawBench
View on GitHub
General Agent Benchmark for OpenClaw, made by Qwen Team, Alibaba Group.
☆58Jun 10, 2026Updated last month
claw-eval / claw-eval
View on GitHub
Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.
☆728May 17, 2026Updated 2 months ago
THU-KEG / PairJudgeRM
View on GitHub
☆15Apr 14, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
cythu / PeBR-R1
View on GitHub
☆15Apr 20, 2026Updated 3 months ago
SKYLENAGE-AI / DeepVision-103K
View on GitHub
Codebase for DeepVision-103K
☆22Feb 21, 2026Updated 5 months ago
vlf-silkie / VLFeedback
View on GitHub
☆102Dec 22, 2023Updated 2 years ago
multimodal-reasoning-lab / Bagel-Zebra-CoT
View on GitHub
https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT
☆137Jan 30, 2026Updated 5 months ago
humanlaya / OneMillion-Bench
View on GitHub
Evals Harness for $OneMillion-Bench
☆48Apr 21, 2026Updated 3 months ago
InternLM / ARC-VL
View on GitHub
[CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"
☆46Nov 26, 2025Updated 7 months ago
Wild-Cooperation-Hub / Awesome-MLLM-Reasoning-Benchmarks
View on GitHub
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
☆76Mar 18, 2025Updated last year