☆160Mar 18, 2026Updated 2 months ago
Alternatives and similar repositories for SWE-Vision
Users that are interested in SWE-Vision are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆28Jul 23, 2025Updated 9 months ago
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"☆30Mar 30, 2026Updated last month
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆100Oct 27, 2025Updated 6 months ago
- Visual Generation Tuning☆100Apr 16, 2026Updated last month
- ☆40Mar 26, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning☆102May 20, 2025Updated last year
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆36Oct 3, 2024Updated last year
- [ICCV 2025] Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts☆38Apr 3, 2026Updated last month
- Fine-tuned LLMs generate accurate 3D human avatars from textual descriptions using the SMPL-X model, enhancing customization and simulati…☆38Feb 5, 2025Updated last year
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40May 26, 2025Updated 11 months ago
- Dataset from Tip of the Tongue Known-Item Retrieval (2021) paper.☆12Nov 4, 2021Updated 4 years ago
- 清华大学人工智能导论(龙明盛老师)课程课件,作业以 及试题☆16Jun 26, 2023Updated 2 years ago
- Implementation of the Snappy compression algorithm as a RoCC accelerator☆12Jul 29, 2019Updated 6 years ago
- Code for PyMTL Tutorial @ ISCA 2019☆11Jun 22, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆38Jan 9, 2026Updated 4 months ago
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆92Oct 15, 2025Updated 7 months ago
- mcp wrapper for openai built-in tools☆12Mar 13, 2025Updated last year
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆14Oct 12, 2024Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆14Nov 1, 2025Updated 6 months ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆27Nov 28, 2025Updated 5 months ago
- ☆15Feb 11, 2025Updated last year
- A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…☆20Apr 23, 2025Updated last year
- Official implementation of "Continual Learning by Modeling Intra-Class Variation" (MOCA). [TMLR 2023]☆16Mar 3, 2023Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆15Apr 8, 2026Updated last month
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆58Feb 23, 2026Updated 2 months ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- [ACM MM 2025] LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs☆16Apr 16, 2026Updated last month
- A timer theme of Wallpaper Engine (13k Subscribers)☆13Oct 26, 2022Updated 3 years ago
- Layout, rendering ELK Graph generated by easysoc-firrtl, and display the graph as an interactive diagram to represent Chisel generated Fi…☆13Apr 1, 2022Updated 4 years ago
- ☆31Updated this week
- ☆17Apr 10, 2025Updated last year
- Deep Interest Network for Click-Through Rate Prediction Deep Interest Evolution Network for Click-Through Rate Prediction☆11Oct 14, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆16Jul 15, 2025Updated 10 months ago
- Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups☆51Dec 23, 2024Updated last year
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆10Sep 3, 2024Updated last year
- DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.☆137Feb 10, 2026Updated 3 months ago
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆22Jun 12, 2025Updated 11 months ago
- Simple setup for personal dotfiles☆11Mar 29, 2026Updated last month
- Increasing the scale and diversity of chart de-rendering data.☆12Mar 13, 2024Updated 2 years ago