xjywhu / Awesome-Multimodal-LLM-for-CodeLinks
Multimodal Large Language Models for Code Generation under Multimodal Scenarios
☆196Updated 3 weeks ago
Alternatives and similar repositories for Awesome-Multimodal-LLM-for-Code
Users that are interested in Awesome-Multimodal-LLM-for-Code are comparing it to the libraries listed below
Sorting:
- ☆229Updated 3 weeks ago
- Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents☆217Updated 8 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay☆148Updated 8 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆156Updated 7 months ago
- [FSE'2026] SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆138Updated this week
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆142Updated 11 months ago
- [NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning☆150Updated 4 months ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆351Updated 2 weeks ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆290Updated 2 months ago
- [ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆177Updated 3 months ago
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"☆120Updated 2 months ago
- Reproducing R1 for Code with Reliable Rewards☆285Updated 8 months ago
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!☆161Updated last week
- A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks☆13Updated 11 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆95Updated 9 months ago
- ☆177Updated last month
- Test-time preferenece optimization (ICML 2025).☆178Updated 8 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆312Updated 3 weeks ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆133Updated 10 months ago
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆52Updated 5 months ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆342Updated last week
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆154Updated last month
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 9 months ago
- Towards a Unified View of Large Language Model Post-Training☆199Updated 4 months ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking☆84Updated last week
- A Comprehensive Benchmark for Software Development.☆127Updated last year
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆95Updated 2 months ago
- ☆25Updated 5 months ago
- ☆332Updated 8 months ago
- The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".☆78Updated last week