yingweima2022 / SWE-ReasonerLinks
☆21Updated 2 months ago
Alternatives and similar repositories for SWE-Reasoner
Users that are interested in SWE-Reasoner are comparing it to the libraries listed below
Sorting:
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆59Updated 8 months ago
- ☆22Updated 3 weeks ago
- ☆12Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆60Updated 10 months ago
- [Up-to-date] Awesome Agentic Deep Research Resources☆30Updated this week
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆13Updated 8 months ago
- ☆64Updated 3 weeks ago
- Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via Online Modification"☆27Updated last year
- A Lightweight Visual Reasoning Benchmark for Evaluating Large Multimodal Models through Complex Diagrams in Coding Tasks☆12Updated 4 months ago
- Reproducing R1 for Code with Reliable Rewards☆221Updated last month
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆126Updated 9 months ago
- Reinforcement Learning for Repository-Level Code Completion☆33Updated 10 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆58Updated last year
- Awesome-Efficient-Inference-for-LRMs is a collection of state-of-the-art, novel, exciting, token-efficient methods for Large Reasoning Mo…☆72Updated last week
- ☆62Updated last week
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆54Updated 6 months ago
- ☆46Updated 7 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆75Updated 3 weeks ago
- ☆47Updated 2 weeks ago
- ☆222Updated last week
- Repo-Level Code generation papers☆188Updated 2 months ago
- [LREC-COLING'24] HumanEval-XL: A Multilingual Code Generation Benchmark for Cross-lingual Natural Language Generalization☆38Updated 3 months ago
- Must-read papers on Repository-level Code Generation & Issue Resolution 🔥☆101Updated this week
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆136Updated 2 weeks ago
- ☆30Updated last month
- The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".☆78Updated 11 months ago
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆23Updated 4 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆228Updated 3 weeks ago
- A research repo for experiments about Reinforcement Finetuning☆48Updated 2 months ago
- SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner☆17Updated last week