yingweima2022 / SWE-ReasonerLinks
☆24Updated 3 months ago
Alternatives and similar repositories for SWE-Reasoner
Users that are interested in SWE-Reasoner are comparing it to the libraries listed below
Sorting:
- ☆22Updated 4 months ago
- ☆12Updated last year
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆66Updated last year
- ☆280Updated 4 months ago
- A research repo for experiments about Reinforcement Finetuning☆52Updated 7 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆254Updated 2 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆81Updated 10 months ago
- Neural Code Intelligence Survey 2024; Reading lists and resources☆275Updated 3 months ago
- Official Repository of "Learning what reinforcement learning can't"☆69Updated 2 months ago
- Code for ACL (main) paper "JumpCoder: Go Beyond Autoregressive Coder via Online Modification"☆27Updated last year
- The repository for paper "DebugBench: "Evaluating Debugging Capability of Large Language Models".☆83Updated last year
- ☆69Updated 4 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆144Updated last year
- ☆55Updated 5 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆282Updated 2 weeks ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond☆312Updated 2 weeks ago
- this is an implementation for the paper Improve Mathematical Reasoning in Language Models by Automated Process Supervision from google de…☆42Updated 4 months ago
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"☆117Updated 2 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆56Updated 11 months ago
- Reproducing R1 for Code with Reliable Rewards☆11Updated 7 months ago
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks☆102Updated last month
- Reproducing R1 for Code with Reliable Rewards☆264Updated 6 months ago
- [EMNLP 2024] Multi-modal reasoning problems via code generation.☆26Updated 9 months ago
- ☆307Updated 5 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆66Updated 3 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆85Updated 5 months ago
- Repo-Level Code generation papers☆219Updated 3 months ago
- ☆147Updated last week
- ☆50Updated last year
- A comprehensive collection of process reward models.☆116Updated last month