bdqnghi / SWE-EVOLinks

☆26

Alternatives and similar repositories for SWE-EVO

Users that are interested in SWE-EVO are comparing it to the libraries listed below

Sorting:

yuleiqin / RAIF
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆29Updated 3 months ago
sunblaze-ucb / AgentSynth
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents
☆36Updated 3 months ago
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆47Updated 3 months ago
OpenMOSS / Lorsa
☆29Updated 2 months ago
sunblaze-ucb / omega
☆45Updated 6 months ago
sotopia-lab / sotopia-rl
Sotopia-RL: Reward Design for Social Intelligence
☆46Updated 5 months ago
dinobby / MAgICoRE
☆23Updated last year
DualityRL / multi-attempt
☆19Updated 10 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆41Updated 3 weeks ago
du-nlp-lab / MLR-Copilot
☆67Updated 9 months ago
zjunlp / OneEdit
OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.
☆18Updated last year
sunblaze-ucb / reasoning_ladder
☆35Updated 8 months ago
hsaest / Agent-Planning-Analysis
[NAACL'25] "Revealing the Barriers of Language Agents in Planning"
☆13Updated 6 months ago
LAMDASZ-ML / Self-Backtracking
☆50Updated 11 months ago
complex-reasoning / RPG
Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)
☆63Updated 2 weeks ago
facebookresearch / BigOBench
BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c…
☆39Updated 9 months ago
metal-chart-generation / metal
☆42Updated 7 months ago
facebookresearch / AbstentionBench
A holistic benchmark for LLM abstention
☆68Updated 4 months ago
bigai-nlco / Native-Parallel-Reasoner
Official Repository of Native Parallel Reasoner
☆96Updated last month
open-compass / GPassK
[ACL 2025] Are Your LLMs Capable of Stable Reasoning?
☆32Updated 5 months ago
ByteDance-BandAI / LLM-I
🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…
☆37Updated 2 months ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆61Updated last year
tianyi-lab / C3PO
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆19Updated 9 months ago
SalesforceAIResearch / swecomm
☆28Updated 2 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆36Updated last year
TIGER-AI-Lab / One-Shot-CFT
The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]
☆33Updated 4 months ago
TIGER-AI-Lab / Hierarchical-Reasoner
Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning
☆56Updated 2 months ago
yayayacc / MUR
☆47Updated 3 months ago
mandyyyyii / east
☆19Updated 5 months ago
yueqis / API-Based-Agent
☆61Updated 6 months ago