Marvis-Lab / SWE-EVOLinks

☆29

Alternatives and similar repositories for SWE-EVO

Users that are interested in SWE-EVO are comparing it to the libraries listed below

Sorting:

sunblaze-ucb / AgentSynth
AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents
☆36Updated 3 months ago
sunblaze-ucb / omega
☆45Updated 7 months ago
yuleiqin / RAIF
A Recipe for Building LLM Reasoners to Solve Complex Instructions
☆29Updated 3 months ago
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆47Updated 4 months ago
OpenMOSS / Lorsa
☆29Updated 2 months ago
chanchimin / AgentMonitor
Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13Updated last year
Aloriosa / srmt
The original Shared Recurrent Memory Transformer implementation
☆33Updated 6 months ago
zjunlp / OneEdit
OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.
☆19Updated last year
sotopia-lab / sotopia-rl
Sotopia-RL: Reward Design for Social Intelligence
☆46Updated 5 months ago
SalesforceAIResearch / swecomm
☆28Updated 2 months ago
dinobby / MAgICoRE
☆23Updated last year
du-nlp-lab / MLR-Copilot
☆67Updated 9 months ago
sunblaze-ucb / reasoning_ladder
☆35Updated 8 months ago
LAMDASZ-ML / Self-Backtracking
☆50Updated 11 months ago
limenlp / safer-instruct
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Updated last year
hsaest / Agent-Planning-Analysis
[NAACL'25] "Revealing the Barriers of Language Agents in Planning"
☆13Updated 7 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆42Updated 3 weeks ago
cyzus / thoughtsculpt
THOUGHTSCULPT, a general reasoning and search method for complex tasks
☆13Updated last year
metal-chart-generation / metal
☆42Updated 7 months ago
facebookresearch / BigOBench
BigOBench assesses the capacity of Large Language Models (LLMs) to comprehend time-space computational complexity of input or generated c…
☆40Updated 9 months ago
complex-reasoning / RPG
Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)
☆64Updated 3 weeks ago
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆61Updated last year
ALT-JS / OthelloSAE
CS194-196 Course Project
☆14Updated 11 months ago
facebookresearch / AbstentionBench
A holistic benchmark for LLM abstention
☆68Updated 4 months ago
tianyi-lab / C3PO
[COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆19Updated 9 months ago
ulab-uiuc / ToMAP
Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"
☆22Updated 4 months ago
TIGER-AI-Lab / One-Shot-CFT
The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]
☆33Updated 4 months ago
GMLR-Penn / Multiplex-Thinking
Multiplex Thinking
☆48Updated last week
shenao-zhang / reward-augmented-preference
The official implementation of Preference Data Reward-Augmentation.
☆18Updated 8 months ago
Fu-Dayuan / PreAct
PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)
☆30Updated last year