nimasteryang / Lingxi
☆23 · Updated last month
Alternatives and similar repositories for Lingxi
Users interested in Lingxi are comparing it to the libraries listed below.
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold ☆33 · Updated 2 months ago
- Harness used to benchmark aider against SWE Bench benchmarks ☆72 · Updated last year
- ☆64 · Updated last month
- ☆101 · Updated last month
- Run SWE-bench evaluations remotely ☆21 · Updated last month
- Official implementation of the paper "How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)" ☆86 · Updated 3 months ago
- Scaling Data for SWE-agents ☆265 · Updated this week
- Enhanced fork of SWE-bench, tailored for OpenDevin's ecosystem. ☆25 · Updated last year
- ☆97 · Updated 11 months ago
- ☆158 · Updated 10 months ago
- r2e: turn any GitHub repository into a programming agent environment ☆125 · Updated 2 months ago
- [ACL'25 Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline. ☆40 · Updated 2 weeks ago
- OrcaLoca: An LLM Agent Framework for Software Issue Localization [ICML '25] ☆20 · Updated 2 months ago
- ☆42 · Updated 2 months ago
- Aider's refactoring benchmark exercises based on popular Python repos ☆74 · Updated 8 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 · Updated last year
- Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents ☆77 · Updated 2 weeks ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆185 · Updated 2 months ago
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving ☆195 · Updated this week
- Agent-computer interface for AI software engineers. ☆85 · Updated this week
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ☆167 · Updated 10 months ago
- ☆41 · Updated 6 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆186 · Updated this week
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks" ☆219 · Updated last month
- The evaluation benchmark for MCP servers ☆134 · Updated last month
- Official repo for the paper "Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems" ☆54 · Updated 4 months ago
- ☆211 · Updated last month
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans. ☆86 · Updated 2 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper ☆215 · Updated last month
- Reasoning by Communicating with Agents ☆29 · Updated last month