SWE-Perf / SWE-Perf
☆34 · Updated 2 weeks ago
Alternatives and similar repositories for SWE-Perf
Users interested in SWE-Perf are comparing it to the repositories listed below.
- Reproducing R1 for Code with Reliable Rewards (☆251, updated 3 months ago)
- 🚀 SWE-bench Goes Live! (☆112, updated last month)
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | ACL 2024 SRW Oral (☆62, updated 10 months ago)
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents (☆148, updated last month)
- A Comprehensive Benchmark for Software Development (☆113, updated last year)
- LeetCode Training and Evaluation Dataset (☆30, updated 4 months ago)
- Repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models" (☆81, updated last year)
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling (☆167, updated last month)
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows" (☆102, updated last week)
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning (☆108, updated 3 months ago)
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning" (☆170, updated 3 months ago)
- [COLM'25] Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? (☆34, updated 2 months ago)
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving* (☆112, updated 8 months ago)
- NaturalCodeBench (Findings of ACL 2024) (☆67, updated 10 months ago)
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving (☆238, updated last week)
- Must-read papers on Repository-level Code Generation & Issue Resolution 🔥 (☆148, updated last week)
- Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge (☆73, updated last month)
- General Reasoner: Advancing LLM Reasoning Across All Domains (☆163, updated 2 months ago)
- Collections of RLxLM experiments using minimal code (☆13, updated 6 months ago)
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation (☆152, updated 10 months ago)
- CodeRAG-Bench: Can Retrieval Augment Code Generation? (☆153, updated 9 months ago)
- SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner (☆26, updated last month)
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) (☆155, updated last week)
- A version of verl to support tool use (☆341, updated this week)
- Official repository for the paper "FullStack Bench: Evaluating LLMs as Full Stack Coders" (☆100, updated 3 months ago)