☆47Oct 28, 2025Updated 4 months ago
Alternatives and similar repositories for SWE-Perf
Users that are interested in SWE-Perf are comparing it to the libraries listed below
Sorting:
- ☆12Mar 5, 2025Updated last year
- ☆34Jan 25, 2026Updated last month
- ☆15Mar 12, 2024Updated last year
- ☆45Jan 21, 2026Updated last month
- Large Language Models Meet NL2Code: A Survey☆35Nov 19, 2024Updated last year
- Repository for Teaching Broad Reasoning Skills for Multi-Step QA by Generating Hard Contexts, EMNLP22☆19Jun 23, 2023Updated 2 years ago
- Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs(EMNLP2019)☆19Dec 3, 2019Updated 6 years ago
- TSQA: Tabular Scenario Based Question Answering (AAAI 2021)☆18Dec 17, 2020Updated 5 years ago
- ☆22Jan 22, 2026Updated last month
- ☆20Jan 14, 2022Updated 4 years ago
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆24Oct 10, 2023Updated 2 years ago
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Feb 9, 2023Updated 3 years ago
- ☆29Nov 30, 2021Updated 4 years ago
- ☆70Feb 9, 2026Updated 3 weeks ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆189Aug 16, 2024Updated last year
- The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)☆33Sep 22, 2025Updated 5 months ago
- resources for the IBM Airlines Table-Question-Answering Benchmark☆33Jul 11, 2022Updated 3 years ago
- Llama-3-SynE: A Significantly Enhanced Version of Llama-3 with Advanced Scientific Reasoning and Chinese Language Capabilities | 继续预训练 提升 …☆37May 31, 2025Updated 9 months ago
- Pretraining with Natural and Synthetic Data for Few-shot Table-based Question Answering☆30Dec 2, 2022Updated 3 years ago
- [NeurIPS 2025 D&B] 🚀 SWE-bench Goes Live!☆165Feb 25, 2026Updated last week
- Some example codes for drawing figures in research paper☆35Mar 3, 2022Updated 4 years ago
- 30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days.☆11Dec 15, 2022Updated 3 years ago
- A First Look at Conventional Commits Classification☆12Nov 18, 2024Updated last year
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Sep 20, 2024Updated last year
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆33Nov 11, 2025Updated 3 months ago
- The JOS from MIT open course☆11Dec 21, 2011Updated 14 years ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- Base Repo for lecture at BYU. Feel free to contribute!☆10Jan 19, 2024Updated 2 years ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- ☆28Updated this week
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- ☆12Jan 11, 2026Updated last month
- The code for the 2018 NeurIPS paper "Dialog-to-Action: Conversational Question Answering Over a Large-Scale Knowledge Base"☆38Oct 28, 2020Updated 5 years ago
- ☆11Jul 25, 2020Updated 5 years ago
- ☆46Oct 28, 2025Updated 4 months ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Sep 29, 2024Updated last year
- Reinforced Multi-LLM Agents training☆73Jan 18, 2026Updated last month
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%)☆97Mar 26, 2025Updated 11 months ago