yiqingxyq/RepoST

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yiqingxyq/RepoST)

yiqingxyq / RepoST

Code for "[COLM'25] RepoST: Scalable Repository-Level Coding Environment Construction with Sandbox Testing"

☆24

Alternatives and similar repositories for RepoST

Users that are interested in RepoST are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GAIR-NLP / LIMOPro
View on GitHub
☆15May 27, 2025Updated last year
jzbjyb / X-FACTR
View on GitHub
☆24Jun 12, 2023Updated 3 years ago
siat-nlp / NLP-docs
View on GitHub
Docs of NLP/deep Learning/machine learning, etc. https://siat-nlp.github.io/docs
☆11Jul 17, 2019Updated 7 years ago
choosewhatulike / case2code
View on GitHub
☆17Apr 7, 2025Updated last year
lfy79001 / S3Eval
View on GitHub
[NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
☆33Jun 10, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
iesl / leopard
View on GitHub
☆24Nov 27, 2020Updated 5 years ago
adapter-hub / playground
View on GitHub
☆14Aug 9, 2024Updated last year
DAMO-NLP-SG / RemeMo
View on GitHub
[EMNLP 2023] Once Upon a *Time* in *Graph*: Relative-Time Pretraining for Complex Temporal Reasoning
☆17Oct 31, 2023Updated 2 years ago
yafuly / CoGnition
View on GitHub
☆17Nov 10, 2021Updated 4 years ago
microsoft / LEMA
View on GitHub
official repo for the paper "Learning From Mistakes Makes LLM Better Reasoner"
☆60Dec 20, 2023Updated 2 years ago
thunlp / Knowledge-Inheritance
View on GitHub
Source code for paper: Knowledge Inheritance for Pre-trained Language Models
☆37Apr 24, 2022Updated 4 years ago
TIGER-AI-Lab / CritiqueFineTuning
View on GitHub
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]
☆182Jul 8, 2025Updated last year
ablghtianyi / ICL_Modular_Arithmetic
View on GitHub
☆19Mar 25, 2025Updated last year
yegcjs / mixinglaws
View on GitHub
☆113Jul 15, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
GAIR-NLP / BeHonest
View on GitHub
BeHonest: Benchmarking Honesty in Large Language Models
☆35Aug 15, 2024Updated last year
sail-sg / SkyLadder
View on GitHub
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43Dec 29, 2025Updated 6 months ago
Yifan-Song793 / GoodBadGreedy
View on GitHub
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
☆31Jul 17, 2024Updated 2 years ago
microsoft / DynSP
View on GitHub
Search-based-Neural-Structured-Learning-for-Sequential-Question-Answering
☆33Jun 12, 2023Updated 3 years ago
richardodliu / OpenCodeEval
View on GitHub
☆52Mar 9, 2026Updated 4 months ago
GAIR-NLP / scaleeval
View on GitHub
Scalable Meta-Evaluation of LLMs as Evaluators
☆43Feb 15, 2024Updated 2 years ago
INK-USC / ReCross
View on GitHub
ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation
☆23May 1, 2022Updated 4 years ago
Tencent / WebAggregator
View on GitHub
[ACL 2026 Main Conference] WebAggregator
☆69Oct 18, 2025Updated 9 months ago
DualityRL / multi-attempt
View on GitHub
☆19Mar 10, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zide05 / CDEvalSumm
View on GitHub
☆12Nov 3, 2022Updated 3 years ago
zhaoyu-li / PyEuclid
View on GitHub
[CAV 2025] PyEuclid: A Versatile Formal Plane Geometry System in Python
☆15Jun 27, 2025Updated last year
TIGER-AI-Lab / AceCoder
View on GitHub
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆100Apr 9, 2025Updated last year
siddhartha-gadgil / MetaExamples
View on GitHub
Examples using MetaProgramming for writing tactics etc.
☆19Nov 26, 2025Updated 7 months ago
kyegomez / Reka-Torch
View on GitHub
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
☆29Updated this week
shtoshni / span-rep
View on GitHub
Code for Repl4NLP paper "A Cross-Task Analysis of Text Span Representations"
☆21Nov 4, 2022Updated 3 years ago
nec-research / KGEval
View on GitHub
A framework for evaluating Knowledge Graph Embedding Models in a fine-grained manner.
☆15Aug 3, 2022Updated 3 years ago
frenzymath / LeanSearch-v2
View on GitHub
☆17May 18, 2026Updated 2 months ago
TIGER-AI-Lab / BrowserAgent
View on GitHub
BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions [TMLR2025]
☆34Jan 13, 2026Updated 6 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ZurichRain / HMCGR
View on GitHub
code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"
☆10Oct 20, 2022Updated 3 years ago
GAIR-NLP / benbench
View on GitHub
Benchmarking Benchmark Leakage in Large Language Models
☆61May 20, 2024Updated 2 years ago
LaVi-Lab / LongContextReasoner
View on GitHub
[ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners
☆20May 28, 2024Updated 2 years ago
Sphere-AI-Lab / FormalMATH-Bench
View on GitHub
Repository of <FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models>
☆75Jan 8, 2026Updated 6 months ago
3B-Group / ConvRe
View on GitHub
🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)
☆24Oct 10, 2023Updated 2 years ago
ruc-ai4math / LeanStateSearch
View on GitHub
☆19Apr 5, 2025Updated last year
yale-nlp / InstruSum
View on GitHub
☆23Feb 26, 2024Updated 2 years ago