wyt2000 / InverseCoder
[AAAI 2025] The official code of the paper "InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct" (https://arxiv.org/abs/2407.05700).
☆13Updated last year
Alternatives and similar repositories for InverseCoder
Users who are interested in InverseCoder are comparing it to the libraries listed below
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models".☆83Updated last year
- Repo-Level Code generation papers☆214Updated 3 months ago
- Official repository for our paper "FullStack Bench: Evaluating LLMs as Full Stack Coders"☆106Updated 5 months ago
- Reproducing R1 for Code with Reliable Rewards☆259Updated 5 months ago
- Must-read papers on Repository-level Code Generation & Issue Resolution 🔥☆177Updated last week
- [TOSEM'25] The official GitHub page for the survey paper "A Survey on Large Language Models for Code Generation".☆160Updated 3 months ago
- Benchmark ClassEval for class-level code generation.☆145Updated 11 months ago
- LeetCode Training and Evaluation Dataset☆37Updated 5 months ago
- ☆44Updated 10 months ago
- An Evolving Code Generation Benchmark Aligned with Real-world Code Repositories☆63Updated last year
- A collection of practical code generation tasks and tests in open source projects. Complementary to HumanEval by OpenAI.☆151Updated 9 months ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆158Updated 2 months ago
- Pip compatible CodeBLEU metric implementation available for linux/macos/win☆116Updated 6 months ago
- Code for the paper LEGO-Prover: Neural Theorem Proving with Growing Libraries☆68Updated last year
- ☆32Updated last month
- Async pipelined version of Verl☆119Updated 6 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆153Updated last year
- ☆37Updated 2 months ago
- Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"☆116Updated last year
- ☆34Updated 2 months ago
- Neural Code Intelligence Survey 2024; Reading lists and resources☆274Updated 2 months ago
- A Comprehensive Benchmark for Software Development.☆115Updated last year
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆143Updated last year
- ☆49Updated 2 years ago
- ☆47Updated last month
- Making code editing up to 7.7x faster using multi-layer speculation☆24Updated 7 months ago
- A survey of long-context LLMs from four perspectives: architecture, infrastructure, training, and evaluation☆58Updated 6 months ago
- A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories☆33Updated last year
- CodeRAG-Bench: Can Retrieval Augment Code Generation?☆156Updated 11 months ago
- DafnyBench: A Benchmark for Formal Software Verification☆48Updated 10 months ago