qishenghu / CodeInstruct

InstructCoder (former name:Codelnstruct) enables LLMs to edit code

☆47

Related projects: ⓘ

CodeEditorBench / CodeEditorBench
☆39Updated 3 months ago
bigcode-project / astraios
Astraios: Parameter-Efficient Instruction Tuning Code Language Models
☆57Updated 5 months ago
facebookresearch / cruxeval
CRUXEval: Code Reasoning, Understanding, and Execution Evaluation
☆99Updated last month
niansong1996 / lever
Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)
☆76Updated last year
zorazrw / odex
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆42Updated 8 months ago
YuxiXie / SelfEval-Guided-Decoding
☆80Updated 9 months ago
evalplus / repoqa
RepoQA: Evaluating Long-Context Code Understanding
☆96Updated this week
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆45Updated 6 months ago
StonyBrookNLP / appworld
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…
☆81Updated last month
martin-wey / CodeUltraFeedback
CodeUltraFeedback: aligning large language models to coding preferences
☆62Updated 2 months ago
xlang-ai / arks
☆48Updated 3 months ago
Ablustrund / APPS_Plus
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
☆48Updated 2 weeks ago
nyu-mll / ILF-for-code-generation
☆73Updated last year
crux-eval / eval-arena
☆16Updated last month
amazon-science / Repoformer
Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)
☆33Updated 2 months ago
shunzh / Code-AI-Tree-Search
☆101Updated 2 months ago
SparksofAGI / MHPP
☆25Updated last week
ozyyshr / RepoGraph
Enhancing AI Software Engineering with Repository-level Code Graph
☆60Updated 3 weeks ago
OSU-NLP-Group / Fuxi
Repository for paper Tools Are Instrumental for Language Agents in Complex Environments
☆32Updated 8 months ago
WENGSYX / Self-Verification
We have released the code and demo program required for LLM with self-verification
☆45Updated 11 months ago
ntunlp / ExecEval
A distributed, extensible, secure solution for evaluating machine generated code with unit tests in multiple programming languages.
☆36Updated last month
SalesforceAIResearch / CodeChain
Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"
☆28Updated 9 months ago
reasoning-machines / CoCoGen
Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)
☆85Updated last year
liyucheng09 / Contamination_Detector
Lightweight tool to identify Data Contamination in LLMs evaluation
☆39Updated 6 months ago
allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆44Updated 8 months ago
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆65Updated last month
bigcode-project / bigcodebench-annotation
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions
☆19Updated last month
GAIR-NLP / OlympicArena
This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI"
☆79Updated last month
chujiezheng / LLM-Extrapolation
Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"
☆62Updated 3 months ago
theblackcat102 / evol-dataset
evol augment any dataset online
☆55Updated last year