microsoft / CodeRankerLinks

Fault-aware neural code rankers

☆28

Alternatives and similar repositories for CodeRanker

Users that are interested in CodeRanker are comparing it to the libraries listed below

Sorting:

microsoft / PLOG
☆22Updated 2 years ago
microsoft / DeFacto
DeFacto - Demonstrations and Feedback for improving factual consistency of text summarization
☆29Updated 2 years ago
zorazrw / odex
[EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation
☆48Updated last year
microsoft / deep-language-networks
We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts…
☆94Updated last year
openai / human-eval-infilling
Code for the paper "Efficient Training of Language Models to Fill in the Middle"
☆183Updated 2 years ago
EleutherAI / github-downloader
Script for downloading GitHub.
☆96Updated last year
evalplus / repoqa
RepoQA: Evaluating Long-Context Code Understanding
☆113Updated 9 months ago
NL2Code / NL2Code.github.io
Large Language Models Meet NL2Code: A Survey
☆35Updated 8 months ago
nyu-mll / ILF-for-code-generation
☆78Updated 4 months ago
reddy-lab-code-research / PPOCoder
Code for the TMLR 2023 paper "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
☆114Updated last year
microsoft / TraceCodegen
☆27Updated 2 years ago
microsoft / JigsawDataset
Jigsaw Dataset: Natural language to Python Pandas code
☆53Updated 2 years ago
ntunlp / xCodeEval
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
☆86Updated 10 months ago
terryyz / ice-score
[EACL 2024] ICE-Score: Instructing Large Language Models to Evaluate Code
☆76Updated last year
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆81Updated 11 months ago
Zyq-scut / RLTF
Accepted by Transactions on Machine Learning Research (TMLR)
☆130Updated 10 months ago
niansong1996 / lever
Code for paper "LEVER: Learning to Verifiy Language-to-Code Generation with Execution" (ICML'23)
☆89Updated 2 years ago
facebookresearch / cruxeval
CRUXEval: Code Reasoning, Understanding, and Execution Evaluation
☆151Updated 9 months ago
facebookresearch / coder_reviewer_reranking
Official code release for the paper Coder Reviewer Reranking for Code Generation.
☆45Updated 2 years ago
amazon-science / recode
Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"
☆52Updated last year
microsoft / WaveCoder
Advancing LLM with Diverse Coding Capabilities
☆75Updated last year
microsoft / DiVeRSe
☆38Updated 3 years ago
THUDM / NaturalCodeBench
NaturalCodeBench (Findings of ACL 2024)
☆68Updated 9 months ago
amazon-science / cceval
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
☆153Updated last year
microsoft / coderec_programming_states
Code and Data for: Reading Between the Lines: Modeling User Behavior and Costs in AI-Assisted Programming
☆32Updated last year
reasoning-machines / CoCoGen
Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)
☆86Updated 2 years ago
qishenghu / InstructCoder
InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw
☆62Updated 10 months ago
microsoft / NLG_Instructions_MetaLearning
Boosting Natural Language Generation from Instructions with Meta-Learning
☆10Updated 2 years ago
Ablustrund / APPS_Plus
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
☆67Updated 11 months ago
aorwall / moatless-testbeds
Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…
☆14Updated 4 months ago