microsoft / CodeRanker
Fault-aware neural code rankers
☆28Updated 2 years ago
Alternatives and similar repositories for CodeRanker
Users that are interested in CodeRanker are comparing it to the libraries listed below
Sorting:
- CyBERTron-LM is a project which collects some pre-trained Transformer-based models.☆12Updated last year
- ☆27Updated last year
- ☆22Updated last year
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆45Updated 4 months ago
- [NeurIPS 2024] Self-Optimization Improves the Efficiency of Code Generation☆12Updated last week
- DeFacto - Demonstrations and Feedback for improving factual consistency of text summarization☆29Updated 2 years ago
- ☆24Updated 6 months ago
- Large Language Models Meet NL2Code: A Survey☆36Updated 5 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆48Updated last year
- ☆14Updated last year
- We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This datase…☆38Updated 2 years ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation☆48Updated last month
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)☆71Updated 2 years ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 6 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆11Updated last month
- ☆75Updated last month
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models"☆52Updated last year
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMs☆13Updated 8 months ago
- ☆64Updated 5 months ago
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning☆60Updated last year
- Codebase for Inference-Time Policy Adapters☆23Updated last year
- xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval☆81Updated 8 months ago
- Script for downloading GitHub.☆93Updated 10 months ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆64Updated 8 months ago
- Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"☆14Updated 2 years ago
- Boosting Natural Language Generation from Instructions with Meta-Learning☆10Updated 2 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- ☆38Updated 9 months ago
- Web queries dataset for code search☆32Updated last year
- ☆43Updated 3 months ago