microsoft / monitors4codegen
Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multispy` is a lsp client library in Python intended to be used to build applications around language servers.
☆205Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for monitors4codegen
- multispy is a lsp client library in Python intended to be used to build applications around language servers.☆58Updated last month
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions☆40Updated 3 months ago
- Binary Python wheels for all tree sitter languages.☆164Updated 4 months ago
- Grep source code and see useful code context about matching lines☆134Updated 3 months ago
- Efficient and general syntactical decoding for Large Language Models☆198Updated this week
- EvoEval: Evolving Coding Benchmarks via LLM☆60Updated 7 months ago
- Graph-based method for end-to-end code completion with context awareness on repository☆47Updated 2 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph☆96Updated 2 months ago
- ☆117Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)☆122Updated 3 months ago
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024☆133Updated 3 months ago
- ☆81Updated 4 months ago
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation☆270Updated 2 weeks ago
- r2e: turn any github repository into a programming agent environment☆89Updated 3 weeks ago
- Pip compatible CodeBLEU metric implementation available for linux/macos/win☆64Updated this week
- Repilot, a patch generation tool introduced in the ESEC/FSE'23 paper "Copiloting the Copilots: Fusing Large Language Models with Completi…☆127Updated last year
- A multi-programming language benchmark for LLMs☆207Updated this week
- Fast and robust AST parsing of any language☆28Updated 6 months ago
- Artifact repository for the paper "Lost in Translation: A Study of Bugs Introduced by Large Language Models while Translating Code", In P…☆40Updated 5 months ago
- ⚒️ Tree-sitter custom toolkit for extracting function and class from raw source file☆39Updated 4 months ago
- ☆269Updated this week
- CodeBERTScore: an automatic metric for code generation, based on BERTScore☆168Updated 8 months ago
- Open-source Self-Instruction Tuning Code LLM☆168Updated last year
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence☆66Updated last year
- LLM verified with Monte Carlo Tree Search☆251Updated 2 months ago
- A trace analysis tool for AI agents.☆124Updated last month
- RepoQA: Evaluating Long-Context Code Understanding☆100Updated 2 weeks ago
- Harness used to benchmark aider against SWE Bench benchmarks☆53Updated 4 months ago
- ☆152Updated 2 months ago
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning", FSE 2024☆52Updated 2 months ago