uiuc-focal-lab / syncode
Efficient and general syntactical decoding for Large Language Models
Related projects:
- RepoQA: Evaluating Long-Context Code Understanding
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions
- EvoEval: Evolving Coding Benchmarks via LLM
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)
- RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems (ICLR 2024)
- A multi-programming language benchmark for LLMs
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation
- Enhancing AI Software Engineering with Repository-level Code Graph
- Code and data artifact for the NeurIPS 2023 paper "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multis…
- 🤗 A specialized library for integrating context-free grammars (CFGs) in EBNF with Hugging Face Transformers
- CodeMind is a generic framework for evaluating inductive code reasoning of LLMs. It is equipped with a static analysis component that ena…
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts
- ClassEval: a benchmark for class-level code generation
- DafnyBench: A Benchmark for Formal Software Verification
- Training language models to make programs faster
- CodeBERTScore: an automatic metric for code generation, based on BERTScore
- Open-sourced predictions, execution logs, trajectories, and results from model inference and evaluation runs on the SWE-bench task
- Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
- StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation
- Repository for the paper "Large Language Model-Based Agents for Software Engineering: A Survey"
- Data and evaluation scripts for "CodePlan: Repository-level Coding using LLMs and Planning" (FSE 2024)
- Repo-level code generation papers
- ICE-Score: Instructing Large Language Models to Evaluate Code (EACL 2024)
- Code and data for XLCoST: A Benchmark Dataset for Cross-lingual Code Intelligence