structuredllm / syncodeLinks
Efficient and general syntactical decoding for Large Language Models
β305Updated last week
Alternatives and similar repositories for syncode
Users that are interested in syncode are comparing it to the libraries listed below
Sorting:
- π€ A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformersβ130Updated 8 months ago
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.β297Updated last month
- [ICML '24] R2E: Turn any GitHub Repository into a Programming Agent Environmentβ136Updated 7 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.β225Updated last week
- β75Updated last year
- β¨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024β182Updated last year
- A multi-programming language benchmark for LLMsβ284Updated 3 weeks ago
- EvoEval: Evolving Coding Benchmarks via LLMβ80Updated last year
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023)β164Updated 3 months ago
- Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructionsβ48Updated 2 months ago
- Enhancing AI Software Engineering with Repository-level Code Graphβ232Updated 8 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluationβ161Updated last year
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generationβ321Updated 9 months ago
- Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multisβ¦β276Updated last year
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repositoryβ68Updated last year
- [NeurIPS '25] Challenging Software Optimization Tasks for Evaluating SWE-Agentsβ57Updated last week
- β126Updated 6 months ago
- RepoQA: Evaluating Long-Context Code Understandingβ125Updated last year
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generationβ63Updated 2 months ago
- Pip compatible CodeBLEU metric implementation available for linux/macos/winβ126Updated 8 months ago
- Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025]β79Updated 5 months ago
- Iterate on LLM-based structured generation forward and backwardβ22Updated 8 months ago
- CodeBERTScore: an automatic metric for code generation, based on BERTScoreβ206Updated last year
- Training language models to make programs fasterβ96Updated last year
- Benchmark ClassEval for class-level code generation.β145Updated last year
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolvingβ290Updated 2 weeks ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agentsβ479Updated last week
- Must-read papers on Repository-level Code Generation & Issue Resolution π₯β219Updated this week
- β127Updated 2 years ago
- β111Updated last year