structuredllm / syncode
Efficient and general syntactical decoding for Large Language Models
☆265 Updated this week
Alternatives and similar repositories for syncode
Users who are interested in syncode compare it to the libraries listed below.
- 🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers ☆117 Updated last month
- A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources. ☆198 Updated 2 weeks ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation ☆137 Updated 7 months ago
- A certifier for bias in LLMs ☆24 Updated last month
- ✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024 ☆163 Updated 9 months ago
- r2e: turn any github repository into a programming agent environment ☆119 Updated 3 weeks ago
- EvoEval: Evolving Coding Benchmarks via LLM ☆70 Updated last year
- Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static Analysis of Repository Context". `multis… ☆257 Updated 9 months ago
- Enhancing AI Software Engineering with Repository-level Code Graph ☆164 Updated last month
- Iterate on LLM-based structured generation forward and backward ☆15 Updated last month
- [NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation ☆307 Updated 2 months ago
- CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion (NeurIPS 2023) ☆140 Updated 9 months ago
- RepoQA: Evaluating Long-Context Code Understanding ☆108 Updated 6 months ago
- [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation ☆48 Updated 3 weeks ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆171 Updated this week
- The repository for the paper "DebugBench: Evaluating Debugging Capability of Large Language Models". ☆74 Updated 10 months ago
- Official implementation of paper How to Understand Whole Repository? New SOTA on SWE-bench Lite (21.3%) ☆85 Updated last month
- ☆110 Updated 9 months ago
- ☆61 Updated last year
- LeanUniverse: A Library for Consistent and Scalable Lean4 Dataset Management ☆63 Updated 4 months ago
- [FORGE 2025] Graph-based method for end-to-end code completion with context awareness on repository ☆62 Updated 8 months ago
- Evaluation of LLMs on latest math competitions ☆119 Updated this week
- [FSE-2024] Towards AI-Assisted Synthesis of Verified Dafny Methods ☆46 Updated 11 months ago
- Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving ☆157 Updated this week
- [ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation". ☆243 Updated 6 months ago
- Large Language Models for Software Engineering ☆225 Updated this week
- The project page for "LOGIC-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning" ☆318 Updated 11 months ago
- Super-fast Structured Outputs ☆230 Updated last week
- Pip compatible CodeBLEU metric implementation available for linux/macos/win ☆89 Updated last month
- [NeurIPS 2023 D&B] Code repository for InterCode benchmark https://arxiv.org/abs/2306.14898 ☆217 Updated last year