Saibo-creator / Awesome-LLM-Constrained-DecodingLinks

A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.

☆241

Alternatives and similar repositories for Awesome-LLM-Constrained-Decoding

Users that are interested in Awesome-LLM-Constrained-Decoding are comparing it to the libraries listed below

Sorting:

epfl-dlab / transformers-CFG
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers
☆123Updated 3 months ago
princeton-nlp / AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆309Updated 10 months ago
allenai / olmes
Reproducible, flexible LLM evaluations
☆227Updated 3 weeks ago
nightdessert / Retrieval_Head
open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality
☆205Updated last year
nelson-liu / lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
☆354Updated last year
zorazrw / awesome-tool-llm
☆237Updated 11 months ago
RulinShao / retrieval-scaling
Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
☆209Updated last week
OpenBMB / InfiniteBench
Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
☆343Updated 10 months ago
WildEval / ZeroEval
A simple unified framework for evaluating LLMs
☆235Updated 3 months ago
alon-albalak / data-selection-survey
A Survey on Data Selection for Language Models
☆246Updated 3 months ago
cooperleong00 / Awesome-LLM-Interpretability
A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..
☆262Updated 4 months ago
mlfoundations / evalchemy
Automatic evals for LLMs
☆496Updated last month
ezelikman / STaR
Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)
☆206Updated 2 years ago
ZubinGou / math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆239Updated last year
SuperBruceJia / Awesome-LLM-Self-Consistency
Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
☆105Updated 2 weeks ago
booydar / babilong
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆209Updated 3 months ago
lucidrains / speculative-decoding
Explorations into some recent techniques surrounding speculative decoding
☆275Updated 7 months ago
HKUNLP / ChunkLlama
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆431Updated 9 months ago
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆468Updated last year
samkhur006 / awesome-llm-planning-reasoning
A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning mate…
☆285Updated 5 months ago
agentica-project / verl-pipeline
Async pipelined version of Verl
☆112Updated 4 months ago
google-deepmind / loft
LOFT: A 1 Million+ Token Long-Context Benchmark
☆207Updated last month
sail-sg / oat
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
☆425Updated last week
Leolty / repobench
✨ RepoBench: Benchmarking Repository-Level Code Auto-Completion Systems - ICLR 2024
☆169Updated 11 months ago
tianyi-lab / Reflection_Tuning
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
☆360Updated 11 months ago
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆504Updated 6 months ago
princeton-nlp / HELMET
The HELMET Benchmark
☆162Updated 3 months ago
structuredllm / syncode
Efficient and general syntactical decoding for Large Language Models
☆283Updated this week
xlang-ai / DS-1000
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
☆251Updated 9 months ago
allenai / reward-bench
RewardBench: the first evaluation tool for reward models.
☆622Updated last month