benlipkin / decoding
Composable inference algorithms with LLMs and programmable logic
☆56 Updated last month
Alternatives and similar repositories for decoding:
Users who are interested in decoding are comparing it to the libraries listed below.
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023] ☆58 Updated last year
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks ☆40 Updated last month
- ☆45 Updated last year
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆45 Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs ☆34 Updated 11 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs" ☆21 Updated 5 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆56 Updated 7 months ago
- ☆75 Updated 5 months ago
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task. ☆133 Updated 3 months ago
- NaturalProver: Grounded Mathematical Proof Generation with Language Models ☆36 Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆67 Updated last year
- The LM Contamination Index is a manually created database of contamination evidences for LMs. ☆76 Updated 9 months ago
- NeurIPS 2024 tutorial on LLM Inference ☆38 Updated last month
- ☆32 Updated 11 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold" ☆29 Updated 7 months ago
- ☆37 Updated 5 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆69 Updated last month
- [ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasks ☆52 Updated last year
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆59 Updated this week
- ☆48 Updated 11 months ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training ☆19 Updated 5 months ago
- ☆38 Updated 9 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 Updated 11 months ago
- ☆34 Updated 5 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM ☆53 Updated 4 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators" ☆33 Updated last month
- A unified benchmark for math reasoning ☆87 Updated 2 years ago
- ☆20 Updated last year
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023) ☆57 Updated 11 months ago