benlipkin / decodingLinks
Composable inference algorithms with LLMs and programmable logic
β68Updated 5 months ago
Alternatives and similar repositories for decoding
Users that are interested in decoding are comparing it to the libraries listed below
Sorting:
- A repository for transformer critique learning and generationβ89Updated last year
- π LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]β70Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flβ¦β75Updated 9 months ago
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.β147Updated 7 months ago
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"β30Updated 11 months ago
- A unified benchmark for math reasoningβ88Updated 2 years ago
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023)β62Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLMβ53Updated 9 months ago
- β48Updated last year
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"β33Updated 5 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"β54Updated last year
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023)β32Updated last year
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofsβ36Updated last year
- [ICLR 2024] COLLIE: Systematic Construction of Constrained Text Generation Tasksβ51Updated last year
- β21Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluatorsβ42Updated last year
- Inspecting and Editing Knowledge Representations in Language Modelsβ116Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"β84Updated 9 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformersβ58Updated last year
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Leβ¦β91Updated 3 years ago
- β44Updated 9 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAIβ107Updated 2 weeks ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasksβ43Updated 6 months ago
- β114Updated 3 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Modelsβ55Updated 3 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"β23Updated last month
- datasets from the paper "Towards Understanding Sycophancy in Language Models"β76Updated last year
- π» Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"β55Updated last year
- LILO: Library Induction with Language Observationsβ86Updated 9 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044β33Updated 8 months ago