benlipkin / decodingLinks
Composable inference algorithms with LLMs and programmable logic
☆70Updated 8 months ago
Alternatives and similar repositories for decoding
Users that are interested in decoding are comparing it to the libraries listed below
Sorting:
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.☆147Updated 9 months ago
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]☆71Updated last year
- A unified benchmark for math reasoning☆88Updated 2 years ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆86Updated last year
- A repository for transformer critique learning and generation☆90Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated 11 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆46Updated 8 months ago
- ☆208Updated 2 years ago
- Grammar Prompting for Domain-Specific Language Generation with Large Language Models☆75Updated last year
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations".☆83Updated last year
- ☆39Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Self-Alignment with Principle-Following Reward Models☆162Updated 3 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆93Updated 8 months ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆38Updated last year
- ☆48Updated last year
- ☆49Updated 11 months ago
- ☆119Updated last year
- ☆41Updated 5 months ago
- Neural theorem proving tutorial, version II☆38Updated last year
- [NeurIPS 2023] Learning Transformer Programs☆162Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆223Updated 8 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆149Updated 6 months ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆27Updated 5 months ago
- [ACL 2025 Main] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆36Updated 7 months ago
- This project studies the performance and robustness of language models and task-adaptation methods.☆150Updated last year
- ☆51Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- ☆138Updated 6 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆131Updated last year