interp-reasoning / thought-anchors.comLinks
⚓️ Interactive playground for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.
☆17Updated last month
Alternatives and similar repositories for thought-anchors.com
Users that are interested in thought-anchors.com are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Updated 9 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆74Updated 9 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated last year
- ☆41Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆78Updated last year
- ☆29Updated last month
- Simple repository for training small reasoning models☆49Updated last year
- Evaluating LLMs with CommonGen-Lite☆94Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆28Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆31Updated 8 months ago
- ☆56Updated last year
- ☆68Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆63Updated 4 months ago
- Shaping capabilities with token-level pretraining data filtering☆75Updated 2 weeks ago
- Verifiers for LLM Reinforcement Learning☆80Updated 9 months ago
- ☆93Updated last month
- Minimum Description Length probing for neural network representations☆20Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆45Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated last year
- Evaluation of neuro-symbolic engines☆41Updated last year
- ☆53Updated 2 years ago
- Repository for the paper Stream of Search: Learning to Search in Language☆153Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Updated last year
- Bayesian scaling laws for in-context learning.☆15Updated 10 months ago
- ☆48Updated last year
- ☆86Updated 2 years ago
- ☆105Updated last year