⚓️ Repository for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.
☆127 · Oct 27, 2025 · Updated 7 months ago
Alternatives and similar repositories for thought-anchors
Users who are interested in thought-anchors are comparing it to the repositories listed below.
- ⚓️ Interactive playground for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper. ☆18 · Dec 20, 2025 · Updated 5 months ago
- ☆37 · Jul 9, 2025 · Updated 11 months ago
- Implementations of several self-supervised pretext tasks for language and vision modalities in PyTorch. ☆13 · Jan 19, 2021 · Updated 5 years ago
- ☆22 · Feb 13, 2026 · Updated 3 months ago
- Persistent caching for Python functions ☆18 · Dec 10, 2025 · Updated 6 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments ☆38 · Oct 2, 2025 · Updated 8 months ago
- Exemplary, annotated machine learning pipeline for any tabular data problem. ☆27 · Aug 30, 2019 · Updated 6 years ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26] ☆65 · Apr 11, 2026 · Updated 2 months ago
- A library for training crosscoders ☆17 · May 28, 2025 · Updated last year
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery" ☆47 · May 31, 2024 · Updated 2 years ago
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors" ☆21 · Dec 14, 2024 · Updated last year
- [CVPR 2026 Main] MultiBanana: A Challenging Benchmark for Multi-Reference Text-to-Image Generation ☆26 · Jun 1, 2026 · Updated last week
- Code repo for the model organisms and convergent directions of EM papers. ☆69 · Sep 22, 2025 · Updated 8 months ago
- Code for NeurIPS 2024 paper: "Pure Message Passing Can Estimate Common Neighbor for Link Prediction" ☆17 · Oct 8, 2024 · Updated last year
- ☆25 · Jun 3, 2024 · Updated 2 years ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University ☆319 · Feb 8, 2026 · Updated 4 months ago
- ☆109 · Aug 8, 2024 · Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation". ☆30 · Aug 9, 2025 · Updated 10 months ago
- ☆67 · Jul 14, 2025 · Updated 10 months ago
- James' cookbook of evaluations and finetuning experiments ☆28 · Feb 19, 2026 · Updated 3 months ago
- ☆13 · Jun 13, 2022 · Updated 3 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆152 · Sep 14, 2022 · Updated 3 years ago
- ☆17 · Feb 14, 2024 · Updated 2 years ago
- https://transformer-circuits.pub/2025/attribution-graphs/methods.html ☆99 · Mar 27, 2025 · Updated last year
- the indexer and search engine for irchiver, see https://irchiver.com for license and other information ☆15 · Dec 2, 2021 · Updated 4 years ago
- ☆41 · Jun 14, 2025 · Updated 11 months ago
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges ☆29 · May 14, 2025 · Updated last year
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search ☆17 · Jan 24, 2026 · Updated 4 months ago
- GoldFinch and other hybrid transformer components ☆46 · Jul 20, 2024 · Updated last year
- ☆22 · Apr 15, 2025 · Updated last year
- Sparse Autoencoder Training Library ☆57 · May 1, 2025 · Updated last year
- ☆15 · Feb 21, 2024 · Updated 2 years ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective ☆43 · Sep 18, 2025 · Updated 8 months ago
- the handbook for nilenso ☆14 · Feb 6, 2024 · Updated 2 years ago
- ☆26 · Dec 20, 2023 · Updated 2 years ago
- Website for Princeton's undergraduate reinforcement learning course ☆15 · May 12, 2025 · Updated last year
- Sparsify transformers with SAEs and transcoders ☆725 · Updated this week
- ☆83 · Feb 25, 2025 · Updated last year
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks. ☆35 · Oct 28, 2025 · Updated 7 months ago