justinlovelace / Diffusion-Guided-LM
☆22Updated 6 months ago
Alternatives and similar repositories for Diffusion-Guided-LM:
Users that are interested in Diffusion-Guided-LM are comparing it to the libraries listed below
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆68Updated 2 months ago
- ☆71Updated 6 months ago
- ☆20Updated 8 months ago
- ☆44Updated 6 months ago
- [EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning☆47Updated 4 months ago
- ☆28Updated 3 months ago
- ☆23Updated 5 months ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆23Updated 11 months ago
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆32Updated 3 months ago
- ☆58Updated 9 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆56Updated 3 weeks ago
- ☆76Updated 6 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆46Updated 2 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 4 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆27Updated 11 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆25Updated 10 months ago
- A library for efficient patching and automatic circuit discovery.☆53Updated this week
- ☆26Updated last month
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆52Updated 10 months ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- Replicating O1 inference-time scaling laws☆82Updated 2 months ago
- Directional Preference Alignment☆56Updated 4 months ago
- ☆55Updated 3 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆70Updated 3 months ago
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards☆48Updated 9 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆40Updated 2 months ago
- ☆12Updated 11 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆74Updated 6 months ago