dvruette / concept-guidance
Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vectors that control the behavior of LLMs at inference time.
☆19 Updated 10 months ago
Alternatives and similar repositories for concept-guidance:
Users who are interested in concept-guidance also compare it to the repositories listed below.
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs) ☆16 Updated last month
- Minimum Description Length probing for neural network representations ☆18 Updated last week
- ☆49 Updated 4 months ago
- Training hybrid models for dummies. ☆16 Updated this week
- ☆37 Updated 5 months ago
- Latent Diffusion Language Models ☆68 Updated last year
- ☆31 Updated 7 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆28 Updated 2 months ago
- Implementation of Spectral State Space Models ☆16 Updated 10 months ago
- ☆20 Updated 3 months ago
- ☆22 Updated last year
- Understanding how features learned by neural networks evolve throughout training ☆32 Updated 2 months ago
- ☆33 Updated 4 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆39 Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ☆30 Updated last month
- LLM training in simple, raw C/CUDA ☆14 Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated 10 months ago
- QLoRA for Masked Language Modeling ☆21 Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆31 Updated last year
- Latent Large Language Models ☆17 Updated 4 months ago
- ☆26 Updated 10 months ago
- The repository contains code for Adaptive Data Optimization ☆21 Updated last month
- ☆20 Updated 2 months ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling ☆35 Updated last year
- ☆15 Updated last month
- Code supporting the preprint "Training Language Models on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus) ☆10 Updated 3 months ago
- Utilities for PyTorch distributed ☆23 Updated last year
- ☆26 Updated 10 months ago
- Demonstration that finetuning a RoPE model on longer sequences than the pre-trained model saw adapts the model's context limit ☆63 Updated last year
- ☆31 Updated last year