dvruette / concept-guidanceLinks
Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vectors that control the behavior of LLMs at inference time.
☆21Updated last year
Alternatives and similar repositories for concept-guidance
Users that are interested in concept-guidance are comparing it to the libraries listed below
Sorting:
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated last year
- ☆29Updated last year
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆66Updated 2 years ago
- Multi-Domain Expert Learning☆66Updated last year
- ☆55Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- ☆57Updated 2 weeks ago
- Sparse and discrete interpretability tool for neural networks☆64Updated last year
- Latent Diffusion Language Models☆68Updated 2 years ago
- ☆102Updated 9 months ago
- Simple repository for training small reasoning models☆40Updated 8 months ago
- Understanding how features learned by neural networks evolve throughout training☆39Updated 11 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆109Updated 5 months ago
- ☆39Updated last year
- Measuring the situational awareness of language models☆38Updated last year
- ☆53Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆56Updated last week
- various experiments for scaling inference time compute with small reasoning models☆17Updated 9 months ago
- ☆35Updated 6 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆44Updated last year
- Experiments for efforts to train a new and improved t5☆75Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆90Updated last year
- The repository contains code for Adaptive Data Optimization☆25Updated 10 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated 11 months ago
- LLM training in simple, raw C/CUDA☆15Updated 10 months ago
- ☆49Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆28Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆71Updated last year
- Official repo for Learning to Reason for Long-Form Story Generation☆71Updated 5 months ago