dvruette / concept-guidanceLinks
Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vectors that control the behavior of LLMs at inference time.
☆21Updated last year
Alternatives and similar repositories for concept-guidance
Users that are interested in concept-guidance are comparing it to the libraries listed below
Sorting:
- Simple GRPO scripts and configurations.☆59Updated 7 months ago
- Multi-Domain Expert Learning☆67Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆54Updated 5 months ago
- ☆58Updated 4 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆71Updated 5 months ago
- ☆49Updated last year
- Code for☆27Updated 9 months ago
- Latent Diffusion Language Models☆69Updated 2 years ago
- ☆55Updated last year
- ☆100Updated 8 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆45Updated last year
- ☆39Updated last year
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated 2 years ago
- ☆69Updated last year
- Code repository for the c-BTM paper☆107Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- Understanding how features learned by neural networks evolve throughout training☆39Updated 11 months ago
- ☆46Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆19Updated this week
- Merge LLM that are split in to parts☆27Updated last month
- A repository for research on medium sized language models.☆77Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 5 months ago
- ☆82Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆88Updated last year
- ☆122Updated 7 months ago
- ☆32Updated 5 months ago
- Simple repository for training small reasoning models☆40Updated 7 months ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆38Updated 3 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆44Updated last year
- ☆34Updated last year