dvruette / concept-guidance
Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vectors that control the behavior of LLMs at inference time.
☆21 Updated last year
Alternatives and similar repositories for concept-guidance
Users who are interested in concept-guidance are comparing it to the libraries listed below.
- ☆29 Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆75 Updated last year
- ☆39 Updated last year
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22 ☆66 Updated 3 years ago
- Multi-Domain Expert Learning ☆66 Updated last year
- ☆37 Updated 7 months ago
- ☆50 Updated last year
- Simple GRPO scripts and configurations. ☆59 Updated 9 months ago
- Sparse and discrete interpretability tool for neural networks ☆64 Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆56 Updated last month
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆62 Updated 2 years ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆44 Updated last year
- ☆56 Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆181 Updated last week
- Code repository for the c-BTM paper ☆108 Updated 2 years ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆72 Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- Code for ☆27 Updated 11 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆110 Updated 6 months ago
- Simple repository for training small reasoning models ☆45 Updated 9 months ago
- ☆124 Updated 8 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- A repository for transformer critique learning and generation ☆89 Updated last year
- A repository for research on medium sized language models. ☆78 Updated last year
- ☆21 Updated last year
- The repository contains code for Adaptive Data Optimization ☆28 Updated 11 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆28 Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆91 Updated last year
- ☆23 Updated last year
- ☆26 Updated 10 months ago