andersonbcdefg / dpo-lora
direct preference optimization with only 1 model copy :)
☆12 Updated last year
Alternatives and similar repositories for dpo-lora:
Users who are interested in dpo-lora are comparing it to the repositories listed below.
- ☆11 Updated last week
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆60 Updated 8 months ago
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats. ☆30 Updated last year
- Simplex Random Feature attention, in PyTorch ☆72 Updated last year
- ☆24 Updated 5 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search. ☆26 Updated last month
- Public Inflection Benchmarks ☆69 Updated 10 months ago
- ☆58 Updated 8 months ago
- look how they massacred my boy ☆63 Updated 3 months ago
- ☆60 Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆182 Updated 7 months ago
- Using modal.com to process FineWeb-edu data ☆19 Updated last month
- ☆48 Updated last year
- Functional Benchmarks and the Reasoning Gap ☆82 Updated 3 months ago
- ☆20 Updated 2 months ago
- ☆37 Updated 5 months ago
- Experiments for efforts to train a new and improved t5 ☆77 Updated 9 months ago
- ☆13 Updated last year
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation. ☆41 Updated 4 months ago
- Code repository for the c-BTM paper ☆105 Updated last year
- Sparse autoencoders for Contra text embedding models ☆25 Updated 8 months ago
- train with kittens! ☆52 Updated 2 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper. ☆46 Updated 7 months ago
- A synthetic story narration dataset to study small audio LMs. ☆31 Updated 11 months ago
- [WIP] Transformer to embed Danbooru labelsets ☆13 Updated 9 months ago
- Data preparation code for CrystalCoder 7B LLM ☆43 Updated 8 months ago
- alternative way to calculate self-attention ☆18 Updated 7 months ago
- ☆22 Updated last year
- ☆18 Updated last year
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs) ☆16 Updated last month