clevcode / reversal-curseLinks
Reversal Curse Experiment
☆15Updated last year
Alternatives and similar repositories for reversal-curse
Users that are interested in reversal-curse are comparing it to the libraries listed below
Sorting:
- Measuring the situational awareness of language models☆35Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- ☆51Updated last year
- Understanding how features learned by neural networks evolve throughout training☆34Updated 7 months ago
- SILO Language Models code repository☆81Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last year
- Experiments for efforts to train a new and improved t5☆77Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- ☆51Updated last year
- ☆48Updated 10 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated last month
- ☆15Updated last month
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆23Updated 6 months ago
- Latent Large Language Models☆18Updated 9 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆52Updated 2 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆65Updated last year
- ☆29Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆22Updated 4 months ago
- ☆23Updated last year
- ☆32Updated 4 months ago
- Minimum Description Length probing for neural network representations☆19Updated 4 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆62Updated last year
- Training hybrid models for dummies.☆21Updated 4 months ago
- Lottery Ticket Adaptation☆39Updated 6 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 10 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆18Updated 4 months ago
- Efficiently computing & storing token n-grams from large corpora☆23Updated 8 months ago