clevcode / reversal-curse
Reversal Curse Experiment
☆15 Updated last year
Alternatives and similar repositories for reversal-curse
Users who are interested in reversal-curse are comparing it to the repositories listed below
- Code for reproducing our paper "Are Sparse Autoencoders Useful? A Case Study in Sparse Probing" ☆20 Updated 2 months ago
- Lottery Ticket Adaptation ☆39 Updated 7 months ago
- LLMs as Collaboratively Edited Knowledge Bases ☆45 Updated last year
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness. ☆26 Updated 10 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆33 Updated 8 months ago
- ☆32 Updated 5 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 Updated 2 months ago
- Minimum Description Length probing for neural network representations ☆18 Updated 5 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 Updated last year
- ☆76 Updated 3 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆27 Updated last year
- Understanding how features learned by neural networks evolve throughout training ☆36 Updated 8 months ago
- Measuring the situational awareness of language models ☆35 Updated last year
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076. ☆25 Updated last year
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs) ☆23 Updated 6 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 Updated 2 years ago
- Experiments for efforts to train a new and improved t5 ☆77 Updated last year
- ☆53 Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders" ☆22 Updated 4 months ago
- ☆65 Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆36 Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆65 Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 Updated 2 years ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec… ☆21 Updated last year
- Official repo of dataset-decomposition paper [NeurIPS 2024] ☆19 Updated 5 months ago
- EvaByte: Efficient Byte-level Language Models at Scale ☆102 Updated 2 months ago
- ☆15 Updated 2 months ago
- ☆26 Updated 5 months ago
- Replicating O1 inference-time scaling laws ☆87 Updated 6 months ago