clevcode / reversal-curse
Reversal Curse Experiment
☆13Updated 11 months ago
Related projects: ⓘ
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆30Updated last month
- Repository for the paper Stream of Search: Learning to Search in Language☆70Updated last month
- Official implementation of Goldfish Loss: Mitigating Memorization in Generative LLMs☆68Updated 2 months ago
- ☆29Updated 2 weeks ago
- Experiments for efforts to train a new and improved t5☆76Updated 5 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆23Updated last year
- 📜 [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswa…☆36Updated 10 months ago
- ☆130Updated this week
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆23Updated 3 months ago
- ☆73Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆40Updated 8 months ago
- ☆40Updated 4 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆36Updated 3 weeks ago
- ☆54Updated last week
- ☆30Updated 4 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆62Updated last year
- A repository for research on medium sized language models.☆71Updated 3 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluating☆26Updated last week
- ☆50Updated last month
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated last month
- ☆91Updated last month
- Small, simple agent task environments for training and evaluation☆13Updated last week
- ☆77Updated last month
- LLMs as Collaboratively Edited Knowledge Bases☆40Updated 7 months ago
- Measuring the situational awareness of language models☆31Updated 7 months ago
- ☆37Updated 5 months ago
- ☆45Updated 7 months ago
- ☆27Updated last year
- ☆39Updated 2 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆57Updated last week