clevcode / reversal-curseLinks
Reversal Curse Experiment
☆15Updated last year
Alternatives and similar repositories for reversal-curse
Users that are interested in reversal-curse are comparing it to the libraries listed below
Sorting:
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆41Updated last year
- Measuring the situational awareness of language models☆38Updated last year
- Experiments for efforts to train a new and improved t5☆76Updated last year
- ☆55Updated last year
- ☆72Updated last year
- ☆78Updated 5 months ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 11 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated 5 months ago
- Code repository for the c-BTM paper☆107Updated last year
- Understanding how features learned by neural networks evolve throughout training☆39Updated 10 months ago
- new optimizer☆20Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆44Updated last year
- ☆104Updated 11 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated 2 years ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆53Updated 5 months ago
- ☆52Updated last year
- RepoQA: Evaluating Long-Context Code Understanding☆117Updated 10 months ago
- ☆25Updated 8 months ago
- ☆100Updated 8 months ago
- ☆69Updated last year
- ☆39Updated last year
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆78Updated 9 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆65Updated 10 months ago
- Replicating O1 inference-time scaling laws☆90Updated 9 months ago
- Code for the paper "Fishing for Magikarp"☆165Updated 4 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆72Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆62Updated 9 months ago
- Simple and efficient pytorch-native transformer training and inference (batched)☆78Updated last year
- Open Implementations of LLM Analyses☆107Updated 11 months ago