clevcode / reversal-curseLinks
Reversal Curse Experiment
☆15Updated last year
Alternatives and similar repositories for reversal-curse
Users that are interested in reversal-curse are comparing it to the libraries listed below
Sorting:
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- Experiments for efforts to train a new and improved t5☆76Updated last year
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆54Updated 4 months ago
- ☆44Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆32Updated 3 months ago
- A simple library for working with Hugging Face models.☆14Updated 6 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆65Updated 2 years ago
- ☆53Updated last year
- Functional Benchmarks and the Reasoning Gap☆88Updated 9 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆40Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last year
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆49Updated this week
- A repository for research on medium sized language models.☆77Updated last year
- Measuring the situational awareness of language models☆37Updated last year
- GoldFinch and other hybrid transformer components☆10Updated last week
- Because it's there.☆16Updated 9 months ago
- RepoQA: Evaluating Long-Context Code Understanding☆111Updated 8 months ago
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 11 months ago
- Pre-training code for CrystalCoder 7B LLM☆54Updated last year
- Training hybrid models for dummies.☆25Updated 6 months ago
- ☆31Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆58Updated 7 months ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆21Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- new optimizer☆20Updated 11 months ago
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆22Updated 5 months ago
- ☆78Updated 3 months ago
- ☆49Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated last year
- ☆87Updated last year