HanClinto / MENTAT
☆9Updated 4 months ago
Alternatives and similar repositories for MENTAT:
Users that are interested in MENTAT are comparing it to the libraries listed below
- ☆123Updated last month
- gzip Predicts Data-dependent Scaling Laws☆34Updated 9 months ago
- ☆88Updated last month
- Thorn in a HaizeStack test for evaluating long-context adversarial robustness.☆26Updated 7 months ago
- a curated list of data for reasoning ai☆130Updated 7 months ago
- A repo to evaluate various LLM's chess playing abilities.☆78Updated 11 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆139Updated last month
- Experiments for efforts to train a new and improved t5☆77Updated 10 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆41Updated last year
- Sparse and discrete interpretability tool for neural networks☆59Updated last year
- ☆60Updated last year
- ☆45Updated 11 months ago
- Functional Benchmarks and the Reasoning Gap☆84Updated 5 months ago
- ☆22Updated last year
- ☆53Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆201Updated 3 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆52Updated 3 months ago
- Code repository for the c-BTM paper☆106Updated last year
- Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …☆116Updated 11 months ago
- Alice in Wonderland code base for experiments and raw experiments data☆128Updated last month
- Code for ExploreTom☆76Updated 3 months ago
- The history files when recording human interaction while solving ARC tasks☆97Updated this week
- ☆26Updated 11 months ago
- Understanding how features learned by neural networks evolve throughout training☆33Updated 4 months ago
- ☆78Updated 10 months ago
- The repository contains code for Adaptive Data Optimization☆20Updated 3 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆116Updated 2 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- ☆111Updated last month