adamkarvonen / chess_gpt_eval
A repo to evaluate various LLM's chess playing abilities.
☆75Updated 10 months ago
Alternatives and similar repositories for chess_gpt_eval:
Users that are interested in chess_gpt_eval are comparing it to the libraries listed below
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆199Updated 2 months ago
- A repository for training nanogpt-based Chess playing language models.☆23Updated 9 months ago
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year
- ☆80Updated last month
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆164Updated this week
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆30Updated 2 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆129Updated last week
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆137Updated last week
- ☆37Updated 6 months ago
- ☆60Updated last year
- Draw more samples☆186Updated 7 months ago
- ☆48Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 9 months ago
- look how they massacred my boy☆63Updated 3 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated 6 months ago
- Chat Markup Language conversation library☆55Updated last year
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆70Updated 6 months ago
- The history files when recording human interaction while solving ARC tasks☆97Updated this week
- Code repository for the c-BTM paper☆105Updated last year
- ☆94Updated last year
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- Use context-free grammars with an LLM☆167Updated 10 months ago
- Evaluating LLMs with CommonGen-Lite☆88Updated 10 months ago
- A strongly typed Python DSL for developing message passing multi agent systems☆52Updated 10 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆221Updated 9 months ago
- ☆74Updated last year