adamkarvonen / chess_gpt_eval
A repo to evaluate various LLM's chess playing abilities.
☆79Updated 11 months ago
Alternatives and similar repositories for chess_gpt_eval:
Users that are interested in chess_gpt_eval are comparing it to the libraries listed below
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆201Updated 4 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year
- A repository for training nanogpt-based Chess playing language models.☆23Updated 10 months ago
- Draw more samples☆186Updated 9 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆168Updated this week
- ☆60Updated last year
- ☆80Updated 2 months ago
- Full finetuning of large language models without large memory requirements☆93Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆137Updated last month
- look how they massacred my boy☆63Updated 5 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆142Updated last month
- a curated list of data for reasoning ai☆131Updated 7 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 7 months ago
- Evaluating LLMs with CommonGen-Lite☆89Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 10 months ago
- ☆38Updated 7 months ago
- Simple Transformer in Jax☆136Updated 9 months ago
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- Code repository for the c-BTM paper☆106Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- The history files when recording human interaction while solving ARC tasks☆97Updated this week
- ☆111Updated 3 months ago
- ☆48Updated last year
- ☆19Updated last year
- gzip Predicts Data-dependent Scaling Laws☆34Updated 9 months ago