carlini / chess-llm
Play chess against large language models.
☆49 Updated 2 months ago
Alternatives and similar repositories for chess-llm
Users that are interested in chess-llm are comparing it to the libraries listed below
- Emergent world representations: Exploring a sequence model trained on a synthetic task ☆191 Updated 2 years ago
- Measuring the situational awareness of language models ☆39 Updated last year
- ☆221 Updated 2 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations. ☆214 Updated last week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆194 Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated 2 years ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆133 Updated 2 years ago
- σ-GPT: A New Approach to Autoregressive Models ☆70 Updated last year
- Benchmarking Agentic LLM and VLM Reasoning On Games ☆207 Updated this week
- Implementation of the Llama architecture with RLHF + Q-learning ☆168 Updated 9 months ago
- A repo to evaluate various LLMs' chess-playing abilities. ☆85 Updated last year
- ☆128 Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language ☆151 Updated 9 months ago
- Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper. ☆39 Updated 2 years ago
- RuLES: a benchmark for evaluating rule-following in language models ☆240 Updated 9 months ago
- ☆100 Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning. ☆119 Updated last week
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023 ☆138 Updated last year
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning" ☆210 Updated 2 years ago
- ☆23 Updated last year
- Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT ☆224 Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec… ☆22 Updated last year
- Multi-Domain Expert Learning ☆67 Updated last year
- ☆142 Updated 4 months ago
- A dataset of alignment research and code to reproduce it ☆78 Updated 2 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆130 Updated 3 years ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆60 Updated last year
- Evaluating LLMs with CommonGen-Lite ☆91 Updated last year
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI ☆31 Updated last year