carlini / chess-llmLinks
Play chess against large language models.
☆49Updated 3 months ago
Alternatives and similar repositories for chess-llm
Users that are interested in chess-llm are comparing it to the libraries listed below
Sorting:
- ☆144Updated 5 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆134Updated 2 years ago
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆197Updated 2 years ago
- Measuring the situational awareness of language models☆39Updated last year
- RuLES: a benchmark for evaluating rule-following in language models☆245Updated 10 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- A puzzle to learn about prompting☆135Updated 2 years ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆198Updated last year
- OMNI: Open-endedness via Models of human Notions of Interestingness☆57Updated 11 months ago
- (NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling☆130Updated 2 years ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆221Updated last month
- ☆220Updated 2 years ago
- A dataset of alignment research and code to reproduce it☆78Updated 2 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆216Updated 2 weeks ago
- ☆64Updated this week
- ☆100Updated last year
- General multi-task deep RL Agent☆185Updated last year
- Implementation of OpenAI's 'Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets' paper.☆40Updated 2 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆132Updated 3 years ago
- Multi-Domain Expert Learning☆67Updated last year
- ☆92Updated 11 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 10 months ago
- Implementation of the Llama architecture with RLHF + Q-learning☆170Updated 11 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Updated last year
- ☆55Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Updated 2 years ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆218Updated last year
- SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. …☆144Updated last year
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆210Updated 2 years ago
- Sparse and discrete interpretability tool for neural networks☆65Updated last year