carlini / chess-llm
Play chess against large language models.
☆46Updated last year
Alternatives and similar repositories for chess-llm:
Users that are interested in chess-llm are comparing it to the libraries listed below
- ☆96Updated 10 months ago
- Measuring the situational awareness of language models☆34Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆98Updated 4 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- 🔬 Interpretability for Leela Chess Zero networks.☆12Updated this week
- Repo to reproduce the First-Explore paper results☆37Updated 4 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 10 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆74Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆145Updated 2 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆73Updated 5 months ago
- ☆21Updated 6 months ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆55Updated 2 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆69Updated 10 months ago
- Adversarial Attacks on GPT-4 via Simple Random Search [Dec 2023]☆43Updated 11 months ago
- ☆128Updated 3 weeks ago
- ☆133Updated 5 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness☆44Updated 2 months ago
- ☆26Updated last year
- Experiments for efforts to train a new and improved t5☆77Updated last year
- ☆73Updated this week
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆58Updated 5 months ago
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆40Updated 2 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆45Updated last month
- Sparse and discrete interpretability tool for neural networks☆62Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆189Updated 10 months ago
- ☆29Updated last year
- ☆43Updated last year
- unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"☆78Updated 2 years ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆204Updated 5 months ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆19Updated last year