StampyAI / stampy-chat
Conversational chatbot to answer questions about AI Safety & Alignment based on information retrieved from the Alignment Research Dataset
☆13 · Updated 2 months ago
Related projects:
- A dataset of alignment research and code to reproduce it ☆68 · Updated last year
- Drive a browser with Cohere ☆72 · Updated last year
- Measuring the situational awareness of language models ☆31 · Updated 7 months ago
- ☆58 · Updated this week
- A writeup on some experiments on a sequence model for chess games ☆27 · Updated 3 years ago
- ☆24 · Updated 5 months ago
- ☆18 · Updated last year
- Language-annotated Abstraction and Reasoning Corpus ☆76 · Updated last year
- Formal Contracts for Multi-Agent Reinforcement Learning ☆16 · Updated 10 months ago
- PyTorch implementation of OpenAI's Procgen ppo-baseline, built from scratch. ☆14 · Updated 4 months ago
- Domain Specific Language for the Abstraction and Reasoning Corpus ☆152 · Updated last month
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22 ☆62 · Updated last year
- ☆91 · Updated 5 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip. ☆42 · Updated last year
- ☆19 · Updated 4 months ago
- Factored Cognition Primer: How to write compositional language model programs ☆48 · Updated last year
- Mechanistic Interpretability for Transformer Models ☆48 · Updated 2 years ago
- The history files when recording human interaction while solving ARC tasks ☆91 · Updated this week
- Fast inference of instruct-tuned LLaMA on your personal devices. ☆22 · Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆23 · Updated 3 months ago
- My writings about ARC (Abstraction and Reasoning Corpus) ☆55 · Updated 3 weeks ago
- Materials for ConceptARC paper ☆71 · Updated 4 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆60 · Updated 4 months ago
- Tools for studying developmental interpretability in neural networks. ☆69 · Updated this week
- ☆72 · Updated 2 months ago
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec… ☆16 · Updated 6 months ago
- Redwood Research's transformer interpretability tools ☆11 · Updated 2 years ago
- ☆120 · Updated 2 months ago
- ☆29 · Updated 10 months ago