meaningalignment / dft
Democratic Fine-tuning with a Moral Graph
☆22 Updated 3 months ago
Alternatives and similar repositories for dft
Users who are interested in dft are comparing it to the repositories listed below:
- Public repo for my book Symphony of Thought: Orchestrating Artificial Cognition ☆114 Updated 2 years ago
- A Loom implementation in Obsidian ☆280 Updated 5 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆78 Updated this week
- A dataset of alignment research and code to reproduce it ☆73 Updated last year
- General-Sum variant of the game Diplomacy for evaluating AIs. ☆29 Updated 10 months ago
- Interactive Composition Explorer: a debugger for compositional language model programs ☆544 Updated last month
- Code for the paper: "Large Language Models as Corporate Lobbyists" (2023). ☆171 Updated 2 years ago
- ☆253 Updated 7 months ago
- A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API ☆30 Updated last month
- Promoting critical thinking through machine-generated prompts. ☆18 Updated 3 years ago
- Problem solving by engaging multiple AI agents in conversation with each other and the user. ☆211 Updated last year
- Command-line recursive question-answering with immutable contexts and explicit data store ☆25 Updated 6 years ago
- ☆59 Updated 9 months ago
- METR Task Standard ☆142 Updated 2 weeks ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations. ☆195 Updated last week
- Reduce suffering, increase prosperity, increase understanding. A proposed framework to address the Control Problem. ☆138 Updated last year
- Causal DAG Extraction from Text (DEFT) ☆65 Updated last month
- The AI assistant for Obsidian that helps you write better and think more clearly ☆66 Updated 2 years ago
- A TypeScript-based MCP server that implements a simple loom and makes it available for Claude to use. ☆18 Updated 2 months ago
- Factored Cognition Primer: How to write compositional language model programs ☆48 Updated last year
- An initiative to create concise and widely shareable educational resources, infographics, and animated explainers on the latest contribut… ☆18 Updated last year
- ☆90 Updated 8 months ago
- A langchain app to visualise a debate using Tree-of-Thought reasoning ☆58 Updated 11 months ago
- ☆52 Updated last year
- ☆15 Updated last week
- Sphynx Hallucination Induction ☆52 Updated 3 weeks ago
- Record and replay LLM interactions for langchain ☆79 Updated 7 months ago
- Python version of cognitive architecture MicroPsi ☆179 Updated 2 years ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you… ☆64 Updated 2 months ago
- Collection of Tree of Thoughts prompting techniques I've found useful to start with, then stylize, then iterate ☆83 Updated last year