openai / democratic-inputs
☆74 Updated 10 months ago
Alternatives and similar repositories for democratic-inputs
Users who are interested in democratic-inputs are comparing it to the libraries listed below.
- ☆58 Updated 4 months ago
- Repo for the paper on Escalation Risks of AI systems ☆44 Updated last year
- An Open Source Playground with Agent Datasets and APIs for building and testing your own Autonomous Web Agents ☆201 Updated 2 years ago
- ☆118 Updated 3 weeks ago
- Azure Command-Line Interface ☆11 Updated 2 years ago
- Cognition's results and methodology on SWE-bench ☆123 Updated last year
- Interactive Textbook Demo ☆53 Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆34 Updated last year
- Specification for creating reliable LLM-based conversational agents ☆65 Updated 3 months ago
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication ☆59 Updated last year
- ☆25 Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis ☆199 Updated last week
- ☆329 Updated last year
- A framework for generative software. ☆115 Updated 7 months ago
- A system that tries to resolve all issues on a github repo with OpenHands. ☆117 Updated last year
- Data processing for the Collective Constitutional AI project (a collaboration between The Collective Intelligence Project & Anthropic) ☆26 Updated 2 years ago
- Sphynx Hallucination Induction ☆53 Updated last year
- Testing baseline LLMs performance across various models ☆336 Updated this week
- Problem solving by engaging multiple AI agents in conversation with each other and the user. ☆237 Updated 2 years ago
- ☆28 Updated last year
- Finetune Llama-3-8b on the MathInstruct dataset ☆115 Updated last year
- Lightweight demo using the Anthropic Python SDK to experiment with Claude's Search and Retrieval capabilities over a variety of knowledge… ☆177 Updated last year
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆133 Updated last week
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". ☆127 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 Updated last year
- Curation of prompts that are known to be adversarial to large language models ☆188 Updated 2 years ago
- Draw more samples ☆198 Updated last year
- Fluentd output plugin that sends events to Amazon Kinesis Streams and Amazon Kinesis Firehose. ☆12 Updated 2 years ago
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci… ☆24 Updated 2 years ago
- ☆49 Updated 7 months ago