openai / democratic-inputs
☆74 Updated 5 months ago
Alternatives and similar repositories for democratic-inputs
Users who are interested in democratic-inputs are comparing it to the repositories listed below
- ☆103 Updated 6 months ago
- ☆35 Updated last year
- Repo for the paper on Escalation Risks of AI systems ☆44 Updated last year
- An Open Source Playground with Agent Datasets and APIs for building and testing your own Autonomous Web Agents ☆196 Updated last year
- A system that tries to resolve all issues on a github repo with OpenHands. ☆113 Updated 10 months ago
- Interaction-first method for generating demonstrations for web-agents on any website ☆48 Updated 4 months ago
- A framework for generative software. ☆113 Updated 2 months ago
- A library that allows interacting with Replit's code-exec API ☆26 Updated 8 months ago
- ☆298 Updated last year
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication ☆59 Updated 9 months ago
- Lightweight demo using the Anthropic Python SDK to experiment with Claude's Search and Retrieval capabilities over a variety of knowledge… ☆168 Updated last year
- Testing baseline LLMs performance across various models ☆309 Updated last month
- Public repository containing METR's DVC pipeline for eval data analysis ☆108 Updated 5 months ago
- A repo built for the purpose of benchmarking the performance of agents, regardless of how they are set up and how they work. ☆277 Updated last year
- Meta-prompt: a simple self-improving language agent ☆89 Updated 2 years ago
- Specification for creating reliable LLM-based conversational agents ☆54 Updated 2 months ago
- ☆48 Updated last week
- Cognition's results and methodology on SWE-bench ☆120 Updated last year
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite. ☆96 Updated 5 months ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". ☆115 Updated last year
- ☆23 Updated last year
- Sphynx Hallucination Induction ☆53 Updated 7 months ago
- 🚀 The LLM Automatic Computer Framework: L2MAC ☆136 Updated 8 months ago
- ☆29 Updated last year
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs" ☆54 Updated 6 months ago
- WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks? ☆208 Updated this week
- Finetune Llama-3-8b on the MathInstruct dataset ☆111 Updated 11 months ago
- Curation of prompts that are known to be adversarial to large language models ☆185 Updated 2 years ago
- ☆132 Updated 2 years ago
- An experimental open-source attempt to make GPT-4 fully autonomous. ☆31 Updated 2 years ago