openai / democratic-inputs
☆71 · Updated 2 months ago
Alternatives and similar repositories for democratic-inputs
Users who are interested in democratic-inputs are comparing it to the repositories listed below.
- ☆29 · Updated last year
- Code for ExploreTom ☆84 · Updated 6 months ago
- Interaction-first method for generating demonstrations for web-agents on any website ☆41 · Updated last month
- LLM-powered autonomous agent with hierarchical task management ☆49 · Updated 2 years ago
- An Open Source Playground with Agent Datasets and APIs for building and testing your own Autonomous Web Agents ☆193 · Updated last year
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs" ☆51 · Updated 4 months ago
- ☆98 · Updated 3 months ago
- ☆82 · Updated last year
- Repo for the paper on Escalation Risks of AI systems ☆40 · Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 · Updated last year
- Finetune Llama-3-8b on the MathInstruct dataset ☆110 · Updated 8 months ago
- Sphynx Hallucination Induction ☆54 · Updated 4 months ago
- Public repository containing METR's DVC pipeline for eval data analysis ☆70 · Updated 2 months ago
- A distributed agent orchestration framework for market agents ☆102 · Updated this week
- The Foundation Model Transparency Index ☆81 · Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language ☆148 · Updated 4 months ago
- A system that tries to resolve all issues on a github repo with OpenHands. ☆108 · Updated 7 months ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". ☆108 · Updated last year
- ☆89 · Updated last week
- ☆134 · Updated 7 months ago
- A benchmark for evaluating learning agents based on just language feedback ☆81 · Updated 2 weeks ago
- The Granite Guardian models are designed to detect risks in prompts and responses. ☆88 · Updated 3 months ago
- Scale your LLM-as-a-judge. ☆240 · Updated 2 weeks ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning ☆184 · Updated this week
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆35 · Updated last month
- A preprint version of our recent research on the capability of frontier AI systems to do self-replication ☆59 · Updated 6 months ago
- ☆29 · Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆173 · Updated 3 months ago
- Draw more samples ☆191 · Updated last year
- LLM finetuning ☆42 · Updated last year