openai / GPTs-are-GPTsLinks
☆50Updated last month
Alternatives and similar repositories for GPTs-are-GPTs
Users that are interested in GPTs-are-GPTs are comparing it to the libraries listed below
Sorting:
- ☆29Updated 9 months ago
- ☆75Updated 7 months ago
- Interactive Textbook Demo☆50Updated last month
- Open Source Replication of Anthropic's Alignment Faking Paper☆51Updated 7 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated last month
- 🔔🧠 Easily experiment with popular language agents across diverse reasoning/decision-making benchmarks!☆54Updated 4 months ago
- Streamlit app for recommending eval functions using prompt diffs☆30Updated last year
- ☆62Updated 5 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 3 months ago
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆83Updated 9 months ago
- 🧠 Societies of Mind & Economy of Minds☆70Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 10 months ago
- ☆44Updated 5 months ago
- Agent computer interface for AI software engineer.☆114Updated 2 months ago
- Specification for creating reliable LLM-based conversational agents☆64Updated last month
- ☆11Updated 2 years ago
- [NAACL2025] LiteWebAgent: The Open-Source Suite for VLM-Based Web-Agent Applications☆129Updated 4 months ago
- ☆21Updated last year
- LLM reads a paper and produce a working prototype☆58Updated 7 months ago
- The Library for LLM-based multi-agent applications☆91Updated 4 months ago
- Azure Command-Line Interface☆10Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆106Updated 7 months ago
- ☆14Updated 2 years ago
- How to create rational LLM-based agents? Using game-theoretic workflows!☆84Updated 5 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆138Updated 7 months ago
- CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.☆48Updated 3 weeks ago
- ☆55Updated last year
- An agent orchestration framework for economic agents☆108Updated 3 months ago
- ☆53Updated last year
- Powerful Auto Research powered by LangChain, and Anthropic.☆29Updated last year