definitive-io / human-eval-sampling-benchmark
OpenAI's human-eval sampling benchmark
☆13 Updated 11 months ago
Alternatives and similar repositories for human-eval-sampling-benchmark:
Users who are interested in human-eval-sampling-benchmark are comparing it to the libraries listed below:
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo… ☆171 Updated 9 months ago
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️ ☆78 Updated 7 months ago
- A framework for LLMs that works as a GPS to reduce hallucinations in production ☆44 Updated 8 months ago
- 😎 Sagentic.ai Agent Framework - Sagentic.ai is a unified platform for building, running and scaling autonomous agents. ☆69 Updated this week
- A toolkit for building multimodal AI agents ☆133 Updated this week
- Turn any developer documentation into a GPT ☆81 Updated 3 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 Updated 9 months ago
- The Identity layer for the agentic world ☆159 Updated this week
- A couple scripts to grab stats from email ☆40 Updated 4 months ago
- Demo of AI chatbot that predicts user message to generate response quickly. ☆100 Updated 10 months ago
- OpenAI's Realtime API minus the enterprise bloat ☆42 Updated last month
- ☆46 Updated 9 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆73 Updated this week
- Keeping my personal experiments separate from the main repo ☆64 Updated 4 months ago
- A repository of Payman + Langgraph integration examples that allow an AI Agent to simply create tasks for Humans on Payman that pay them money … ☆75 Updated 3 months ago
- Scrapybara Python SDK ☆36 Updated this week
- auto fine tune of models with synthetic data ☆74 Updated 11 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆58 Updated 6 months ago
- Official Scrapybara demos as seen on X ☆60 Updated last week
- An open-source Discord bot, created using LlamaIndex, that - Listens to your server conversations, continuously learns from them & answe… ☆75 Updated 11 months ago
- ☆44 Updated 7 months ago
- ☆113 Updated 7 months ago
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework. ☆86 Updated 3 months ago
- A growing collection of guides and tools based on Anthropic's Model Context Protocol standard for interfacing with LLMs ☆42 Updated this week
- ☆45 Updated last month
- Build your Swarm of Internet Agents using MultiOn 🚀 ☆76 Updated last year
- Giving Claude ability to run code with E2B via MCP (Model Context Protocol) ☆64 Updated last month
- Declarative framework to build LLM-based applications ☆112 Updated 2 months ago
- Prompt design in Python ☆49 Updated last month