definitive-io / human-eval-sampling-benchmark
OpenAI's human-eval sampling benchmark
☆13Updated last year
Alternatives and similar repositories for human-eval-sampling-benchmark:
Users that are interested in human-eval-sampling-benchmark are comparing it to the libraries listed below
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…☆171Updated 10 months ago
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️☆78Updated 9 months ago
- Turn any developer documentation into a GPT☆87Updated 4 months ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆141Updated 10 months ago
- A couple scripts to grab stats from email☆41Updated 5 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆102Updated 11 months ago
- ☆38Updated 11 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆118Updated last year
- Annoucing Instructor Cloud☆34Updated 6 months ago
- auto fine tune of models with synthetic data☆75Updated last year
- Chat with your git repo☆156Updated last year
- A framework that uses multi-agents to enable users to perform a systematic data science pipeline with just two inputs.☆38Updated 6 months ago
- The Identity layer for the agentic world☆165Updated this week
- A memory manager essential for evolving AI to be more human-like, enabling dynamic, context-aware responses through structured memory han…☆27Updated 10 months ago
- Build your Swarm of Internet Agents using MultiOn 🚀☆77Updated last year
- ☆265Updated 6 months ago
- Sample web apps built with xRx-Core☆155Updated 3 weeks ago
- A repository Payman + Langgraph integration examples that allow AI Agent to simply create tasks for Humans on Payman that pay them money …☆79Updated 4 months ago
- Improve your questions! The AI for Inquiry - QuestionImprover Agent is an LLM-driven “tool for thought” designed to enhance the depth and…☆141Updated this week
- ☆29Updated 2 months ago
- LLM and Langchain powered chatbot to handle Google Calendar tasks☆167Updated last year
- Action library for AI Agent☆209Updated this week
- ⛓️ build cognitive systems, pythonic☆331Updated 3 months ago
- ☆72Updated last year
- Your automated SWE fleet to get your tickets from the Backlog to Prod!☆95Updated 10 months ago
- Giving Claude ability to run code with E2B via MCP (Model Context Protocol)☆90Updated last week
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated 10 months ago
- Tools for LLM agents.☆59Updated 2 months ago
- A spotify playlist agent using CrewAI☆81Updated 8 months ago