definitive-io / human-eval-sampling-benchmarkLinks
OpenAI's human-eval sampling benchmark
☆13Updated last year
Alternatives and similar repositories for human-eval-sampling-benchmark
Users that are interested in human-eval-sampling-benchmark are comparing it to the libraries listed below
Sorting:
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…☆175Updated last year
- Annoucing Instructor Cloud☆36Updated 9 months ago
- ☆47Updated last year
- ☆4Updated 9 months ago
- Turn any developer documentation into a GPT☆95Updated 3 months ago
- A couple scripts to grab stats from email☆42Updated 8 months ago
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️☆77Updated last year
- Build AI Agents with Your Existing Python Code!☆57Updated 7 months ago
- auto fine tune of models with synthetic data☆75Updated last year
- Like Claude Artifacts but lives in a single static HTML page which you can use with any language model of your choosing☆206Updated 3 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆102Updated last year
- Chat with your git repo☆155Updated last year
- Globot is an agent that controls your browser using playwright and GPT-4V.☆133Updated last year
- Your automated SWE fleet to get your tickets from the Backlog to Prod!☆96Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- automatically generate @openai plugins by specifying your API in markdown in smol-developer style☆121Updated 2 years ago
- AI agent workflow for generating profiles of clients and running research tasks for them. There is an agent for each part of the process:…☆81Updated 7 months ago
- converts url content into JSON with a simple prefix☆68Updated last year
- OpenAI's Realtime API minus the enterprise bloat☆46Updated 6 months ago
- ☆79Updated 2 weeks ago
- ☆79Updated last year
- Local Groq Desktop chat app with MCP support☆268Updated 3 weeks ago
- Fluid Database☆114Updated 8 months ago
- ☆18Updated last week
- Turn a Github Repo's contents into a big prompt for long-context models like Claude 3 Opus.☆211Updated 3 months ago
- ☆18Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆81Updated last year
- ☆172Updated 9 months ago
- A memory manager essential for evolving AI to be more human-like, enabling dynamic, context-aware responses through structured memory han…☆29Updated last year
- ⛓️ build cognitive systems, pythonic☆336Updated 6 months ago