definitive-io / human-eval-sampling-benchmark
OpenAI's human-eval sampling benchmark
☆13Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for human-eval-sampling-benchmark
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…☆169Updated 7 months ago
- auto fine tune of models with synthetic data☆72Updated 9 months ago
- Annoucing Instructor Cloud☆34Updated 3 months ago
- 🤖 Headless IDE for AI agents☆133Updated this week
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️☆77Updated 6 months ago
- A couple scripts to grab stats from email☆40Updated 2 months ago
- An open-source Discord bot, created using LlamaIndex, that - Listens to your server conversations, continuously learns from them & answe…☆72Updated 9 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆99Updated 8 months ago
- AI agent to automatically check grammar and spelling on documentation files☆59Updated last month
- ☆46Updated 7 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆58Updated 7 months ago
- Turn any developer documentation into a GPT☆73Updated last month
- 🔓 The open-source autonomous agent LLM initiative 🔓☆90Updated 9 months ago
- This project involves using llamaindex Multi Agents concierge system and Qdrant vector database to customize the RAG application with use…☆43Updated 3 months ago
- A simple wrapper for OpenAI to log input/outputs.☆103Updated last year
- The next evolution of Agents☆46Updated last week
- Routing on Random Forest (RoRF)☆84Updated last month
- A simple Claude-powered code Interpreter☆82Updated 6 months ago
- CrewAI agents that gather and analyze YouTube comments to generate insights to inform better content creation.☆54Updated 5 months ago
- For LLMs to better code with Jina API☆108Updated last week
- ☆3Updated 3 months ago
- Come join the best place on the internet to learn AI skills. Use code "airouterchat" for an extra 20% off.☆164Updated 3 months ago
- ☆40Updated 6 months ago
- self-improving user memory framework for conversational AI apps☆145Updated last week
- converts url content into JSON with a simple prefix☆61Updated 6 months ago
- A strongly typed Python DSL for developing message passing multi agent systems☆50Updated 7 months ago
- ☆43Updated 5 months ago
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?☆94Updated last month
- ⛓️ build cognitive systems, pythonic☆326Updated this week