definitive-io / human-eval-sampling-benchmark
OpenAI's human-eval sampling benchmark
☆13Updated last year
Alternatives and similar repositories for human-eval-sampling-benchmark:
Users that are interested in human-eval-sampling-benchmark are comparing it to the libraries listed below
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…☆174Updated 11 months ago
- A framework for LLM's that works as a GPS to reduce hallucinations in production [WIP] - The Linux Kernel for Agents☆45Updated 2 months ago
- Demo of AI chatbot that predicts user message to generate response quickly.☆101Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated 11 months ago
- ☆52Updated last year
- auto fine tune of models with synthetic data☆75Updated last year
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️☆78Updated 10 months ago
- A MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live…☆41Updated last year
- Fluid Database☆114Updated 6 months ago
- OpenAI's Realtime API minus the enterprise bloat☆44Updated 4 months ago
- 😎 Sagentic.ai Agent Framework - Sagentic.ai is a unified platform for building, running and scaling autonomous agents.☆69Updated last month
- A spotify playlist agent using CrewAI☆81Updated 10 months ago
- ☆170Updated 7 months ago
- A prompt management, versioning, testing, and evaluation inference server and UI toolkit. Provider agnostic and OpenAI API compatible.☆55Updated last week
- Chat with PDF using Zephyr 7B Alpha, Langchain, ChromaDB, and Gradio with Free Google Colab☆137Updated last year
- ☆38Updated last year
- Generate dynamic UI forms from text using OpenAI's structured output API☆54Updated 8 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 5 months ago
- ☆77Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- Turn any developer documentation into a GPT☆91Updated last month
- ☆29Updated 4 months ago
- ☆4Updated 7 months ago
- Keeping my personal experiments separate from the main repo☆65Updated last month
- A couple scripts to grab stats from email☆42Updated 6 months ago
- finance agent + with generative ui☆96Updated last year
- ☆58Updated this week
- ⛓️ build cognitive systems, pythonic☆333Updated 4 months ago
- Chat with your git repo☆155Updated last year
- Turn a Github Repo's contents into a big prompt for long-context models like Claude 3 Opus.☆192Updated last month