openai / evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
☆16,215 Updated 5 months ago
Alternatives and similar repositories for evals
Users who are interested in evals compare it to the libraries listed below.
- The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language. ☆21,183 Updated 10 months ago
- ☆21,498 Updated 6 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆14,662 Updated 2 months ago
- Examples and guides for using the OpenAI API ☆64,360 Updated this week
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf ☆24,150 Updated 8 months ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆41,989 Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆30,009 Updated 10 months ago
- AI PDF chatbot agent built with LangChain & LangGraph ☆15,518 Updated 3 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,101 Updated this week
- 🦜🔗 Build context-aware reasoning applications ☆108,279 Updated this week
- 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser. ☆34,179 Updated last month
- Instruct-tune LLaMA on consumer hardware ☆18,904 Updated 10 months ago
- The official Python library for the OpenAI API ☆26,844 Updated this week
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… ☆37,362 Updated 9 months ago
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/) ☆25,679 Updated 8 months ago
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index. ☆7,566 Updated 8 months ago
- the AI-native open-source embedding database ☆20,090 Updated this week
- A guidance language for controlling large language models. ☆20,238 Updated last week
- ☆34,466 Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆38,670 Updated last week
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,446 Updated 11 months ago
- Get a ChatGPT plugin up and running in under 5 minutes! ☆4,236 Updated last year
- LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath ☆9,411 Updated 9 months ago
- ☆5,900 Updated 2 weeks ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,634 Updated 8 months ago
- ☆9,010 Updated last year
- StableLM: Stability AI Language Models ☆15,832 Updated last year
- An Open-Ended Embodied Agent with Large Language Models ☆6,145 Updated last year
- AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus o… ☆175,738 Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆41,517 Updated 5 months ago