openai / evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
☆17,593 Updated 2 months ago
Alternatives and similar repositories for evals
Users who are interested in evals compare it to the libraries listed below.
- The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language. ☆21,230 Updated last year
- Examples and guides for using the OpenAI API ☆71,101 Updated this week
- ☆22,096 Updated last year
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆46,545 Updated this week
- A guidance language for controlling large language models. ☆21,201 Updated 3 weeks ago
- 🦜🔗 The platform for reliable agents. ☆125,145 Updated this week
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆30,273 Updated last year
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,690 Updated last week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index. ☆7,909 Updated 6 months ago
- Inference code for Llama models ☆59,075 Updated last year
- Instruct-tune LLaMA on consumer hardware ☆18,980 Updated last year
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset ☆7,529 Updated 2 years ago
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆39,375 Updated 7 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆17,066 Updated 3 months ago
- Get a ChatGPT plugin up and running in under 5 minutes! ☆4,238 Updated last year
- Running large language models on a single GPU for throughput-oriented scenarios. ☆9,380 Updated last year
- ☆6,229 Updated 3 weeks ago
- DSPy: The framework for programming—not prompting—language models ☆31,716 Updated this week
- Large Language Model Text Generation Inference ☆10,739 Updated 2 weeks ago
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical… ☆37,468 Updated last year
- Open-source search and retrieval database for AI applications. ☆25,639 Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,822 Updated last year
- StableLM: Stability AI Language Models ☆15,774 Updated last year
- Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning … ☆4,562 Updated 5 months ago
- Universal LLM Deployment Engine with ML Compilation ☆21,943 Updated this week
- A collection of libraries to optimise AI model performances ☆8,354 Updated last year
- The official Python library for the OpenAI API ☆29,752 Updated this week
- General technology for enabling AI capabilities w/ LLMs and MLLMs ☆4,262 Updated last month
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems. ☆5,538 Updated this week
- ☆3,395 Updated 2 years ago