v7labs / benchllm
Continuous Integration for LLM powered applications
☆236Updated last year
Alternatives and similar repositories for benchllm:
Users that are interested in benchllm are comparing it to the libraries listed below
- 🎸 Integrating AI plugins to LLMs☆230Updated last year
- ☆154Updated last year
- Fiddler Auditor is a tool to evaluate language models.☆174Updated 10 months ago
- ☆80Updated 2 weeks ago
- ☆195Updated 8 months ago
- A framework for event based autonomous multi-agent systems.☆299Updated 4 months ago
- ☆75Updated last year
- ☆219Updated last year
- Python SDK for running evaluations on LLM generated responses☆258Updated this week
- Plan-Validate-Solve (PVS) Agent for accurate, reliable and reproducable workflow automation☆326Updated last year
- A tool for evaluating LLMs☆400Updated 8 months ago
- ☆184Updated last year
- Test suite for LLM prompts☆46Updated 8 months ago
- A Python library for building GPT-powered agents with state machine logic and chat history memory.☆64Updated last year
- This repository implements the chain of verification paper by Meta AI☆160Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!☆79Updated last year
- Python client library for improving your LLM app accuracy☆96Updated last week
- ☆91Updated last year
- A Toolkit for Creating and Deploying LangChain Apps☆168Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)☆262Updated 10 months ago
- ⛓️ build cognitive systems, pythonic☆329Updated 2 months ago
- ☆270Updated last year
- data cleaning and curation for unstructured text☆329Updated 5 months ago
- ☆197Updated last year
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆117Updated 3 months ago
- 🔓 The open-source autonomous agent LLM initiative 🔓☆90Updated 11 months ago
- Logging and caching superpowers for the openai sdk☆102Updated 10 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆48Updated 4 months ago
- 🦜💯 Flex those feathers!☆239Updated 3 months ago
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆145Updated 9 months ago