v7labs / benchllm
Continuous Integration for LLM powered applications
☆238 · Updated last year
Alternatives and similar repositories for benchllm:
Users that are interested in benchllm are comparing it to the libraries listed below
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆320 · Updated 5 months ago
- ☆185 · Updated last year
- A tool for evaluating LLMs ☆417 · Updated 11 months ago
- ☆219 · Updated last year
- Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon) ☆263 · Updated last year
- A Python implementation of priompt - a neat way of managing context from diverse sources for LLM applications. ☆109 · Updated 8 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆149 · Updated 6 months ago
- ☆161 · Updated last year
- Natural Language Interfaces Powered by LLMs ☆90 · Updated 8 months ago
- Local LLM ReAct Agent with Guidance ☆158 · Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 · Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆107 · Updated 7 months ago
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ☆81 · Updated last year
- ✦ The intuitive Python LLM framework ☆171 · Updated 4 months ago
- 🎸 Integrating AI plugins to LLMs ☆229 · Updated last year
- ☆75 · Updated last year
- A framework for event based autonomous multi-agent systems. ☆305 · Updated 7 months ago
- Plan-Validate-Solve (PVS) Agent for accurate, reliable and reproducible workflow automation ☆333 · Updated last year
- LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores a… ☆51 · Updated last week
- 🦜💯 Flex those feathers! ☆245 · Updated 6 months ago
- syntactic sugar 🍭 for langchain ☆233 · Updated 6 months ago
- 📚 Datasets and models for instruction-tuning ☆238 · Updated last year
- Leverage your LangChain trace data for fine tuning ☆41 · Updated 8 months ago
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation. ☆116 · Updated last year
- Python client library for improving your LLM app accuracy ☆98 · Updated 2 months ago
- Fiddler Auditor is a tool to evaluate language models. ☆179 · Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓 ☆91 · Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 · Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆422 · Updated last year
- ☆205 · Updated last year