fiddler-labs / fiddler-auditor
Fiddler Auditor is a tool to evaluate language models.
☆176 · Updated last year
Alternatives and similar repositories for fiddler-auditor:
Users who are interested in fiddler-auditor are comparing it to the libraries listed below.
- Red-Teaming Language Models with DSPy ☆175 · Updated last month
- A tool for evaluating LLMs ☆407 · Updated 10 months ago
- Sample notebooks and prompts for LLM evaluation ☆123 · Updated 3 months ago
- ☆184 · Updated last year
- 📚 A curated list of papers & technical articles on AI Quality & Safety ☆171 · Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆316 · Updated 4 months ago
- WorkBench: a Benchmark Dataset for Agents in a Realistic Workplace Setting. ☆37 · Updated 8 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆107 · Updated 6 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆100 · Updated this week
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati… ☆235 · Updated 5 months ago
- ☆159 · Updated last year
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications. ☆288 · Updated 4 months ago
- ☆85 · Updated last year
- 🦜💯 Flex those feathers! ☆242 · Updated 5 months ago
- ☆165 · Updated this week
- Collection of recipes aiding Gen AI model development ☆100 · Updated last week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆107 · Updated 9 months ago
- ☆165 · Updated 9 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 · Updated 11 months ago
- A framework-less approach to robust agent development. ☆156 · Updated this week
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ☆80 · Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆107 · Updated last week
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph ☆144 · Updated 11 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆148 · Updated 5 months ago
- Framework for building data agent workflows ☆83 · Updated 7 months ago
- Leverage your LangChain trace data for fine tuning ☆41 · Updated 7 months ago
- AI Evaluation Platform ☆46 · Updated last week
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning) ☆392 · Updated last year
- 📚 Datasets and models for instruction-tuning ☆235 · Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 · Updated last year