fiddler-labs / fiddler-auditor
Fiddler Auditor is a tool to evaluate language models.
☆188 Updated last year
Alternatives and similar repositories for fiddler-auditor
Users who are interested in fiddler-auditor are comparing it to the libraries listed below:
- A tool for evaluating LLMs ☆424 Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆115 Updated 2 months ago
- 📚 A curated list of papers & technical articles on AI Quality & Safety ☆192 Updated 6 months ago
- Sample notebooks and prompts for LLM evaluation ☆151 Updated 2 weeks ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning) ☆397 Updated last year
- Continuous Integration for LLM powered applications ☆254 Updated 2 years ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆329 Updated 11 months ago
- ☆186 Updated 2 years ago
- Python SDK for running evaluations on LLM generated responses ☆292 Updated 4 months ago
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications. ☆318 Updated 3 months ago
- ☆172 Updated last week
- Red-Teaming Language Models with DSPy ☆221 Updated 8 months ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act ☆93 Updated 2 years ago
- ☆175 Updated last year
- Framework for LLM evaluation, guardrails and security ☆113 Updated last year
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati… ☆247 Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆120 Updated last month
- 🦜💯 Flex those feathers! ☆252 Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆144 Updated last week
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆151 Updated last year
- AI Verify ☆36 Updated 2 weeks ago
- 📚 Datasets and models for instruction-tuning ☆237 Updated 2 years ago
- ☆88 Updated 2 years ago
- Masked Python SDK wrapper for OpenAI API. Use public LLM APIs securely. ☆119 Updated 2 years ago
- A curated list of awesome synthetic data tools (open source and commercial). ☆215 Updated last year
- Framework for building data agent workflows ☆84 Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆106 Updated last year
- Build Enterprise RAG (Retrieval Augmented Generation) Pipelines to tackle various Generative AI use cases with LLMs by simply plugging co… ☆113 Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆79 Updated 8 months ago