fiddler-labs / fiddler-auditor
Fiddler Auditor is a tool to evaluate language models.
☆188 Updated last year
Alternatives and similar repositories for fiddler-auditor
Users who are interested in fiddler-auditor compare it to the libraries listed below.
- A tool for evaluating LLMs ☆424 Updated last year
- Sample notebooks and prompts for LLM evaluation ☆137 Updated 3 months ago
- Framework for LLM evaluation, guardrails and security ☆113 Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆115 Updated 2 months ago
- The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati… ☆248 Updated last year
- Continuous Integration for LLM powered applications ☆252 Updated 2 years ago
- ☆169 Updated this week
- ☆177 Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ☆83 Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆79 Updated 7 months ago
- Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning) ☆397 Updated last year
- ☆186 Updated last year
- Red-Teaming Language Models with DSPy ☆214 Updated 7 months ago
- Masked Python SDK wrapper for OpenAI API. Use public LLM APIs securely. ☆119 Updated 2 years ago
- ☆87 Updated 2 years ago
- 📚 A curated list of papers & technical articles on AI Quality & Safety ☆193 Updated 5 months ago
- Python SDK for running evaluations on LLM generated responses ☆292 Updated 4 months ago
- 📚 Datasets and models for instruction-tuning ☆239 Updated 2 years ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 Updated last year
- Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications. ☆318 Updated 2 months ago
- Python client library for improving your LLM app accuracy ☆98 Updated 7 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆142 Updated last week
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act ☆93 Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆116 Updated 6 months ago
- ☆89 Updated last year
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation ☆105 Updated 9 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆105 Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆151 Updated 11 months ago
- Automated knowledge graph creation SDK ☆122 Updated 10 months ago
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks. ☆123 Updated last week