anyscale / factuality-eval
Library for IPython notebooks for evaluating factuality.
☆51 · Updated 2 years ago
Alternatives and similar repositories for factuality-eval
Users who are interested in factuality-eval are comparing it to the libraries listed below.
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 · Updated last year
- 📚 Datasets and models for instruction-tuning ☆238 · Updated last year
- ☆82 · Updated 2 years ago
- Classy-fire is a multiclass text classification approach leveraging OpenAI LLM model APIs optimally using clever parameter tuning and promp… ☆79 · Updated last year
- ☆78 · Updated last year
- ☆211 · Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks ☆137 · Updated 8 months ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆223 · Updated 2 years ago
- ☆87 · Updated last year
- Supervised instruction finetuning for LLMs with HF Trainer and DeepSpeed ☆35 · Updated 2 years ago
- Sample notebooks and prompts for LLM evaluation ☆138 · Updated 3 months ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act ☆93 · Updated last year
- Find and fix bugs in natural language machine learning models using adaptive testing. ☆185 · Updated last year
- Build Enterprise RAG (Retriever Augmented Generation) pipelines to tackle various generative AI use cases with LLMs by simply plugging co… ☆113 · Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆105 · Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆329 · Updated 10 months ago
- Course for Interpreting ML Models ☆52 · Updated 2 years ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard. ☆188 · Updated last year
- Fiddler Auditor is a tool to evaluate language models. ☆187 · Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆103 · Updated last year
- ☆89 · Updated 2 years ago
- ☆462 · Updated last year
- AI Data Management & Evaluation Platform ☆216 · Updated last year
- ☆169 · Updated 2 weeks ago
- 📝 Reference-free automatic summarization evaluation with potential hallucination detection ☆103 · Updated last year
- ☆207 · Updated last year
- Command Line Interface for Hugging Face Inference Endpoints ☆66 · Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆117 · Updated this week
- ☆186 · Updated last year
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation. ☆117 · Updated 2 years ago