anyscale / factuality-evalLinks
Library for iPython notebooks for evaluating factuality.
☆50Updated 2 years ago
Alternatives and similar repositories for factuality-eval
Users that are interested in factuality-eval are comparing it to the libraries listed below
Sorting:
- ☆83Updated 2 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆136Updated 9 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- ☆216Updated last year
- 📚 Datasets and models for instruction-tuning☆238Updated 2 years ago
- Fiddler Auditor is a tool to evaluate language models.☆188Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆38Updated last year
- Sample notebooks and prompts for LLM evaluation☆151Updated this week
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆32Updated last year
- ☆77Updated last year
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆223Updated 2 years ago
- ☆88Updated 2 years ago
- AI Data Management & Evaluation Platform☆216Updated 2 years ago
- ☆33Updated 3 years ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆119Updated 3 weeks ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆329Updated 11 months ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆113Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆102Updated last year
- Find and fix bugs in natural language machine learning models using adaptive testing.☆186Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated last year
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆189Updated last year
- ☆206Updated last year
- Reward Model framework for LLM RLHF☆61Updated 2 years ago
- This is the repo for the container that holds the models for the text2vec-transformers module☆55Updated 2 weeks ago
- ☆48Updated last year
- Course for Interpreting ML Models☆52Updated 2 years ago
- experiments with inference on llama☆103Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆115Updated 2 months ago