anyscale / factuality-eval
Library for iPython notebooks for evaluating factuality.
☆51 · Updated 2 years ago
Alternatives and similar repositories for factuality-eval
Users interested in factuality-eval are comparing it to the libraries listed below.
- Datasets and models for instruction-tuning ☆238 · Updated 2 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆36 · Updated 2 years ago
- Notebooks for training universal 0-shot classifiers on many different tasks ☆137 · Updated 11 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆107 · Updated 3 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆334 · Updated last year
- ☆84 · Updated 2 years ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard. ☆189 · Updated last year
- LLM_library is a comprehensive repository that serves as a one-stop resource for hands-on code and insightful summaries. ☆69 · Updated last year
- Sample notebooks and prompts for LLM evaluation ☆156 · Updated last month
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆116 · Updated 4 months ago
- Reference-Free automatic summarization evaluation with potential hallucination detection ☆103 · Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 · Updated last year
- ☆78 · Updated last year
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆222 · Updated 3 years ago
- Data cleaning and curation for unstructured text ☆328 · Updated last year
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆444 · Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆125 · Updated last month
- ☆89 · Updated 2 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆39 · Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆196 · Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆110 · Updated last year
- ☆23 · Updated 2 years ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆33 · Updated last year
- ☆171 · Updated last month
- Library for creating causal chains using language models. ☆81 · Updated 2 years ago
- ☆89 · Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper ☆119 · Updated 2 years ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆119 · Updated 8 months ago
- Leverage your LangChain trace data for fine tuning ☆46 · Updated last year
- Reward Model framework for LLM RLHF ☆61 · Updated 2 years ago