anyscale / factuality-eval
Library for iPython notebooks for evaluating factuality.
☆50 · Updated last year
Alternatives and similar repositories for factuality-eval:
Users who are interested in factuality-eval are comparing it to the libraries listed below.
- Leverage your LangChain trace data for fine tuning ☆41 · Updated 6 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 · Updated last year
- ☆51 · Updated 2 months ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆29 · Updated 5 months ago
- ☆77 · Updated 8 months ago
- ☆76 · Updated 2 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆37 · Updated 10 months ago
- ☆76 · Updated 8 months ago
- ☆88 · Updated last year
- Retrieval Augmented Generation applications ☆26 · Updated last year
- Preprocessing pipeline notebooks and API supporting text extraction from SEC documents ☆143 · Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆34 · Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆100 · Updated 10 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆100 · Updated last year
- ☆93 · Updated last year
- End-to-End LLM Guide ☆101 · Updated 7 months ago
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆106 · Updated 5 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆93 · Updated 2 months ago
- 📚 Datasets and models for instruction-tuning ☆234 · Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆101 · Updated last year
- A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more! ☆11 · Updated 4 months ago
- Sample notebooks and prompts for LLM evaluation ☆120 · Updated 2 months ago
- ☆29 · Updated last year
- Examples of using Evidently to evaluate, test and monitor ML models. ☆20 · Updated this week
- Web App for generating synthetic data ☆46 · Updated 5 months ago
- Prototyping a question and answer bot over PDFs ☆38 · Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆106 · Updated last week
- Framework for building and maintaining self-updating prompts for LLMs ☆60 · Updated 8 months ago
- meta_llama_2finetuned_text_generation_summarization ☆21 · Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems ☆10 · Updated last year