anyscale / factuality-eval
Library for iPython notebooks for evaluating factuality.
☆50Updated last year
Alternatives and similar repositories for factuality-eval:
Users that are interested in factuality-eval are comparing it to the libraries listed below
- ☆78Updated 10 months ago
- Framework for building and maintaining self-updating prompts for LLMs☆61Updated 9 months ago
- ☆85Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆29Updated 7 months ago
- 📚 Datasets and models for instruction-tuning☆238Updated last year
- A python package that provides a custom streamlit connection to query data from weaviate, the AI native vector database☆54Updated 8 months ago
- Retrieval Augmented Generation applications☆26Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated 11 months ago
- Web App for generating synthetic data☆46Updated 7 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆65Updated 2 years ago
- Drift detection module for machine learning pipelines.☆21Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated 11 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆101Updated this week
- ☆76Updated 9 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆102Updated 3 months ago
- ☆16Updated last year
- ☆93Updated last year
- Repository of the code base for KT Generation process that we worked at Google Cloud and Searce GenAI Hackathon.☆74Updated last year
- Leverage your LangChain trace data for fine tuning☆41Updated 7 months ago
- ☆46Updated 2 years ago
- Course for Interpreting ML Models☆52Updated 2 years ago
- ☆66Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆120Updated 3 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆103Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- llm_using_petals☆16Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆107Updated 6 months ago