anyscale / factuality-eval
Library for IPython notebooks for evaluating factuality.
☆51 Updated last year
Related projects
Alternatives and complementary repositories for factuality-eval
- ☆75 Updated 5 months ago
- Low latency, High Accuracy, Custom Query routers for Co-pilots and Agents. Built by Prithivi Da ☆31 Updated this week
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆27 Updated 2 months ago
- ☆75 Updated 5 months ago
- Sample notebooks and prompts for LLM evaluation ☆114 Updated this week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆97 Updated 7 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files. ☆36 Updated 7 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆100 Updated 9 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆98 Updated 10 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 4 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks ☆106 Updated 7 months ago
- Command Line Interface for Hugging Face Inference Endpoints ☆66 Updated 7 months ago
- A Python package that provides a custom Streamlit connection to query data from Weaviate, the AI native vector database ☆52 Updated 3 months ago
- Leverage your LangChain trace data for fine tuning ☆38 Updated 3 months ago
- Build Enterprise RAG (Retriever Augmented Generation) Pipelines to tackle various Generative AI use cases with LLMs by simply plugging co… ☆110 Updated 3 months ago
- Research notes and extra resources for all the work at explodinggradients.com ☆20 Updated this week
- 🤗 Collection of examples on how to train, deploy and monitor HuggingFace models in Google Cloud Vertex AI ☆19 Updated 8 months ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K … ☆75 Updated 4 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆34 Updated last year
- Retrieval Augmented Generation applications ☆27 Updated last year
- ☆15 Updated 5 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆43 Updated 8 months ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 Updated 10 months ago
- Constrain LLM output ☆106 Updated 4 months ago
- ☆51 Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆311 Updated 2 weeks ago
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector search ☆21 Updated 11 months ago
- ☆24 Updated last year
- Testing speed and accuracy of RAG with and without a Cross Encoder Reranker. ☆47 Updated 10 months ago