Muhtasham / summarization-evalLinks
π Reference-Free automatic summarization evaluation with potential hallucination detection
β103Updated last year
Alternatives and similar repositories for summarization-eval
Users that are interested in summarization-eval are comparing it to the libraries listed below
Sorting:
- β80Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ104Updated last week
- β46Updated last year
- β210Updated 3 months ago
- Generalist and Lightweight Model for Text Classificationβ161Updated 3 months ago
- Writing Blog Posts with Generative Feedback Loops!β50Updated last year
- data cleaning and curation for unstructured textβ328Updated last year
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive argumentsβ89Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β80Updated last year
- An introduction to LLM Samplingβ79Updated 9 months ago
- Notebooks for training universal 0-shot classifiers on many different tasksβ137Updated 9 months ago
- Pre-train Static Word Embeddingsβ85Updated 3 weeks ago
- PyLate efficient inference engineβ64Updated 2 weeks ago
- Python library to use Pleias-RAG modelsβ62Updated 5 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β32Updated last year
- β82Updated 10 months ago
- β78Updated last year
- Simple UI for debugging correlations of text embeddingsβ291Updated 4 months ago
- β49Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ66Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β189Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo rankerβ119Updated this week
- β157Updated 9 months ago
- Chat Markup Language conversation libraryβ55Updated last year
- PyTorch implementation for MRLβ19Updated last year
- NLP with Rust for Python π¦πβ65Updated 4 months ago
- Efficient vector database for hundred millions of embeddings.β208Updated last year
- A framework for evaluating function calls made by LLMsβ38Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)β104Updated last year