Muhtasham / summarization-eval
π Reference-Free automatic summarization evaluation with potential hallucination detection
β100Updated last year
Alternatives and similar repositories for summarization-eval:
Users that are interested in summarization-eval are comparing it to the libraries listed below
- β77Updated 10 months ago
- Generalist and Lightweight Model for Text Classificationβ115Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β128Updated 3 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ99Updated last year
- β40Updated 2 months ago
- β48Updated last year
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.β50Updated 6 months ago
- NLP with Rust for Python π¦πβ61Updated 10 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing β‘β67Updated 5 months ago
- β78Updated 10 months ago
- A framework for evaluating function calls made by LLMsβ37Updated 8 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive argumentsβ77Updated 6 months ago
- β66Updated 5 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β80Updated last year
- β92Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 9 months ago
- Chunk your text using gpt4o-mini more accuratelyβ44Updated 8 months ago
- Tools to make language models a bit easier to useβ41Updated this week
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β174Updated 7 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.β37Updated last year
- β151Updated 4 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)β100Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimizationβ59Updated last year
- Pre-train Static Word Embeddingsβ52Updated last month
- Simple GRPO scripts and configurations.β58Updated 2 months ago
- Writing Blog Posts with Generative Feedback Loops!β47Updated last year
- XTR/WARP is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.β122Updated 5 months ago
- Chat Markup Language conversation libraryβ55Updated last year
- utilities for loading and running text embeddings with onnxβ44Updated 8 months ago
- Evaluating LLMs with CommonGen-Liteβ89Updated last year