Reference-Free automatic summarization evaluation with potential hallucination detection
☆104 · Jan 15, 2024 · Updated 2 years ago
Alternatives and similar repositories for summarization-eval
Users who are interested in summarization-eval are comparing it to the libraries listed below.
- LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases. ☆14 · Jul 12, 2025 · Updated 8 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆32 · Sep 22, 2024 · Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆109 · Sep 19, 2025 · Updated 6 months ago
- ☆78 · May 27, 2024 · Updated last year
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… ☆44 · Jan 18, 2024 · Updated 2 years ago
- ☆19 · Mar 16, 2025 · Updated last year
- LLM plugin for models hosted by Anyscale Endpoints ☆35 · Apr 22, 2024 · Updated last year
- Helper scripts and notes that were used while porting various nlp models ☆50 · Mar 22, 2022 · Updated 4 years ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆197 · May 6, 2024 · Updated last year
- Building a chatbot powered by a RAG pipeline to read, summarize and quote the most relevant papers related to the user query. ☆166 · Apr 26, 2024 · Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ☆12 · Feb 11, 2024 · Updated 2 years ago
- minimal pytorch implementation of bm25 (with sparse tensors) ☆104 · Oct 28, 2025 · Updated 4 months ago
- ☆20 · Mar 10, 2025 · Updated last year
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆165 · Oct 14, 2025 · Updated 5 months ago
- Generate textbook-quality synthetic LLM pretraining data ☆509 · Oct 19, 2023 · Updated 2 years ago
- ☆15 · Oct 4, 2024 · Updated last year
- Repository of my thoughts on creating GPTs, and instructions and files for Better GPT Builder ☆33 · Jan 29, 2024 · Updated 2 years ago
- RAG example using DSPy, Gradio, FastAPI ☆92 · Apr 11, 2024 · Updated last year
- LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development ☆21 · Jul 24, 2023 · Updated 2 years ago
- ☆15 · Nov 30, 2021 · Updated 4 years ago
- Python tools for easily translating your blog content to podcasts & YouTube ☆211 · Sep 4, 2024 · Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 · Apr 29, 2024 · Updated last year
- Probabilistic LLM evaluations. [CogSci2023; ACL2023] ☆73 · Jul 27, 2024 · Updated last year
- Repository for analysis and experiments in the BigCode project. ☆128 · Mar 20, 2024 · Updated 2 years ago
- γ¬γ€γγΌγγ³γ°οΌγ‘γΏγ’γ³ ☆13 · Dec 19, 2023 · Updated 2 years ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,882 · May 17, 2025 · Updated 10 months ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆60 · Jun 3, 2024 · Updated last year
- Inquisitive Parrots for Search ☆200 · Jun 5, 2025 · Updated 9 months ago
- Repo for Turkish Wiki NER dataset. ☆12 · Jul 11, 2023 · Updated 2 years ago
- Example for Logging LLM Evaluator Prompt Responses ☆18 · Aug 14, 2023 · Updated 2 years ago
- A Data Source for Reasoning Embodied Agents ☆19 · Sep 18, 2023 · Updated 2 years ago
- ☆12 · Feb 22, 2024 · Updated 2 years ago
- Tools to make language models a bit easier to use ☆65 · Mar 12, 2026 · Updated last week
- RaKUn 2.0 - A fast keyword detection algorithm ☆72 · Aug 5, 2025 · Updated 7 months ago
- ☆70 · Jan 18, 2026 · Updated 2 months ago
- ☆18 · Dec 18, 2023 · Updated 2 years ago
- An app for generating prompts ☆28 · Aug 16, 2025 · Updated 7 months ago
- Vector Embedding Server in under 100 lines of code ☆22 · Mar 1, 2024 · Updated 2 years ago
- The code used to evaluate embedding models on the Massive Legal Embedding Benchmark (MLEB). ☆33 · Feb 24, 2026 · Updated 3 weeks ago