google-deepmind / long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
☆666 · Feb 5, 2026 · Updated last week
Alternatives and similar repositories for long-form-factuality
Users who are interested in long-form-factuality are comparing it to the libraries listed below.
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic… ☆415 · Apr 13, 2025 · Updated 10 months ago
- Official repo for "Make Your LLM Fully Utilize the Context" ☆263 · May 15, 2024 · Updated last year
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation. ☆112 · Jan 6, 2024 · Updated 2 years ago
- ☆75 · Feb 16, 2024 · Updated last year
- FacTool: Factuality Detection in Generative AI ☆912 · Aug 19, 2024 · Updated last year
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning" ☆208 · Jan 9, 2026 · Updated last month
- ☆4,346 · Jul 31, 2025 · Updated 6 months ago
- ☆67 · Mar 30, 2025 · Updated 10 months ago
- Reference implementation of Megalodon 7B model ☆528 · May 17, 2025 · Updated 8 months ago
- LOFT: A 1 Million+ Token Long-Context Benchmark ☆225 · Jun 13, 2025 · Updated 8 months ago
- Official repository for ORPO ☆471 · May 31, 2024 · Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆473 · Mar 19, 2024 · Updated last year
- [ICML 2024] CLLMs: Consistency Large Language Models ☆411 · Nov 16, 2024 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Oct 27, 2024 · Updated last year
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs. ☆459 · Apr 18, 2024 · Updated last year
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,555 · Jan 14, 2026 · Updated last month
- Evaluate your LLM's response with Prometheus and GPT4 💯 ☆1,045 · Apr 25, 2025 · Updated 9 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning ☆667 · Jun 1, 2024 · Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆136 · Mar 14, 2024 · Updated last year
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆944 · Feb 16, 2025 · Updated 11 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,885 · Updated this week
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models ☆1,657 · Mar 8, 2024 · Updated last year
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models ☆28 · Mar 22, 2024 · Updated last year
- ☆1,033 · Dec 17, 2024 · Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆484 · Mar 19, 2024 · Updated last year
- Tools for merging pretrained large language models. ☆6,783 · Jan 26, 2026 · Updated 2 weeks ago
- AllenAI's post-training codebase ☆3,573 · Updated this week
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆201 · Jul 17, 2024 · Updated last year
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025] ☆111 · Feb 20, 2025 · Updated 11 months ago
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs. ☆2,711 · Feb 4, 2026 · Updated last week
- Robust recipes to align language models with human and AI preferences ☆5,495 · Sep 8, 2025 · Updated 5 months ago
- [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models" ☆445 · Oct 16, 2024 · Updated last year
- ☆321 · Sep 18, 2024 · Updated last year
- [ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs ☆1,826 · Jun 24, 2025 · Updated 7 months ago
- ☆48 · Jan 7, 2024 · Updated 2 years ago
- Official repository of Evolutionary Optimization of Model Merging Recipes ☆1,395 · Nov 29, 2024 · Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆2,173 · Aug 17, 2024 · Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆94 · Nov 17, 2024 · Updated last year
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models ☆601 · Jun 26, 2024 · Updated last year