RARR: Researching and Revising What Language Models Say, Using Language Models
☆53Jun 22, 2023Updated 2 years ago
Alternatives and similar repositories for RARR
Users that are interested in RARR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆64Dec 25, 2023Updated 2 years ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Jul 3, 2023Updated 2 years ago
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Nov 27, 2023Updated 2 years ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆115Jan 6, 2024Updated 2 years ago
- fact checking of GPT and other LLMs☆22Jul 18, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆43Sep 3, 2024Updated last year
- code for EACL2024-main:Generative Dense Retrieval: Memory Can Be a Burden☆32Jan 19, 2024Updated 2 years ago
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆17Nov 15, 2024Updated last year
- ☆77Feb 16, 2024Updated 2 years ago
- ☆76Nov 27, 2024Updated last year
- ☆19Nov 8, 2023Updated 2 years ago
- ☆22Dec 9, 2023Updated 2 years ago
- ACL2023 - AlignScore, a metric for factual consistency evaluation.☆164Mar 11, 2024Updated 2 years ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627☆518Oct 9, 2024Updated last year
- Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the…☆22Nov 4, 2024Updated last year
- ☆28May 3, 2023Updated 3 years ago
- "How to Trust Your Diffusion Models: A Convex Optimization Approach to Conformal Risk Control"☆17Jan 6, 2026Updated 5 months ago
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"☆17Aug 14, 2023Updated 2 years ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆138Mar 14, 2024Updated 2 years ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆251Dec 2, 2024Updated last year
- Companion repo for "Evaluating Verifiability in Generative Search Engines".☆87May 12, 2023Updated 3 years ago
- We believe the ability of an LLM to attribute the text that it generates is likely to be crucial for both system developers and users in …☆55Jul 28, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆440Apr 13, 2025Updated last year
- Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims☆28May 30, 2023Updated 3 years ago
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆32Aug 18, 2024Updated last year
- Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"☆23Mar 29, 2024Updated 2 years ago
- The dataset and code for PeerSum at EMNLP'23.☆16Oct 20, 2025Updated 7 months ago
- OpenOrca-KO dataset을 활용하여 llama2를 fine-tuning한 Korean-OpenOrca☆18Nov 1, 2023Updated 2 years ago
- FacTool: Factuality Detection in Generative AI☆932Aug 19, 2024Updated last year
- About Data and Codes for EMNLP 2023 System Demo Paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"☆19Dec 19, 2023Updated 2 years ago
- An original implementation of EMNLP 2020, "AmbigQA: Answering Ambiguous Open-domain Questions"☆123Apr 23, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆59Jun 7, 2024Updated 2 years ago
- EACL 2017☆26Apr 22, 2018Updated 8 years ago
- A repository to keep tools, scripts, data for SMART task.☆11May 24, 2022Updated 4 years ago
- Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.☆14Jan 23, 2022Updated 4 years ago
- IAI Style Guide☆11Jun 27, 2025Updated 11 months ago
- ☆11Jul 15, 2020Updated 5 years ago
- Repository for "Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks"☆26Jul 31, 2023Updated 2 years ago