anthonywchen/RARR

RARR: Researching and Revising What Language Models Say, Using Language Models

☆54

Alternatives and similar repositories for RARR

Users who are interested in RARR are comparing it to the libraries listed below.

hkust-nlp / felm
View on GitHub
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65Dec 25, 2023Updated 2 years ago
panditakshay402 / PsyCare-AI
View on GitHub
PsyCare-AI is an AI-powered mental health prediction project, offering a user-friendly interface to predict potential mental health issue…
☆10Jul 19, 2023Updated 3 years ago
OSU-NLP-Group / AttrScore
View on GitHub
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
☆56Jul 3, 2023Updated 3 years ago
jifan-chen / Fact-checking-via-Raw-Evidence
View on GitHub
Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild
☆13Nov 27, 2023Updated 2 years ago
yuxiaw / Factcheck-GPT
View on GitHub
Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.
☆116Jan 6, 2024Updated 2 years ago
ypw0102 / GDR
View on GitHub
Code for the EACL 2024 main-conference paper "Generative Dense Retrieval: Memory Can Be a Burden"
☆32Jan 19, 2024Updated 2 years ago
bgalitsky / Truth-O-Meter-Making-ChatGPT-Truthful
View on GitHub
fact checking of GPT and other LLMs
☆22Jul 18, 2024Updated 2 years ago
WilliamZR / ProTrix
View on GitHub
Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
☆17Nov 15, 2024Updated last year
abhika-m / FAVA
View on GitHub
☆77Feb 16, 2024Updated 2 years ago
Complex-data / MUSER
View on GitHub
☆19Nov 8, 2023Updated 2 years ago
MichSchli / AVeriTeC
View on GitHub
☆75Nov 27, 2024Updated last year
yuh-zha / AlignScore
View on GitHub
ACL2023 - AlignScore, a metric for factual consistency evaluation.
☆164Mar 11, 2024Updated 2 years ago
princeton-nlp / ALCE
View on GitHub
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆522Oct 9, 2024Updated last year
zcrwind / PREFER
View on GitHub
☆22Dec 9, 2023Updated 2 years ago
yale-nlp / ODSum
View on GitHub
Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"
☆11Sep 20, 2024Updated last year
amazon-science / tofueval
View on GitHub
☆32May 10, 2024Updated 2 years ago
chaitanyamalaviya / ExpertQA
View on GitHub
[Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers
☆139Mar 14, 2024Updated 2 years ago
5uru / Median
View on GitHub
Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the…
☆21Nov 4, 2024Updated last year
khuangaf / ZeroFEC
View on GitHub
Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"
☆17Aug 14, 2023Updated 2 years ago
philschmid / optimum-static-quantization
View on GitHub
☆28May 3, 2023Updated 3 years ago
psunlpgroup / ReaLMistake
View on GitHub
This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".
☆32Aug 18, 2024Updated last year
google-research-datasets / Attributed-QA
View on GitHub
We believe the ability of an LLM to attribute the text that it generates is likely to be crucial for both system developers and users in …
☆55Jul 28, 2023Updated 2 years ago
shmsw25 / FActScore
View on GitHub
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆450Apr 13, 2025Updated last year
jifan-chen / subquestions-for-fact-checking
View on GitHub
Code and dataset for the paper: Generating Literal and Implied Subquestions to Fact-check Complex Claims
☆28May 30, 2023Updated 3 years ago
yale-nlp / QTSumm
View on GitHub
Data and Code for EMNLP 2023 paper "QTSumm: Query-Focused Summarization over Tabular Data"
☆23Mar 29, 2024Updated 2 years ago
Marker-Inc-Korea / Korean-OpenOrca
View on GitHub
Korean-OpenOrca: llama2 fine-tuned using the OpenOrca-KO dataset
☆18Nov 1, 2023Updated 2 years ago
oaimli / PeerSum
View on GitHub
The dataset and code for PeerSum at EMNLP'23.
☆16Oct 20, 2025Updated 9 months ago
GAIR-NLP / factool
View on GitHub
FacTool: Factuality Detection in Generative AI
☆934Aug 19, 2024Updated last year
XinyuanLu00 / QACheck
View on GitHub
Data and code for the EMNLP 2023 System Demo paper "QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking"
☆19Dec 19, 2023Updated 2 years ago
mitmedialab / empathic-stories
View on GitHub
☆17Nov 18, 2024Updated last year
ParticleMedia / RAGTruth
View on GitHub
Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"
☆260Dec 2, 2024Updated last year
eugeneyan / learning-typescript
View on GitHub
☆16Jun 5, 2023Updated 3 years ago
jzbjyb / FLARE
View on GitHub
Forward-Looking Active REtrieval-augmented generation (FLARE)
☆669Nov 20, 2023Updated 2 years ago
yuxiaw / OpenFactCheck
View on GitHub
☆60Jun 7, 2024Updated 2 years ago
tqfang / comet-deepspeed
View on GitHub
Train large COMET (T5-3B/GPT2-XL) with small memory (on 11GB memory GPUs like 1080/2080) using DeepSpeed.
☆14Jan 23, 2022Updated 4 years ago
nixiesearch / onnx-convert
View on GitHub
An ONNX converter script focused on embedding models
☆34Jan 14, 2025Updated last year
smart-task / smart-dataset
View on GitHub
A repository to keep tools, scripts, data for SMART task.
☆11May 24, 2022Updated 4 years ago
stayallive / whisper-subtitles
View on GitHub
Generate subtitles (.srt and .vtt) from audio files using OpenAI's Whisper models.
☆30May 23, 2023Updated 3 years ago
zmzhang2000 / trustworthy-alignment
View on GitHub
Official repository for Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
☆12Sep 2, 2024Updated last year