yuxiaw / OpenFactCheckLinks
☆52Updated last year
Alternatives and similar repositories for OpenFactCheck
Users that are interested in OpenFactCheck are comparing it to the libraries listed below
Sorting:
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆85Updated 11 months ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆100Updated last year
- ☆72Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆131Updated last year
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Updated 2 years ago
- ☆68Updated 2 years ago
- Contrastive Chain-of-Thought Prompting☆64Updated last year
- ☆43Updated last year
- ☆46Updated 11 months ago
- ☆41Updated 5 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆45Updated last year
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 7 months ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆96Updated last year
- Code/data for MARG (multi-agent review generation)☆44Updated 8 months ago
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"☆59Updated 5 months ago
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…☆63Updated 2 years ago
- Token-level Reference-free Hallucination Detection☆94Updated last year
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023)☆64Updated last year
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆57Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ☆151Updated last year
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Updated 2 years ago
- A Human-LLM Collaborative Dataset for Generative Information-seeking with Attribution☆33Updated last year
- Code and dataset for the emnlp paper titled Instruct and Extract: Instruction Tuning for On-Demand Information Extraction☆52Updated last year
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆40Updated 2 years ago
- ☆124Updated 9 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆47Updated 5 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆48Updated 7 months ago
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆53Updated 11 months ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆60Updated 2 years ago