freshllms/freshqa

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/freshllms/freshqa)

freshllms / freshqa

Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)

☆400

Alternatives and similar repositories for freshqa

Users that are interested in freshqa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

realtimeqa / realtimeqa_public
View on GitHub
☆87Jul 18, 2026Updated last week
princeton-nlp / ALCE
View on GitHub
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆522Oct 9, 2024Updated last year
wzhouad / context-faithful-llm
View on GitHub
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41Mar 23, 2023Updated 3 years ago
yizhongw / llm-temporal-alignment
View on GitHub
Methods and evaluation for aligning language models temporally
☆31Mar 2, 2024Updated 2 years ago
velocityCavalry / CREPE
View on GitHub
An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"
☆16Nov 5, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
jzbjyb / FLARE
View on GitHub
Forward-Looking Active REtrieval-augmented generation (FLARE)
☆669Nov 20, 2023Updated 2 years ago
pillowsofwind / Knowledge-Conflicts-Survey
View on GitHub
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆159Sep 21, 2024Updated last year
RUCAIBox / LLM-Knowledge-Boundary
View on GitHub
Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
☆82Jul 31, 2023Updated 2 years ago
orionw / FollowIR
View on GitHub
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆56Jul 3, 2024Updated 2 years ago
facebookresearch / lss_eval
View on GitHub
This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…
☆31Aug 25, 2023Updated 2 years ago
AlexTMallen / adaptive-retrieval
View on GitHub
☆192Jul 2, 2025Updated last year
AkariAsai / self-rag
View on GitHub
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…
☆2,410May 25, 2024Updated 2 years ago
castorini / umbrela
View on GitHub
☆58Apr 18, 2026Updated 3 months ago
voidism / DoLa
View on GitHub
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆557Jul 12, 2026Updated 2 weeks ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
OSU-NLP-Group / LLM-Knowledge-Conflict
View on GitHub
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
☆84Apr 12, 2024Updated 2 years ago
mlfoundations / scaling
View on GitHub
Language models scale reliably with over-training and on downstream tasks
☆102Apr 2, 2024Updated 2 years ago
allenai / easy-to-hard-generalization
View on GitHub
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Jan 17, 2024Updated 2 years ago
sebastian-hofstaetter / tas-balanced-dense-retrieval
View on GitHub
SIGIR 2021: Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling
☆60Jul 11, 2021Updated 5 years ago
openai / prm800k
View on GitHub
800,000 step-level correctness labels on LLM solutions to MATH problems
☆2,152Jun 1, 2023Updated 3 years ago
GAIR-NLP / factool
View on GitHub
FacTool: Factuality Detection in Generative AI
☆934Aug 19, 2024Updated last year
MARIO-Math-Reasoning / Super_MARIO
View on GitHub
☆341Jun 5, 2025Updated last year
googleinterns / localizing-paragraph-memorization
View on GitHub
☆15Feb 21, 2024Updated 2 years ago
huiwy / reflection-on-trees
View on GitHub
☆14May 9, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hkust-nlp / felm
View on GitHub
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆65Dec 25, 2023Updated 2 years ago
huggingface / alignment-handbook
View on GitHub
Robust recipes to align language models with human and AI preferences
☆5,645May 26, 2026Updated 2 months ago
QwenLM / AutoIF
View on GitHub
☆336Jul 25, 2024Updated 2 years ago
openai / simple-evals
View on GitHub
☆4,583Apr 22, 2026Updated 3 months ago
facebookresearch / MetaICL
View on GitHub
An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi
☆274Apr 15, 2023Updated 3 years ago
hkust-nlp / deita
View on GitHub
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆600Dec 9, 2024Updated last year
facebookresearch / contriever
View on GitHub
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
☆780Apr 7, 2023Updated 3 years ago
nelson-liu / evaluating-verifiability-in-generative-search-engines
View on GitHub
Companion repo for "Evaluating Verifiability in Generative Search Engines".
☆87May 12, 2023Updated 3 years ago
ContextualAI / HALOs
View on GitHub
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
☆908Sep 30, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yidingjiang / ado
View on GitHub
The repository contains code for Adaptive Data Optimization
☆37Dec 9, 2024Updated last year
allenai / open-instruct
View on GitHub
AllenAI's post-training codebase
☆3,809Updated this week
princeton-nlp / AutoCompressors
View on GitHub
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆337Sep 9, 2024Updated last year
HillZhang1999 / llm-hallucination-survey
View on GitHub
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …
☆1,085Sep 27, 2025Updated 9 months ago
reasoning-survey / Awesome-Reasoning-Foundation-Models
View on GitHub
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
☆657Jun 16, 2025Updated last year
THUDM / AgentBench
View on GitHub
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
☆3,603Feb 8, 2026Updated 5 months ago
Re-Align / URIAL
View on GitHub
☆316Jun 9, 2024Updated 2 years ago