RUCAIBox/LLM-Knowledge-Boundary

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RUCAIBox/LLM-Knowledge-Boundary)

RUCAIBox / LLM-Knowledge-Boundary

Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"

☆82

Alternatives and similar repositories for LLM-Knowledge-Boundary

Users that are interested in LLM-Knowledge-Boundary are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yunx-z / COMBO
View on GitHub
Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)
☆21Oct 8, 2023Updated 2 years ago
RUC-GSAI / YuLan-IR
View on GitHub
YuLan-IR: Information Retrieval Boosted LMs
☆220Mar 4, 2024Updated 2 years ago
ZurichRain / HMCGR
View on GitHub
code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"
☆10Oct 20, 2022Updated 3 years ago
UARK-AICV / FG-CXR
View on GitHub
The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge…
☆12Jul 28, 2025Updated 11 months ago
RUCAIBox / HaluEval
View on GitHub
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆592Feb 12, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
vipulgupta1011 / CALM
View on GitHub
☆11Oct 2, 2023Updated 2 years ago
zhehengluoK / Biomedical-Text-Summarization-Survey
View on GitHub
This repository lists papers, codes, and datasets in Biomedical Text Summarisation based on PLM
☆23Oct 4, 2022Updated 3 years ago
HillZhang1999 / llm-hallucination-survey
View on GitHub
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …
☆1,085Sep 27, 2025Updated 9 months ago
jzbjyb / FLARE
View on GitHub
Forward-Looking Active REtrieval-augmented generation (FLARE)
☆669Nov 20, 2023Updated 2 years ago
yzhangcs / ctc-copy
View on GitHub
[EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".
☆20Oct 17, 2023Updated 2 years ago
JiaQiSJTU / FaithEval-FFLM
View on GitHub
A zero-shot faithfulness evaluation metric for text summarization
☆11Oct 17, 2023Updated 2 years ago
Timothyxxx / RetrivalLMPapers
View on GitHub
Paper collections of retrieval-based (augmented) language model.
☆233May 24, 2024Updated 2 years ago
EthanLeo-LYX / LLMQA
View on GitHub
[WWW2024 Oral] Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering
☆15Apr 22, 2025Updated last year
asappresearch / kbc-pomr
View on GitHub
Code for the paper "Knowledge Base Completion for Constructing Problem-Oriented Medical Records" at MLHC 2020
☆11Jun 8, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CLUEbenchmark / Math24o
View on GitHub
Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark
☆14Mar 27, 2025Updated last year
ziweiji / Self_Reflection_Medical
View on GitHub
Code for paper Towards Mitigating LLM Hallucination via Self Reflection
☆30Oct 9, 2023Updated 2 years ago
princeton-nlp / ALCE
View on GitHub
[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627
☆522Oct 9, 2024Updated last year
princeton-nlp / MQuAKE
View on GitHub
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆124Sep 12, 2024Updated last year
OSU-NLP-Group / AttrScore
View on GitHub
Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"
☆56Jul 3, 2023Updated 3 years ago
cubenlp / CERRU
View on GitHub
CCL2024 Chinese Essay Rhetoric Recognition and Understanding
☆17Oct 1, 2024Updated last year
Alibaba-NLP / EBM-Net
View on GitHub
Codes for the EMNLP'2020 paper "Predicting Clinical Trial Results by Implicit Evidence Integration".
☆14Jan 13, 2021Updated 5 years ago
NUSTM / CCAC-ABSA
View on GitHub
☆10Jul 5, 2023Updated 3 years ago
chtmp223 / suri
View on GitHub
Suri: Multi-constraint instruction following for long-form text generation [EMNLP’24]
☆27Oct 3, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ddehun / DEnsity
View on GitHub
Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"
☆11May 23, 2023Updated 3 years ago
RUCAIBox / BAMBOO
View on GitHub
☆36Mar 25, 2024Updated 2 years ago
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
GanjinZero / RAMM
View on GitHub
Codes and Pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 202…
☆29Nov 2, 2023Updated 2 years ago
Arvid-pku / ALCUNA
View on GitHub
[EMNLP 2023] ALCUNA: Large Language Models Meet New Knowledge
☆30Oct 30, 2023Updated 2 years ago
srhthu / LM-CompEval-Legal
View on GitHub
Code for the paper "A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction"
☆13Oct 20, 2023Updated 2 years ago
hyintell / awesome-refreshing-llms
View on GitHub
EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.
☆135Dec 12, 2023Updated 2 years ago
hyp1231 / ICLR2023-OpenReviewData
View on GitHub
Crawl & visualize ICLR papers and reviews.
☆18Nov 5, 2022Updated 3 years ago
chaojiang06 / neural-Jacana
View on GitHub
This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.
☆20Feb 12, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
voidism / DoLa
View on GitHub
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆557Jul 12, 2026Updated last week
freesunshine0316 / lab-conv-asa
View on GitHub
The project on Conversational Aspect Sentiment Analysis (CASA)
☆13Oct 8, 2022Updated 3 years ago
cambridgeltl / multi3woz
View on GitHub
The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte…
☆17Jan 15, 2024Updated 2 years ago
qingyu-qc / gpt_bionlp_benchmark
View on GitHub
☆25Jan 15, 2024Updated 2 years ago
1429904852 / KEF
View on GitHub
[COLING 2022] Learning from Adjective-Noun Pairs: A Knowledge-enhanced Framework for Target-Oriented Multimodal Sentiment Classification
☆14Apr 19, 2023Updated 3 years ago
yangheng95 / metric-visualizer
View on GitHub
For easy metric logging and visualization
☆14Jan 31, 2025Updated last year
AI21Labs / in-context-ralm
View on GitHub
☆295Dec 20, 2023Updated 2 years ago