Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"
☆82 · Jul 31, 2023 · Updated 2 years ago
Alternatives and similar repositories for LLM-Knowledge-Boundary
Users who are interested in LLM-Knowledge-Boundary compare it with the repositories listed below.
- YuLan-IR: Information Retrieval Boosted LMs ☆220 · Mar 4, 2024 · Updated 2 years ago
- The repository of the ACCV 2024 paper "FG-CXR: A Radiologist-Aligned Gaze Dataset for Enhancing Interpretability in Chest X-Ray Report Ge… ☆11 · Jul 28, 2025 · Updated 7 months ago
- Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023) ☆22 · Oct 8, 2023 · Updated 2 years ago
- This repository lists papers, codes, and datasets in Biomedical Text Summarisation based on PLM ☆23 · Oct 4, 2022 · Updated 3 years ago
- Codes and Pre-trained models for RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training [ACM MM 202… ☆29 · Nov 2, 2023 · Updated 2 years ago
- Code for the COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction" ☆10 · Oct 20, 2022 · Updated 3 years ago
- Codes for the EMNLP'2020 paper "Predicting Clinical Trial Results by Implicit Evidence Integration". ☆14 · Jan 13, 2021 · Updated 5 years ago
- ☆21 · Nov 27, 2025 · Updated 3 months ago
- Instructions and demonstrations for building a GLM capable of formal logical reasoning ☆54 · Sep 3, 2024 · Updated last year
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models. ☆554 · Feb 12, 2024 · Updated 2 years ago
- ☆25 · Dec 13, 2024 · Updated last year
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments". ☆20 · Oct 17, 2023 · Updated 2 years ago
- [ACCV2024 (Oral)] Official pytorch implementation of X-RGen ☆19 · Jan 20, 2025 · Updated last year
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24) ☆27 · Oct 3, 2025 · Updated 5 months ago
- ☆29 · Apr 8, 2025 · Updated 10 months ago
- Momentum Decoding: Open-ended Text Generation as Graph Exploration ☆19 · Jan 27, 2023 · Updated 3 years ago
- Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large … ☆1,076 · Sep 27, 2025 · Updated 5 months ago
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight) ☆23 · Feb 11, 2026 · Updated 3 weeks ago
- Paper collections of retrieval-based (augmented) language model. ☆232 · May 24, 2024 · Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆119 · Sep 12, 2024 · Updated last year
- ☆59 · Aug 1, 2023 · Updated 2 years ago
- [EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627 ☆511 · Oct 9, 2024 · Updated last year
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks ☆50 · Sep 4, 2025 · Updated 6 months ago
- Implementation of the BLUE benchmark with Transformers. ☆20 · Feb 16, 2024 · Updated 2 years ago
- ☆11 · Jun 21, 2025 · Updated 8 months ago
- ☆14 · Jan 6, 2025 · Updated last year
- ☆16 · Dec 21, 2023 · Updated 2 years ago
- ☆10 · Jul 5, 2023 · Updated 2 years ago
- ☆25 · Jan 15, 2024 · Updated 2 years ago
- Code for paper Towards Mitigating LLM Hallucination via Self Reflection ☆30 · Oct 9, 2023 · Updated 2 years ago
- Forward-Looking Active REtrieval-augmented generation (FLARE) ☆667 · Nov 20, 2023 · Updated 2 years ago
- ☆12 · Jan 25, 2024 · Updated 2 years ago
- ☆11 · Oct 2, 2023 · Updated 2 years ago
- Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning; releases the dataset and the model weights ☆13 · May 26, 2025 · Updated 9 months ago
- ☆18 · Apr 5, 2025 · Updated 11 months ago
- Code for the paper "A Comprehensive Evaluation of Large Language Models on Legal Judgment Prediction" ☆12 · Oct 20, 2023 · Updated 2 years ago
- ☆39 · Dec 26, 2025 · Updated 2 months ago
- [ACM MM 2025 🔥🔥] MIRA: A first-of-its-kind medical RAG framework that fuses image features and retrieved knowledge with dynamic contex… ☆18 · Aug 28, 2025 · Updated 6 months ago
- 🩻 NV-Reason-CXR-3B is a specialized vision-language model designed for medical reasoning and interpretation of chest X-ray images. ☆43 · Feb 25, 2026 · Updated last week