nkandpa2/long_tail_knowledge

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nkandpa2/long_tail_knowledge)

nkandpa2 / long_tail_knowledge

Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"

☆77

Alternatives and similar repositories for long_tail_knowledge

Users that are interested in long_tail_knowledge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yikee / Knowledge_Conflict
View on GitHub
Resolving Knowledge Conflicts in Large Language Models, COLM 2024
☆18Oct 7, 2025Updated 9 months ago
McGill-NLP / retriever-lm-reasoning
View on GitHub
Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…
☆28Nov 2, 2023Updated 2 years ago
machelreid / m2d2
View on GitHub
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Nov 21, 2022Updated 3 years ago
martiansideofthemoon / relic-retrieval
View on GitHub
Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).
☆20May 14, 2022Updated 4 years ago
SimengSun / ChapterBreak
View on GitHub
☆12Jun 5, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
princeton-nlp / continual-factoid-memorization
View on GitHub
Continual Memorization of Factoids in Large Language Models
☆12Nov 20, 2024Updated last year
swiseman / neighbor-splicing
View on GitHub
☆11Jan 2, 2022Updated 4 years ago
allenai / feb
View on GitHub
Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts"
☆12Apr 27, 2022Updated 4 years ago
karlstratos / ammi
View on GitHub
☆11Jul 15, 2020Updated 6 years ago
mlfoundations / scaling
View on GitHub
Language models scale reliably with over-training and on downstream tasks
☆102Apr 2, 2024Updated 2 years ago
AlexTMallen / adaptive-retrieval
View on GitHub
☆192Jul 2, 2025Updated last year
trestad / mitigating-reversal-curse
View on GitHub
Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'
☆14Aug 2, 2024Updated last year
kaistAI / factual-knowledge-acquisition
View on GitHub
☆25Dec 12, 2025Updated 7 months ago
AI-secure / InfoBERT
View on GitHub
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Y…
☆86Oct 25, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
yasumasaonoe / entity_knowledge_propagation
View on GitHub
☆17Aug 2, 2023Updated 2 years ago
amazon-science / irgr
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
shtoshni / g2p
View on GitHub
Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models
☆15Feb 20, 2019Updated 7 years ago
eaclark07 / engen
View on GitHub
Text generation with entities as context
☆30Jun 13, 2018Updated 8 years ago
EleutherAI / pile_dedupe
View on GitHub
Pile Deduplication Code
☆18May 15, 2023Updated 3 years ago
swj0419 / in-context-pretraining
View on GitHub
☆57Apr 11, 2024Updated 2 years ago
liujch1998 / rainier
View on GitHub
☆29Feb 17, 2024Updated 2 years ago
jiacheng-ye / ZeroGen
View on GitHub
[EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”.
☆47Feb 18, 2022Updated 4 years ago
crowsonkb / torch-dist-utils
View on GitHub
Utilities for PyTorch distributed
☆26Feb 27, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
anthonywchen / MOCHA
View on GitHub
Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".
☆16May 3, 2022Updated 4 years ago
XinshuangL / SELF-PARAM
View on GitHub
The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"
☆15May 18, 2025Updated last year
Hritikbansal / jpo
View on GitHub
☆13Jul 2, 2025Updated last year
AlexWan0 / rag-convincingness
View on GitHub
☆29Feb 26, 2024Updated 2 years ago
OpenCausaLab / MORE
View on GitHub
☆15Jan 9, 2026Updated 6 months ago
socialfoundations / tttlm
View on GitHub
Test-time-training on nearest neighbors for large language models
☆50Apr 18, 2024Updated 2 years ago
DevSinghSachan / emdr2
View on GitHub
Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 20…
☆110Apr 18, 2022Updated 4 years ago
Xingwei-Tan / hyper-event-TempRel
View on GitHub
Poincaré Event Temporal Embeddings and Hyperbolic GRU for Event TempRel Extraction
☆11Nov 8, 2021Updated 4 years ago
allenai / DrawEduMath
View on GitHub
Can VLMs understand students' hand-drawn math work?
☆19Jan 20, 2026Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
swj0419 / detect-pretrain-code
View on GitHub
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…
☆243Nov 3, 2023Updated 2 years ago
princeton-nlp / QuRating
View on GitHub
[ICML 2024] Selecting High-Quality Data for Training Language Models
☆204Dec 8, 2025Updated 7 months ago
klimzaporojets / consistent-EL
View on GitHub
Implementation of our paper "Towards Consistent Document-Level Entity Linking: Joint Models for Entity Linking and Coreference Resolution…
☆11Nov 13, 2022Updated 3 years ago
collin-burns / discovering_latent_knowledge
View on GitHub
☆287Mar 2, 2024Updated 2 years ago
allenai / everyday-things
View on GitHub
☆17Dec 6, 2023Updated 2 years ago
ZHZisZZ / weak-to-strong-search
View on GitHub
[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
☆67Dec 10, 2024Updated last year
p-lambda / dsir
View on GitHub
DSIR large-scale data selection framework for language model training
☆275Apr 7, 2024Updated 2 years ago