frankxu2004/knnlm-why

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/frankxu2004/knnlm-why)

frankxu2004 / knnlm-why

Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"

☆59

Alternatives and similar repositories for knnlm-why

Users that are interested in knnlm-why are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jxhe / efficient-knnlm
View on GitHub
Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)
☆75Jan 20, 2022Updated 4 years ago
princeton-nlp / TRIME
View on GitHub
[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674
☆193Jun 14, 2023Updated 3 years ago
neulab / knn-transformers
View on GitHub
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…
☆288Oct 20, 2022Updated 3 years ago
GSYfate / knnlm-limits
View on GitHub
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆24Apr 30, 2025Updated last year
urvashik / knnlm
View on GitHub
☆331Jun 7, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
xszheng2020 / memorization
View on GitHub
An Empirical Study of Memorization in NLP (ACL 2022)
☆13Jun 22, 2022Updated 4 years ago
tech-srl / id2vec
View on GitHub
☆11Dec 31, 2019Updated 6 years ago
pdufter / staticlama
View on GitHub
☆13Apr 16, 2021Updated 5 years ago
neulab / retomaton
View on GitHub
PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)
☆77Jul 16, 2022Updated 4 years ago
yuzhaouoe / pretraining-data-packing
View on GitHub
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆24Aug 18, 2024Updated last year
uds-lsv / TOKEN-is-a-MASK
View on GitHub
Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"
☆14Aug 19, 2022Updated 3 years ago
gucci-j / light-transformer-emnlp2021
View on GitHub
EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling
☆34Nov 21, 2021Updated 4 years ago
huggingface / olm-training
View on GitHub
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆98Feb 9, 2023Updated 3 years ago
neubig / coderx
View on GitHub
A highly sophisticated sequence-to-sequence model for code generation
☆41Jul 1, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
bigganbing / Fairseq_MorphTE
View on GitHub
[NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings
☆17Oct 29, 2022Updated 3 years ago
HazyResearch / embroid
View on GitHub
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Aug 12, 2023Updated 2 years ago
xnancy / russ
View on GitHub
☆16Apr 9, 2021Updated 5 years ago
danieldeutsch / sacrerouge
View on GitHub
SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.
☆150Oct 22, 2022Updated 3 years ago
AkariAsai / evidentiality_qa
View on GitHub
The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).
☆44Dec 25, 2022Updated 3 years ago
adapter-hub / hgiyt
View on GitHub
Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"
☆28Oct 3, 2021Updated 4 years ago
ahmetustun / hyperx
View on GitHub
☆21Dec 5, 2022Updated 3 years ago
huggingface / datasets-tagging
View on GitHub
A Streamlit app to add structured tags to a dataset card
☆23Jun 30, 2022Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
mlfoundations / scaling
View on GitHub
Language models scale reliably with over-training and on downstream tasks
☆102Apr 2, 2024Updated 2 years ago
Kaleidophon / nlp-uncertainty-zoo
View on GitHub
Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch.
☆55May 5, 2023Updated 3 years ago
ghrua / NgramRes
View on GitHub
☆23Nov 6, 2022Updated 3 years ago
allenai / few_shot_explanations
View on GitHub
Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"
☆29Apr 28, 2023Updated 3 years ago
salesforce / simplification
View on GitHub
☆23Jun 25, 2026Updated last month
UIC-Liu-Lab / DGA
View on GitHub
[EMNLP 2022] Adapting a Language Model While Preserving its General Knowledge
☆21Feb 12, 2023Updated 3 years ago
awebson / prompt_semantics
View on GitHub
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
☆84May 10, 2022Updated 4 years ago
gbegus / DeepPhonologyTool
View on GitHub
Train a fiwGAN or ciwGAN model using your own training data
☆14Oct 13, 2022Updated 3 years ago
wietsedv / xpos
View on GitHub
Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)
☆19May 17, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chorusai / brave
View on GitHub
Brave is a simple visualisation library for NLP information extraction, built on top of embedded BRAT.
☆15Dec 25, 2019Updated 6 years ago
philschmid / multilingual-serverless-qa-aws-lambda
View on GitHub
☆10Dec 17, 2020Updated 5 years ago
AlexTMallen / adaptive-retrieval
View on GitHub
☆192Jul 2, 2025Updated last year
facebookresearch / text_characterization_toolkit
View on GitHub
A library for computing diverse text characteristics and using them to analyze data sets and models with ease.
☆42Aug 18, 2022Updated 3 years ago
giganticode / jemma
View on GitHub
JEMMA: An Extensible Java dataset for Many ML4Code Applications
☆19Dec 12, 2022Updated 3 years ago
xinyadu / gtt
View on GitHub
Template Filling with Generative Transformers
☆22Jun 8, 2021Updated 5 years ago
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12Mar 18, 2023Updated 3 years ago