abertsch72/unlimiformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/abertsch72/unlimiformer)

abertsch72 / unlimiformer

Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

☆1,062

Alternatives and similar repositories for unlimiformer

Users that are interested in unlimiformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenGVLab / LLaMA-Adapter
View on GitHub
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
☆5,916Mar 14, 2024Updated 2 years ago
CStanKonrad / long_llama
View on GitHub
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transform…
☆1,465Nov 7, 2023Updated 2 years ago
mosaicml / llm-foundry
View on GitHub
LLM training code for Databricks foundation models
☆4,431Mar 25, 2026Updated 4 months ago
google-research / longt5
View on GitHub
☆183May 26, 2023Updated 3 years ago
epfml / landmark-attention
View on GitHub
Landmark Attention: Random-Access Infinite Context Length for Transformers
☆426Dec 20, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Mivg / SLED
View on GitHub
The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper
☆70May 14, 2023Updated 3 years ago
openlm-research / open_llama
View on GitHub
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,533Jul 16, 2023Updated 3 years ago
jquesnelle / yarn
View on GitHub
YaRN: Efficient Context Window Extension of Large Language Models
☆1,740Apr 17, 2024Updated 2 years ago
facebookresearch / mega
View on GitHub
Sequence modeling with Mega.
☆303Jan 28, 2023Updated 3 years ago
lucidrains / recurrent-memory-transformer-pytorch
View on GitHub
Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch
☆424Jan 6, 2025Updated last year
BlinkDL / RWKV-LM
View on GitHub
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,639Updated this week
microsoft / torchscale
View on GitHub
Foundation Architecture for (M)LLMs
☆3,133Apr 11, 2024Updated 2 years ago
Lightning-AI / lit-llama
View on GitHub
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,083Jul 1, 2025Updated last year
artidoro / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,968Jun 10, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mit-han-lab / streaming-llm
View on GitHub
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,249Jul 11, 2024Updated 2 years ago
booydar / LM-RMT
View on GitHub
Recurrent Memory Transformer
☆159Aug 14, 2023Updated 2 years ago
nlpxucan / WizardLM
View on GitHub
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
☆9,480Jun 7, 2025Updated last year
CarperAI / trlx
View on GitHub
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
☆4,753Jan 8, 2024Updated 2 years ago
Victorwz / LongMem
View on GitHub
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
☆827Mar 30, 2024Updated 2 years ago
princeton-nlp / AutoCompressors
View on GitHub
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆337Sep 9, 2024Updated last year
tomaarsen / attention_sinks
View on GitHub
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆735Apr 10, 2024Updated 2 years ago
HazyResearch / safari
View on GitHub
Convolutions for Sequence Modeling
☆916Jun 13, 2024Updated 2 years ago
allenai / mmc4
View on GitHub
MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.
☆953Mar 19, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lucidrains / RETRO-pytorch
View on GitHub
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
☆879Oct 30, 2023Updated 2 years ago
microsoft / unilm
View on GitHub
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆22,171Jan 23, 2026Updated 6 months ago
FMInference / FlexLLMGen
View on GitHub
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,363Oct 28, 2024Updated last year
bojone / rerope
View on GitHub
Rectified Rotary Position Embeddings
☆394May 20, 2024Updated 2 years ago
booydar / recurrent-memory-transformer
View on GitHub
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
☆779Oct 25, 2024Updated last year
Vahe1994 / SpQR
View on GitHub
☆554Feb 8, 2026Updated 5 months ago
OpenLMLab / LOMO
View on GitHub
LOMO: LOw-Memory Optimization
☆994Jul 2, 2024Updated 2 years ago
salesforce / xgen
View on GitHub
Salesforce open-source LLMs with 8k sequence length.
☆727Jun 2, 2026Updated last month
microsoft / LMOps
View on GitHub
General technology for enabling AI capabilities w/ LLMs and MLLMs
☆4,444Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
togethercomputer / RedPajama-Data
View on GitHub
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
☆4,972Jun 3, 2026Updated last month
conceptofmind / PaLM
View on GitHub
An open-source implementation of Google's PaLM models
☆819Jun 21, 2024Updated 2 years ago
JIA-Lab-research / LongLoRA
View on GitHub
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,689Aug 14, 2024Updated last year
lucidrains / MEGABYTE-pytorch
View on GitHub
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
☆655Dec 27, 2024Updated last year
HazyResearch / TART
View on GitHub
TART: A plug-and-play Transformer module for task-agnostic reasoning
☆201Jun 22, 2023Updated 3 years ago
FasterDecoding / Medusa
View on GitHub
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
☆2,758Jun 25, 2024Updated 2 years ago
lupantech / chameleon-llm
View on GitHub
Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
☆1,140Dec 23, 2023Updated 2 years ago