abacusai / Long-ContextLinks

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval capabilities with context expansion. We also include key experimental results and instructions for reproducing and building on them.

☆591

Alternatives and similar repositories for Long-Context

Users that are interested in Long-Context are comparing it to the libraries listed below

Sorting:

OpenLemur / Lemur
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆552Updated last year
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆702Updated last year
DachengLi1 / LongChat
Official repository for LongChat and LongEval
☆523Updated last year
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆501Updated last year
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆820Updated 2 years ago
dzhulgakov / llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
☆371Updated last year
salesforce / xgen
Salesforce open-source LLMs with 8k sequence length.
☆720Updated 5 months ago
yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆302Updated 2 years ago
LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆304Updated 10 months ago
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆951Updated 9 months ago
nlpxucan / evol-instruct
☆270Updated 2 years ago
Victorwz / LongMem
Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".
☆801Updated last year
datamllab / LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
☆657Updated last year
nexusflowai / NexusRaven
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…
☆317Updated last year
SkunkworksAI / hydra-moe
☆416Updated last year
bigcode-project / Megatron-LM
Ongoing research training transformer models at scale
☆389Updated 11 months ago
nexusflowai / NexusRaven-V2
☆414Updated last year
kuleshov-group / llmtools
Finetuning Large Language Models on One Consumer GPU in 2 Bits
☆726Updated last year
zeno-ml / zeno-build
Build, evaluate, understand, and fix LLM-based apps
☆489Updated last year
conceptofmind / toolformer
☆365Updated 2 years ago
kaistAI / SelFee
Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"
☆227Updated 2 years ago
IBM / Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
☆1,148Updated 2 months ago
huggingface / cosmopedia
☆524Updated 8 months ago
sabetAI / BLoRA
batched loras
☆344Updated last year
GAIR-NLP / factool
FacTool: Factuality Detection in Generative AI
☆879Updated 11 months ago
jondurbin / bagel
A bagel, with everything.
☆322Updated last year
zphang / minimal-llama
☆458Updated last year
reasoning-machines / pal
PaL: Program-Aided Language Models (ICML 2023)
☆502Updated 2 years ago
apoorvumang / prompt-lookup-decoding
☆549Updated 10 months ago