mlfoundations / open_lm
A repository for research on medium-sized language models.
☆491 · Updated last month
Alternatives and similar repositories for open_lm:
Users interested in open_lm are comparing it to the libraries listed below.
- Scaling Data-Constrained Language Models ☆333 · Updated 4 months ago
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch ☆501 · Updated 3 months ago
- Large Context Attention ☆682 · Updated 3 weeks ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware. ☆701 · Updated 4 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆296 · Updated last year
- Minimalistic large language model 3D-parallelism training ☆1,483 · Updated this week
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆724 · Updated this week
- ☆496 · Updated 3 months ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning ☆584 · Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆252 · Updated 7 months ago
- RewardBench: the first evaluation tool for reward models. ☆505 · Updated this week
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆451 · Updated 11 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆803 · Updated last week
- Scalable toolkit for efficient model alignment ☆719 · Updated this week
- Multipack distributed sampler for fast padding-free training of LLMs ☆184 · Updated 6 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs. ☆397 · Updated 10 months ago
- Distributed trainer for LLMs ☆557 · Updated 9 months ago
- Batched LoRAs ☆338 · Updated last year
- ☆502 · Updated 5 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1 GPU + 1 Day ☆255 · Updated last year
- Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture" ☆547 · Updated last month
- Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in Pytorch ☆306 · Updated 8 months ago
- Official PyTorch implementation of QA-LoRA ☆126 · Updated 11 months ago
- Inference code for Persimmon-8B ☆416 · Updated last year
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff" ☆221 · Updated this week
- A bagel, with everything. ☆316 · Updated 10 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆297 · Updated 2 months ago
- Helpful tools and examples for working with flex-attention ☆635 · Updated this week
- [ICML 2024] CLLMs: Consistency Large Language Models ☆372 · Updated 3 months ago
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention… ☆286 · Updated 9 months ago