corl-team / rebasedLinks

Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"

☆166

Alternatives and similar repositories for rebased

Users that are interested in rebased are comparing it to the libraries listed below

Sorting:

FusionBrainLab / LLM-Microscope
☆70Updated last year
Zyphra / Zamba2
PyTorch implementation of models from the Zamba2 series.
☆186Updated 10 months ago
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆70Updated last year
lucidrains / PEER-pytorch
Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind
☆132Updated last month
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆100Updated last year
PgLoLo / optiacts
☆20Updated last year
Zyphra / tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆130Updated last year
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆197Updated last year
VikhrModels / mctslib
☆31Updated last year
schwartz-lab-NLP / TOVA
Token Omission Via Attention
☆127Updated last year
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆73Updated 7 months ago
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆202Updated last year
IlyaGusev / ping_pong_bench
A benchmark for role-playing language models
☆112Updated 6 months ago
LucasPrietoAl / grokking-at-the-edge-of-numerical-stability
☆105Updated 4 months ago
epfml / DenseFormer
☆82Updated last year
dvruette / barrel-rec-pytorch
☆53Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108Updated 9 months ago
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆103Updated 11 months ago
euclaise / supertrainer2000
☆50Updated last year
fal-ai / diffusion-speedrun
Focused on fast experimentation and simplicity
☆75Updated 11 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated 7 months ago
SmerkyG / gptcore
Fast modular code to create and train cutting edge LLMs
☆68Updated last year
tanaymeh / mamba-train
A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM
☆61Updated last year
BorealisAI / neuzip
Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…
☆60Updated last year
NVIDIA / ngpt
Normalized Transformer (nGPT)
☆194Updated last year
apple / ml-sigma-reparam
☆314Updated last year
QuixiAI / grokadamw
☆136Updated last year
ethansmith2000 / fsdp_optimizers
supporting pytorch FSDP for optimizers
☆84Updated last year
Zyphra / Zyda_processing
☆39Updated last year
astramind-ai / BitMat
An efficent implementation of the method proposed in "The Era of 1-bit LLMs"
☆155Updated last year