chengxiang / LinearTransformer
PyTorch code for experiments on Linear Transformers
☆21 · Updated last year
Alternatives and similar repositories for LinearTransformer
Users who are interested in LinearTransformer are comparing it to the libraries listed below.
- ☆70 · Updated 7 months ago
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode… ☆17 · Updated 7 months ago
- Omnigrok: Grokking Beyond Algorithmic Data ☆58 · Updated 2 years ago
- ☆26 · Updated 2 weeks ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023). ☆48 · Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆102 · Updated 2 years ago
- ☆233 · Updated last year
- ☆32 · Updated 2 years ago
- Bayesian Low-Rank Adaptation for Large Language Models ☆34 · Updated last year
- Neural Tangent Kernel Papers ☆115 · Updated 6 months ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature ☆156 · Updated 3 weeks ago
- Efficient empirical NTKs in PyTorch ☆18 · Updated 3 years ago
- Welcome to the 'In Context Learning Theory' Reading Group ☆29 · Updated 8 months ago
- Universal Neurons in GPT2 Language Models ☆30 · Updated last year
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective ☆59 · Updated 4 months ago
- Deep Learning & Information Bottleneck ☆61 · Updated 2 years ago
- Code for experiments on transformers using Markovian data. ☆17 · Updated 7 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆35 · Updated 3 months ago
- ☆35 · Updated 6 months ago
- ☆28 · Updated last year
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral) ☆13 · Updated 11 months ago
- ☆99 · Updated 5 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024) ☆32 · Updated 8 months ago
- Source code for the paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models" ☆27 · Updated last year
- nanoGPT-like codebase for LLM training ☆100 · Updated 2 months ago
- Sparse Autoencoder Training Library ☆53 · Updated 2 months ago
- ☆53 · Updated last year
- A simple PyTorch implementation of influence functions. ☆89 · Updated last year
- ☆29 · Updated 3 months ago
- ☆35 · Updated 6 months ago