quanta-fine-tuning / quantaLinks

(NeurIPS 2024) QuanTA: Efficient High-Rank Fine-Tuning of LLMs with Quantum-Informed Tensor Adaptation

☆32

Alternatives and similar repositories for quanta

Users that are interested in quanta are comparing it to the libraries listed below

Sorting:

tnbar / awesome-tensorial-neural-networks
A thoroughly investigated survey for tensorial neural networks.
☆141Updated 10 months ago
zyushun / hessian-spectrum
Code for the paper: Why Transformers Need Adam: A Hessian Perspective
☆63Updated 8 months ago
ion-g-ion / torchTT
Tensor-Train decomposition in pytorch
☆76Updated 9 months ago
epfml / optML-pku
summer school materials
☆46Updated 2 years ago
andyjm3 / SLTrain
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)
☆36Updated last year
n-gao / pytorch-kfac
Pytorch implementation of KFAC - this is a port of https://github.com/tensorflow/kfac/
☆28Updated last year
vantienpham / Awesome-Tensor-Decomposition
😎 A curated list of tensor decomposition resources for model compression.
☆87Updated last week
AndPotap / einsum-search
☆33Updated last year
nasosger / MuToR
[NeurIPS '25] Multi-Token Prediction Needs Registers
☆24Updated 2 months ago
sail-sg / stde
Official implementation of Stochastic Taylor Derivative Estimator (STDE) NeurIPS2024
☆123Updated 11 months ago
zqOuO / GWT
☆13Updated 10 months ago
zoq / Awesome-Optimizer
Collect optimizer related papers, data, repositories
☆98Updated last year
pilancilab / matrix-compressor
Implementation of LPLR algorithm for matrix compression
☆31Updated 2 years ago
yifanycc / loretta
[NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
☆38Updated 10 months ago
deep-spin / adasplash
AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)
☆29Updated last month
osehmathias / lisa
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
☆36Updated last year
Lemon-cmd / energy-transformer-graph
This repository contains the official code for Energy Transformer---an efficient Energy-based Transformer variant for graph classificatio…
☆25Updated last year
YefanZhou / TempBalance
[NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training
☆35Updated 7 months ago
OPTML-Group / DeepZero
[ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…
☆65Updated last year
KindXiaoming / Omnigrok
Omnigrok: Grokking Beyond Algorithmic Data
☆62Updated 2 years ago
shikaiqiu / compute-better-spent
☆61Updated last year
BlinkDL / LinearAttentionArena
Here we will test various linear attention designs.
☆61Updated last year
JunLi-Galios / Optimization-on-Stiefel-Manifold-via-Cayley-Transform
Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform
☆43Updated 6 years ago
ZO-Bench / ZO-LLM
[ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".
☆117Updated 4 months ago
facebookresearch / PhysicsLM4
Physics of Language Models, Part 4
☆260Updated 3 months ago
pilancilab / Riemannian_Preconditioned_LoRA
source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"
☆32Updated last year
cjyaras / deep-lora-transformers
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)
☆13Updated last year
kwignb / NeuralTangentKernel-Papers
Neural Tangent Kernel Papers
☆119Updated 10 months ago
fangyuan-ksgk / selective-attention-transformer
Unofficial Implementation of Selective Attention Transformer
☆17Updated last year
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆85Updated last year