yifanycc/AdaZeta

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yifanycc/AdaZeta)

yifanycc / AdaZeta

[EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning'

☆13

Alternatives and similar repositories for AdaZeta

Users that are interested in AdaZeta are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zimingyy / SubZero
View on GitHub
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces (ICCV 2025)
☆20Nov 22, 2024Updated last year
yifanycc / loretta
View on GitHub
[NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
☆39Jan 9, 2025Updated last year
OpenAssistantBot / OpenAssistantBot
View on GitHub
Chatbot for quickly finding answers to questions.
☆11Oct 25, 2020Updated 5 years ago
Astuary / Spry
View on GitHub
Code for "Thinking Forward: Memory-Efficient Federated Finetuning of Language Models" (NeurIPS 2024). Spry is a federated learning al…
☆13Oct 8, 2024Updated last year
lljbash / FastTT
View on GitHub
Performs a faster tensor train (TT) decomposition for large sparse data
☆14Sep 7, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zhenqincn / FedKSeed
View on GitHub
Implementation of "Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes" (https://…
☆13May 6, 2024Updated 2 years ago
microsoft / DGT
View on GitHub
Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent
☆16Sep 8, 2022Updated 3 years ago
SuyeonC / Rad-cGAN
View on GitHub
Rad-cGAN v1.0: Radar-based precipitation nowcasting model with conditional Generative Adversarial Networks for multiple dam domains
☆11Jul 22, 2022Updated 4 years ago
nyu-mll / msgs
View on GitHub
This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.
☆21Jan 10, 2022Updated 4 years ago
vitorpamplona / splitlearning
View on GitHub
Simple Python Socket-based Split Learning technique using PyTorch
☆14Mar 13, 2020Updated 6 years ago
William-wAng618 / M2PT
View on GitHub
Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning
☆29Mar 23, 2025Updated last year
robintyh1 / neurips2021-meta-gradient-offpolicy-evaluation
View on GitHub
Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021
☆13Nov 3, 2021Updated 4 years ago
tanganke / subspace_fusion
View on GitHub
Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"
☆14Mar 28, 2024Updated 2 years ago
benmltu / JES
View on GitHub
Joint entropy search
☆22Dec 7, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
namkoong-lab / LLM-Tabular-Shifts
View on GitHub
Code for "LLM Embeddings Improve Test-time Adaptation to Tabular Y|X-Shifts"
☆12Oct 17, 2024Updated last year
ZO-Bench / ZO-LLM
View on GitHub
[ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".
☆128Jul 6, 2025Updated last year
chuliang007 / resnet20_training
View on GitHub
☆11Aug 2, 2024Updated last year
avikram2 / RISCVPipelinedProcessor
View on GitHub
Pipelined Processor which implements RV32i Instruction Set. Also contains pipelined L1 4-way set-associative Instruction Cache, direct-ma…
☆15Dec 23, 2022Updated 3 years ago
abdelfattah-lab / BRAMAC
View on GitHub
☆10Nov 27, 2024Updated last year
lecoan / pytorch-RLE
View on GitHub
A implement of run-length encoding for Pytorch tensor using CUDA
☆14Apr 7, 2021Updated 5 years ago
slcz / gomoku-deep-learning
View on GitHub
gomoku AI with deep learning and monte carlo tree search
☆19Mar 23, 2018Updated 8 years ago
amazon-science / mezo_svrg
View on GitHub
Code the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"
☆12Jun 25, 2024Updated 2 years ago
zfgao66 / OPF
View on GitHub
Pytorch implementation of paper: Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization.
☆12May 18, 2023Updated 3 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
TalSchuster / CATs
View on GitHub
Confident Adaptive Transformers
☆15Apr 18, 2021Updated 5 years ago
flaport / photontorch_paper
View on GitHub
Data and visualizations for the photontorch paper (Scientific Reports)
☆18Aug 4, 2020Updated 5 years ago
OPTML-Group / DeepZero
View on GitHub
[ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…
☆72Oct 9, 2024Updated last year
TimotheeMickus / codwoe
View on GitHub
The CODWOE shared task invites you to compare two types of semantic descriptions: dictionary glosses and word embedding representations. …
☆12Jul 13, 2022Updated 4 years ago
zephyrtronium / bwst
View on GitHub
Burrows-Wheeler-Scott transform
☆14Jun 7, 2013Updated 13 years ago
donglin-wang / LotteryEnsemble
View on GitHub
A repository for LotteryFL re-implementation and experiments
☆13Dec 18, 2020Updated 5 years ago
hplt-project / data-analytics-tool
View on GitHub
HPLT Analytics
☆15Jul 22, 2026Updated last week
stober / isomap
View on GitHub
Isomap in Python
☆10Mar 1, 2013Updated 13 years ago
phitrann / arXivRAG
View on GitHub
A comprehensive tool designed to enhance the retrieval and generation of academic content from the arXiv database, leveraging advanced Re…
☆13Dec 30, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Zengwh02 / GlimpRouter
View on GitHub
GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts
☆16Apr 24, 2026Updated 3 months ago
sfox14 / darknet-zynq
View on GitHub
Accelerating DNN inference and training on Zynq
☆16Jul 22, 2020Updated 6 years ago
twinkle0331 / Xcompression
View on GitHub
[ICLR 2022] Code for paper "Exploring Extreme Parameter Compression for Pre-trained Language Models"(https://arxiv.org/abs/2205.10036)
☆22May 24, 2023Updated 3 years ago
RamyaLab / pluralistic-alignment
View on GitHub
The open-source repository for PAL: Sample-Efficient Personalized Reward Modeling for Pluralistic Alignment, which provides a general per…
☆17Aug 28, 2025Updated 11 months ago
MathIsAll / ZO-AdaMU
View on GitHub
This project is a implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-…
☆15Dec 12, 2023Updated 2 years ago
VILA-Lab / GBLM-Pruner
View on GitHub
Are gradient information useful for pruning of LLMs?
☆48Aug 23, 2025Updated 11 months ago
Srache / TempQT
View on GitHub
[TMM 2023] Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token
☆14Mar 21, 2024Updated 2 years ago