[EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning'
☆12 · Dec 15, 2024 · Updated last year
Alternatives and similar repositories for AdaZeta
Users who are interested in AdaZeta are comparing it to the libraries listed below
- Code for "LLM Embeddings Improve Test-time Adaptation to Tabular Y|X-Shifts" · ☆12 · Oct 17, 2024 · Updated last year
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021 · ☆13 · Nov 3, 2021 · Updated 4 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent · ☆17 · Sep 8, 2022 · Updated 3 years ago
- Zeroth-Order Fine-Tuning of LLMs in Random Subspaces (ICCV 2025) · ☆17 · Nov 22, 2024 · Updated last year
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models · ☆39 · Jan 9, 2025 · Updated last year
- This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models. · ☆21 · Jan 10, 2022 · Updated 4 years ago
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022 · ☆15 · Apr 17, 2023 · Updated 2 years ago
- Enhanced Explainable Neural Network · ☆10 · Dec 25, 2021 · Updated 4 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" · ☆11 · Apr 15, 2024 · Updated last year
- Rad-cGAN v1.0: Radar-based precipitation nowcasting model with conditional Generative Adversarial Networks for multiple dam domains · ☆11 · Jul 22, 2022 · Updated 3 years ago
- ☆53 · Mar 18, 2025 · Updated 11 months ago
- ☆47 · May 25, 2025 · Updated 9 months ago
- BBO optimiser · ☆11 · Feb 11, 2020 · Updated 6 years ago
- ☆10 · Aug 16, 2023 · Updated 2 years ago
- JSSP dataset for LLMs · ☆16 · May 29, 2025 · Updated 9 months ago
- ☆13 · Feb 4, 2025 · Updated last year
- ☆11 · Jan 13, 2026 · Updated last month
- ☆10 · Oct 26, 2022 · Updated 3 years ago
- SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks · ☆14 · Mar 2, 2023 · Updated 3 years ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo… · ☆13 · Nov 1, 2022 · Updated 3 years ago
- solver for discrete Mixed Observable Markov Decision Processes · ☆11 · Oct 30, 2020 · Updated 5 years ago
- ☆11 · Dec 14, 2022 · Updated 3 years ago
- Temporal summarization framework · ☆10 · Dec 4, 2023 · Updated 2 years ago
- Deep Generative Model (Torch) · ☆11 · Apr 19, 2016 · Updated 9 years ago
- Official implementation of the paper "On the Importance of Environments in Human-Robot Coordination", published in RSS 2021. · ☆16 · May 1, 2024 · Updated last year
- ☆10 · Apr 15, 2022 · Updated 3 years ago
- ☆12 · Feb 27, 2023 · Updated 3 years ago
- Implementation of "Federated Full-Parameter Tuning of Billion-Sized Language Models with Communication Cost under 18 Kilobytes" (https://… · ☆13 · May 6, 2024 · Updated last year
- Multi-resource Dynamic Coordinated Planning of Flexible Distribution Network · ☆15 · Jun 11, 2024 · Updated last year
- Python Exploitation Framework · ☆30 · Updated this week
- Comparing sequential forecasters via confidence sequences & e-processes · ☆11 · Oct 24, 2023 · Updated 2 years ago
- ☆13 · Feb 27, 2024 · Updated 2 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?" (published in TMLR) · ☆12 · Jul 10, 2022 · Updated 3 years ago
- Software package for intertemporal pricing optimization under reference effects and consumer heterogeneity estimation. Please see README.… · ☆10 · Mar 7, 2024 · Updated 2 years ago
- ☆12 · Oct 5, 2020 · Updated 5 years ago
- AutoLibra: Metric Induction for Agents from Open-Ended Human Feedback · ☆17 · Oct 15, 2025 · Updated 4 months ago
- ☆10 · Nov 1, 2019 · Updated 6 years ago
- Code and webpages for our study on teaching humans to defer to an AI · ☆12 · Nov 6, 2023 · Updated 2 years ago
- Temporal and Causal Relation extraction module for the Newsreader project. · ☆10 · Oct 26, 2015 · Updated 10 years ago