Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer
☆23Feb 11, 2025Updated last year
Alternatives and similar repositories for HiZOO
Users that are interested in HiZOO are comparing it to the libraries listed below
Sorting:
- ☆20Dec 5, 2024Updated last year
- Code the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"☆12Jun 25, 2024Updated last year
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆124Jul 6, 2025Updated 7 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆70Oct 9, 2024Updated last year
- Implementation of the FedPM framework by the authors of the ICLR 2023 paper "Sparse Random Networks for Communication-Efficient Federated…☆30Feb 10, 2023Updated 3 years ago
- Parse command line arguments by defining dataclasses☆13Oct 13, 2024Updated last year
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆36Apr 4, 2024Updated last year
- [IEEE TIP] Offical implementation for the work "BadCM: Invisible Backdoor Attack against Cross-Modal Learning".☆14Aug 30, 2024Updated last year
- ☆12Aug 17, 2022Updated 3 years ago
- ☆14Feb 2, 2021Updated 5 years ago
- An up-to-date list of progress made in next-generation AI.☆11Apr 2, 2023Updated 2 years ago
- ☆11Apr 21, 2023Updated 2 years ago
- Task Aware Downscaling for efficient storing and accurate reconstruction in image and video domain☆12Jul 25, 2024Updated last year
- Deep Neural Network Optimization Platform with Gradient-based, Gradient-Free Algorithms☆12Jan 13, 2020Updated 6 years ago
- BH hackathon☆14Apr 4, 2024Updated last year
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 11 months ago
- Code for the Secure Triplet Loss approach for biometric template security.☆10Apr 22, 2021Updated 4 years ago
- ☆13Jun 29, 2024Updated last year
- Preprint | Previously at GenBio ICML 2025☆18Aug 20, 2025Updated 6 months ago
- Randomized algorithm class at CU☆15Jul 8, 2025Updated 7 months ago
- Code for reviewers☆12Oct 8, 2024Updated last year
- ☆10Apr 25, 2024Updated last year
- ☆12Nov 21, 2024Updated last year
- ☆10May 31, 2023Updated 2 years ago
- SuperScanner Software (S3), part of SuperScanner project, is a open-source and completely free software environment to implement a low-co…☆11Mar 27, 2018Updated 7 years ago
- The dataset of our work where the application of portable Raman spectroscopy coupled with several supervised machine-learning techniques,…☆14Nov 21, 2019Updated 6 years ago
- The material is covered in my YouTube playlist "Data Wrangling with Python" available on YUNIKARN.☆15Dec 9, 2025Updated 2 months ago
- An introductory course on 5G standards which aims to provide hand-on knowledge on 5G system design and 5G-NR 3GPP standards using 5G Tool…☆23May 12, 2024Updated last year
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- [ICLR'24] Heterogeneous Personalized Federated Learning by Local-Global Updates Mixing via Convergence Rate☆13Jun 17, 2025Updated 8 months ago
- ECG Viewer is a graphical interpreter for the ECG Reader sensor. Written in Python with PyQT5 + pyqtgraph.☆11Mar 11, 2025Updated 11 months ago
- Exporter for low-level components (e.g. DCT coefficients, MVDs) from the h.264 codec based on the reference implementation.☆15Feb 7, 2023Updated 3 years ago
- ☆12Jul 6, 2022Updated 3 years ago
- Code for NeurIPS 2024 paper — Cross-Device Collaborative Test-Time Adaptation☆13Feb 28, 2025Updated last year
- Pytorch implementation of paper: Small Pre-trained Language Models Can be Fine-tuned as Large Models via Over-Parameterization.☆12May 18, 2023Updated 2 years ago
- Multi process and multi GPU snake training. Using A2C with a CNN.☆12Apr 8, 2023Updated 2 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Jul 31, 2023Updated 2 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago
- Implementation of PReLUNet by chainer (Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification: https…☆11Feb 2, 2017Updated 9 years ago