Sike-Wang / low-bit-ShampooView external linksLinks
4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)
☆13Feb 13, 2025Updated last year
Alternatives and similar repositories for low-bit-Shampoo
Users that are interested in low-bit-Shampoo are comparing it to the libraries listed below
Sorting:
- Zeroth-Order Fine-Tuning of LLMs in Random Subspaces (ICCV 2025)☆15Nov 22, 2024Updated last year
- Single-thread, end-to-end C++ implementation of the Bitnet (1.58-bit weight) model☆14Nov 17, 2024Updated last year
- ☆151Updated this week
- LCM Full Cycle Trainer for Ostris - Ai Toolkit☆16Aug 20, 2024Updated last year
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- QJL: 1-Bit Quantized JL transform for KV Cache Quantization with Zero Overhead☆31Jan 27, 2025Updated last year
- [ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer☆41Dec 9, 2024Updated last year
- [WACV 2025] 🌍🚗 SpaGBOL: Spatial-Graph-Based Orientated Localisation 📡🗺️☆14Apr 9, 2025Updated 10 months ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 2 months ago
- ☆53Dec 17, 2025Updated last month
- anonymous github for SGSR: Beyond Social Homophily: Score-based Generative Diffusion Models for Social Recommendations☆12Sep 18, 2025Updated 4 months ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated 10 months ago
- ☆10Apr 2, 2024Updated last year
- MATLAB function to fill an area with hatching ~~or speckling~~☆11Mar 4, 2018Updated 7 years ago
- An artificial matrix generator in C☆12Feb 16, 2023Updated 2 years ago
- ☆14Apr 14, 2025Updated 10 months ago
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset.☆16Sep 8, 2022Updated 3 years ago
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"☆10Oct 23, 2021Updated 4 years ago
- A compressed SDL_Surface format using the LZ4 compression library.☆14Sep 28, 2022Updated 3 years ago
- PolyLib official git.☆11Jan 27, 2026Updated 2 weeks ago
- ☆12Feb 12, 2025Updated last year
- A stream to RTL compiler based on MLIR and CIRCT☆16Nov 15, 2022Updated 3 years ago
- Single shot neural network pruning before training the model, based on connection sensitivity☆11Aug 7, 2019Updated 6 years ago
- A survey of manufacturer-provided DRAM operating parameters and timings as specified by DRAM chip datasheets from between 1970 and 2021. …☆11May 4, 2022Updated 3 years ago
- [QT] 随机抽奖转盘(重写他人)☆10Feb 27, 2019Updated 6 years ago
- HWFI: Hybrid Warping Fusion for Video Frame Interpolation. IJCV 2022☆11Sep 7, 2022Updated 3 years ago
- CoMeT is a new low-cost RowHammer mitigation that uses Count-Min Sketch-based aggressor row tracking, as described in our HPCA'24 paper h…☆11Jan 23, 2026Updated 3 weeks ago
- ☆10Jun 28, 2023Updated 2 years ago
- A merged read deduplication tool capable to perform merged read deduplication on single end data.☆12Sep 4, 2024Updated last year
- Locality sensitive hash functions for Tensorflow 2.0.☆12Feb 18, 2022Updated 3 years ago
- Sift through Haskell code for analysis purposes☆18Jul 24, 2018Updated 7 years ago
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated 11 months ago
- ☆13Jun 22, 2025Updated 7 months ago
- [IROS 2024] 🦜🌍 BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation 📡🗺️☆15Mar 4, 2025Updated 11 months ago
- Clust_mgr is an important compnent of KunlunBase. It provides a HTTP API for KunlunBase users to do cluster management, provisioning and …☆10Jun 13, 2023Updated 2 years ago
- Exploration of the Piece Table data structure in Haskell☆10Mar 17, 2017Updated 8 years ago
- Musings in GEMM (General Matrix Multiplication)☆14Dec 14, 2025Updated 2 months ago
- CLI utilty to work out proper constants for vpternlogic instruction☆13Jan 22, 2023Updated 3 years ago
- This simulator models multi core systems, intended primarily for studies on main memory management techniques. It models a trace-based ou…☆12Jan 18, 2016Updated 10 years ago