JunLi-Galios / Optimization-on-Stiefel-Manifold-via-Cayley-TransformView external linksLinks
Efficient Riemannian Optimization on Stiefel Manifold via Cayley Transform
☆44Apr 26, 2019Updated 6 years ago
Alternatives and similar repositories for Optimization-on-Stiefel-Manifold-via-Cayley-Transform
Users that are interested in Optimization-on-Stiefel-Manifold-via-Cayley-Transform are comparing it to the libraries listed below
Sorting:
- [COLM 2025] DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation; 知乎:https://zhuanlan.zhihu.c…☆29Mar 5, 2025Updated 11 months ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆53Mar 8, 2021Updated 4 years ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆15Nov 4, 2024Updated last year
- ☆13Jan 15, 2025Updated last year
- a python library for cats and hypercats☆25Aug 29, 2025Updated 5 months ago
- ☆52Nov 5, 2024Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Nov 1, 2024Updated last year
- Hessian trace estimation using PyTorch and Hutch++☆20Oct 29, 2020Updated 5 years ago
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆24Nov 4, 2024Updated last year
- T-SVDNet: Exploring High-Order Prototypical Correlations for Multi-Source Domain Adaptation☆21Apr 11, 2023Updated 2 years ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆50Oct 21, 2023Updated 2 years ago
- JAX implementation of Learning to learn by gradient descent by gradient descent☆28Aug 5, 2025Updated 6 months ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.☆24Jun 23, 2020Updated 5 years ago
- [CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu…☆57Dec 30, 2021Updated 4 years ago
- Code repo for the paper "SpinQuant LLM quantization with learned rotations"☆372Feb 14, 2025Updated last year
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆31Mar 12, 2024Updated last year
- ☆29May 6, 2020Updated 5 years ago
- PyTorch and Torch implementation for our accepted CVPR 2020 paper (Oral): Controllable Orthogonalization in Training DNNs☆24Jan 19, 2021Updated 5 years ago
- Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…☆69Mar 7, 2024Updated last year
- LLM Inference with Microscaling Format☆34Nov 12, 2024Updated last year
- [NeurIPS 2020] Neural Manifold Ordinary Differential Equations (https://arxiv.org/abs/2006.10254)☆125Jul 6, 2023Updated 2 years ago
- ☆31Jan 23, 2026Updated 3 weeks ago
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.☆27Aug 30, 2022Updated 3 years ago
- [CVPR 2024] Friendly Sharpness-Aware Minimization☆36Oct 29, 2024Updated last year
- Experimental paper writing linter.☆35Sep 2, 2024Updated last year
- [NeurIPS '18] "Can We Gain More from Orthogonality Regularizations in Training Deep CNNs?" Official Implementation.☆130Dec 31, 2021Updated 4 years ago
- ☆10Apr 5, 2024Updated last year
- Pytorch implementation for "The Surprising Positive Knowledge Transfer in Continual 3D Object Shape Reconstruction"☆33Sep 9, 2022Updated 3 years ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 2 months ago
- ☆28Jul 16, 2025Updated 7 months ago
- Code for Neurips24 paper: QuaRot, an end-to-end 4-bit inference of large language models.☆483Nov 26, 2024Updated last year
- ☆35Jul 14, 2020Updated 5 years ago
- Implementation for the paper "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization"☆76Dec 8, 2019Updated 6 years ago
- Fast Hadamard transform in CUDA, with a PyTorch interface☆284Oct 19, 2025Updated 3 months ago
- An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization☆172Nov 26, 2025Updated 2 months ago
- ☆43Jan 30, 2024Updated 2 years ago
- A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks☆233Dec 27, 2018Updated 7 years ago
- ☆44May 3, 2024Updated last year
- Chinese word segmentation with the neural seq2seq model implement in pytorch☆10Dec 13, 2017Updated 8 years ago