miniHuiHui / awesome-high-order-neural-network
☆42 Updated 3 months ago
Alternatives and similar repositories for awesome-high-order-neural-network:
Users that are interested in awesome-high-order-neural-network are comparing it to the libraries listed below
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di… ☆49 Updated 3 months ago
- ☆185 Updated last year
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark". ☆83 Updated 6 months ago
- A curated list of Model Merging methods. ☆89 Updated 4 months ago
- Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficien… ☆67 Updated 2 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning ☆142 Updated last month
- A lecture note for understanding deep learning ☆232 Updated this week
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o… ☆257 Updated 9 months ago
- ICLR2024 statistics ☆47 Updated last year
- ☆137 Updated 4 months ago
- Collection of papers on state-space models ☆568 Updated 3 weeks ago
- Decomposing and Editing Predictions by Modeling Model Computation ☆131 Updated 7 months ago
- EasyLiterature is an open-sourced, Python-based command line tool for automatic literature management. ☆246 Updated 4 months ago
- A library for calculating the FLOPs in the forward() process based on torch.fx ☆94 Updated 4 months ago
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries ☆29 Updated 7 months ago
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise… ☆145 Updated last year
- tinybig for deep function learning ☆59 Updated last month
- Awesome-Low-Rank-Adaptation ☆61 Updated 3 months ago
- summer school materials ☆44 Updated last year
- ☆123 Updated 10 months ago
- A repository for DenseSSMs ☆87 Updated 9 months ago
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks". ☆246 Updated last year
- Sharing my research toolchain ☆81 Updated last year
- Inference Speed Benchmark for Learning to (Learn at Test Time): RNNs with Expressive Hidden States ☆51 Updated 6 months ago
- Idempotent Generative Network's unofficial pytorch implementation ☆45 Updated last year
- Reading list for research topics in state-space models ☆253 Updated 3 weeks ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models" ☆51 Updated 2 months ago
- Welcome to the Awesome Feature Learning in Deep Learning Theory Reading Group! This repository serves as a collaborative platform for sch… ☆157 Updated 3 weeks ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129) ☆91 Updated last year
- ICLR2023 statistics ☆60 Updated last year