miniHuiHui / awesome-high-order-neural-network
☆39Updated last month
Related projects
Alternatives and complementary repositories for awesome-high-order-neural-network
- ☆181Updated 11 months ago
- tinybig for deep function learning☆36Updated this week
- Implementation of Switch Transformers from the paper: "Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficien…☆55Updated last week
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆74Updated this week
- A repository for DenseSSMs☆88Updated 7 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆43Updated last month
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".☆73Updated 4 months ago
- [AAAI 2024] PDE+: Enhancing Generalization via PDE with Adaptive Distributional Diffusion☆21Updated 8 months ago
- A library for calculating the FLOPs in the forward() process based on torch.fx☆81Updated 2 months ago
- Implementation of Griffin from the paper: "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆49Updated last week
- A lecture note for understanding deep learning☆209Updated last week
- State Space Models☆63Updated 6 months ago
- Implementations of various linear RNN layers using pytorch and triton☆46Updated last year
- [NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.☆58Updated 9 months ago
- ICLR2024 statistics☆47Updated 11 months ago
- ☆131Updated 2 months ago
- ☆96Updated last week
- Implementation of Forward Forward Network proposed by Hinton in NIPS 2022.☆162Updated last year
- Official codebase for the paper "Revisiting Sparse Convolutional Model for Visual Recognition"☆123Updated last year
- Collection of papers on state-space models☆556Updated 2 weeks ago
- Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf☆304Updated 4 months ago
- A curated list of Model Merging methods.☆83Updated 2 months ago
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".☆77Updated last year
- EasyLiterature is an open-sourced, Python-based command line tool for automatic literature management.☆238Updated 2 months ago
- ☆169Updated 3 months ago
- Awesome list of papers that extend Mamba to various applications.☆128Updated 2 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆24Updated 2 weeks ago
- Reading list for research topics in state-space models☆241Updated 2 weeks ago
- Official implementation of "Spike-driven Transformer" (NeurIPS2023)☆220Updated 8 months ago
- clone/download repositories from https://anonymous.4open.science/☆85Updated 2 years ago