Offical implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS2024 Oral)
☆36Jan 18, 2025Updated last year
Alternatives and similar repositories for MetaLA
Users that are interested in MetaLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Offical implementation of "Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation …☆222May 10, 2024Updated last year
- [TNNLS 2024] Implementation of "TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural Networks"☆62Apr 16, 2024Updated last year
- ☆16Feb 22, 2024Updated 2 years ago
- ☆55Jan 21, 2024Updated 2 years ago
- This project contains code for the paper titled "SpikingBERT: Distilling BERT to Train Spiking Language Models Using Implicit Differentia…☆28Feb 21, 2024Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- [Neural Networks] SpikeBERT: A Language Spikformer Learned from BERT with Knowledge Distillation☆27Apr 11, 2025Updated 11 months ago
- ☆69Jul 8, 2025Updated 9 months ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Enhancing the Performance of Transformer-based Spiking Neural Networks by SNN-optimized Downsampling with Precise Gradient Backpropagatio…☆46Jul 2, 2024Updated last year
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆30Feb 13, 2026Updated last month
- ☆22Nov 19, 2024Updated last year
- Offical implementation of "Spike-driven Transformer" (NeurIPS2023)☆308Mar 18, 2024Updated 2 years ago
- ☆23Nov 6, 2022Updated 3 years ago
- Offical implementation of High-Performance Temporal Reversible Spiking Neural Networks with $O(L)$ Training Memory and $O(1)$ Inference C…☆22Feb 3, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆253Jan 31, 2025Updated last year
- 🔥 A minimal training framework for scaling FLA models☆363Nov 15, 2025Updated 4 months ago
- Offical code of "QKFormer: Hierarchical Spiking Transformer using Q-K Attention" (NeurIPS 2024)☆143Jan 2, 2025Updated last year
- ☆36Jan 20, 2025Updated last year
- Official Code Repository for the paper "Key-value memory in the brain"☆31Feb 25, 2025Updated last year
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆25Oct 5, 2025Updated 6 months ago
- Triton implement of bi-directional (non-causal) linear attention☆73Mar 1, 2026Updated last month
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- Research about dataflow architecture☆12Nov 30, 2023Updated 2 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- ☆33Nov 11, 2024Updated last year
- SyOPs counter for spiking neural networks☆73May 6, 2023Updated 2 years ago
- 2019~2021年间Zero-shot/Data-free知识蒸馏的论文合集☆11Sep 8, 2021Updated 4 years ago
- 一个基于AXI接口的PL端卷积加速器,可由PS端调用☆12Apr 15, 2023Updated 2 years ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆35Nov 11, 2025Updated 4 months ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Offical implementation of "Scaling Spike-driven Transformer with Efficient Spike Firing Approximation Training" (IEEE T-PAMI2025)☆106May 20, 2025Updated 10 months ago
- Offical implementation of "Advancing Spiking Neural Networks towards Deep Residual Learning" (IEEE TNNLS 2024)☆14Aug 28, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆30Dec 6, 2023Updated 2 years ago
- ☆32Jan 7, 2024Updated 2 years ago
- [AAAI-25 Oral] Adaptive Calibration☆15Jul 6, 2025Updated 9 months ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆14May 14, 2024Updated last year
- HGRN2: Gated Linear RNNs with State Expansion☆57Aug 20, 2024Updated last year
- FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation [Efficient ML Model]☆48Feb 17, 2026Updated last month
- An efficient spiking variational autoencoder☆13Nov 13, 2023Updated 2 years ago