Offical implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS2024 Oral)
☆36Jan 18, 2025Updated last year
Alternatives and similar repositories for MetaLA
Users that are interested in MetaLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implement spike-drive using OR residual connection and propose SynA attention for natural pruning.(Under Review)☆13Mar 31, 2024Updated 2 years ago
- This is simple code of SpikedAttention (Neurips 2024)☆23Mar 30, 2025Updated last year
- Offical implementation of "Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation …☆229May 10, 2024Updated 2 years ago
- [TNNLS 2024] Implementation of "TCJA-SNN: Temporal-Channel Joint Attention for Spiking Neural Networks"☆61Apr 16, 2024Updated 2 years ago
- ☆55Jan 21, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This project contains code for the paper titled "SpikingBERT: Distilling BERT to Train Spiking Language Models Using Implicit Differentia…☆28Feb 21, 2024Updated 2 years ago
- [Neural Networks] SpikeBERT: A Language Spikformer Learned from BERT with Knowledge Distillation☆30Apr 11, 2025Updated last year
- ☆70Jul 8, 2025Updated 11 months ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Offical implementation of "Quantized Spike-driven Transformer" (ICLR2025)☆33Dec 23, 2025Updated 5 months ago
- [CVPR 2026 Highlight] Official implementation of Log-linear Sparse Attention (LLSA).☆78May 1, 2026Updated last month
- WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models☆32Feb 13, 2026Updated 3 months ago
- Implementation of "A Hybrid ANN-SNN Architecture for Low-Power and Low-Latency Visual Perception". CVPRW 2024☆41May 2, 2024Updated 2 years ago
- high-performance linear attention kernel library built on TileLang☆531May 7, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Offical implementation of "Spike-driven Transformer" (NeurIPS2023)☆313Mar 18, 2024Updated 2 years ago
- Synth-Empathy: Towards High-Quality Synthetic Empathy Data☆18Feb 28, 2025Updated last year
- ☆23Nov 6, 2022Updated 3 years ago
- Offical implementation of High-Performance Temporal Reversible Spiking Neural Networks with $O(L)$ Training Memory and $O(1)$ Inference C…☆22Feb 3, 2026Updated 4 months ago
- ☆18May 14, 2025Updated last year
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆259Jan 31, 2025Updated last year
- 🔥 A minimal training framework for scaling FLA models☆391Apr 22, 2026Updated last month
- Offical code of "QKFormer: Hierarchical Spiking Transformer using Q-K Attention"☆146May 25, 2026Updated 2 weeks ago
- Official Code Repository for the paper "Key-value memory in the brain"☆32Feb 25, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2025] Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-initialization☆24Oct 5, 2025Updated 8 months ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- Triton implement of bi-directional (non-causal) linear attention☆76Mar 1, 2026Updated 3 months ago
- The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Ch…☆29Jul 15, 2025Updated 10 months ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 6 years ago
- [CVPR2022] SVIP: Sequence VerIfication for Procedures in Videos☆24Feb 24, 2023Updated 3 years ago
- [ECCV 2024] Official implementation of the paper "EventBind: Learning a Unified Representation to Bind Them All for Event-based Open-worl…☆36Oct 9, 2025Updated 8 months ago
- ☆50May 20, 2025Updated last year
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆33Nov 11, 2024Updated last year
- Research about dataflow architecture☆14Nov 30, 2023Updated 2 years ago
- SyOPs counter for spiking neural networks☆75May 6, 2023Updated 3 years ago
- Code accompanying paper "Coordinated Proximal Policy Optimization"☆10Mar 26, 2022Updated 4 years ago
- 2019~2021年间Zero-shot/Data-free知识蒸馏的论文合集☆11Sep 8, 2021Updated 4 years ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆39Nov 11, 2025Updated 6 months ago
- 一个基于AXI接口的PL端卷积加速器,可由PS端调用☆12Apr 15, 2023Updated 3 years ago