gmlwns2000 / sea-attentionView external linksLinks
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
☆11Jun 20, 2025Updated 7 months ago
Alternatives and similar repositories for sea-attention
Users that are interested in sea-attention are comparing it to the libraries listed below
Sorting:
- [AAAI 2025] Official Implementation of "HDT: Hierarchical Discrete Transformer for Multivariate Time Series Forecasting"☆16Feb 17, 2025Updated last year
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 6 months ago
- ☆25May 14, 2019Updated 6 years ago
- ☆10Mar 13, 2024Updated last year
- ☆32Aug 24, 2022Updated 3 years ago
- Official Code Repository for the paper "KALA: Knowledge-Augmented Language Model Adaptation" (NAACL 2022)☆35Oct 17, 2023Updated 2 years ago
- Sample Average Approximation (SAA) in Newsvendor☆11Aug 15, 2020Updated 5 years ago
- code for paper "DRoC: Elevating Large Language Models for Complex Vehicle Routing via Decomposed Retrieval of Constraints"☆25Feb 4, 2025Updated last year
- DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactu…☆10Oct 9, 2024Updated last year
- Official Implementation of VarDrop(AAAI25)☆20Oct 23, 2025Updated 3 months ago
- [ICLR 2023] RC-MAE☆53Dec 18, 2023Updated 2 years ago
- ☆43Feb 21, 2022Updated 3 years ago
- Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks…☆42Nov 24, 2024Updated last year
- Learning Low-rank and Sparse Discriminative Correlation Filters for Coarse-to-Fine Visual Object Tracking☆10Apr 15, 2021Updated 4 years ago
- [AAAI 2026] Official repository of the EMAformer paper: "EMAformer: Enhancing Transformer through Embedding Armor for Time Series Forecas…☆34Dec 3, 2025Updated 2 months ago
- Official implementation of "COExpander: Adaptive Solution Expansion for Combinatorial Optimization".☆21Jun 28, 2025Updated 7 months ago
- Eye diagrams are used for visual analysis of the severity of inter symbol interference (ISI), accuracy of sampling timing extraction and …☆13Jul 18, 2020Updated 5 years ago
- Tactical Observation of RF GNSS Interference☆14Jun 25, 2020Updated 5 years ago
- ☆12Mar 24, 2021Updated 4 years ago
- Python code of information geometric causal inference☆12Aug 20, 2019Updated 6 years ago
- ☆15Jan 12, 2026Updated last month
- ☆13Jun 22, 2025Updated 7 months ago
- Multiple instance learning bag generation code using data from the ECOSTRESS Spectral Library V1.0.☆13Mar 25, 2020Updated 5 years ago
- Compare univariate and multivariate xLSTM models against Markov Chain model to predict future values based on historical temporal sequenc…☆12Jun 12, 2024Updated last year
- Blind Source Separation (BSS) refers to a problem where both the sources and the mixing methodology are unknown, only mixture signals are…☆11Aug 13, 2020Updated 5 years ago
- pytorch implementation for "Mutual Information Neural Estimation"☆11Dec 13, 2019Updated 6 years ago
- QGFN: Controllable Greediness with Action Values - Code☆11May 17, 2024Updated last year
- Notes and Translations of Great AI paper☆11Nov 11, 2025Updated 3 months ago
- ☆10Nov 21, 2023Updated 2 years ago
- A hierarchical Bayesian model accounting for endmember variability and abrupt spectral changes to unmix multitemporal hyperspectral image…☆10Nov 20, 2020Updated 5 years ago
- ☆10May 12, 2022Updated 3 years ago
- [NeurIPS 2025] An official source code for paper "L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models"☆22Oct 29, 2025Updated 3 months ago
- ☆12Nov 25, 2018Updated 7 years ago
- This repository is about my work "Infrared Small Target Using Tri-layer Template Local Difference Measure", including the link of paper, …☆12Apr 20, 2022Updated 3 years ago
- Official implementation of "Physics-Informed Long-Sequence Forecasting From Multi-Resolution Spatiotemporal Data".☆11Dec 12, 2022Updated 3 years ago
- Time Series Forecasting with Dynamic Graph Modeling☆15Aug 31, 2025Updated 5 months ago
- [NeurIPS 2024] Official Implementation of "SDformer: Similarity-driven Discrete Transformer For Time Series Generation"☆13May 23, 2025Updated 8 months ago
- MATLAB and Simulink models for minimum shift keying☆11Aug 21, 2020Updated 5 years ago
- Y. Wu, L. Jiao, X. Liu, F. Liu, S. Yang and L. Li, Domain Adaptation-aware Transformer for Hyperspectral Object Tracking. IEEE Transactio…☆12Jul 15, 2024Updated last year