Sparse Attention with Linear Units
☆20Apr 21, 2021Updated 4 years ago
Alternatives and similar repositories for rectified-linear-attention
Users that are interested in rectified-linear-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning☆20Feb 4, 2022Updated 4 years ago
- Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?☆17Jan 6, 2021Updated 5 years ago
- This is a sample implementation of "Robust Graph Convolutional Networks Against Adversarial Attacks", KDD 2019.☆10Dec 8, 2020Updated 5 years ago
- ☆12Jul 19, 2023Updated 2 years ago
- RADLADS training code☆37May 7, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 8 months ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆28Aug 13, 2022Updated 3 years ago
- 2021sodic企业隐患排查赛道——top6水煮毛血旺方案分享☆11Jul 17, 2021Updated 4 years ago
- EgoBody3M Egocentric Body Tracking on a VR Headset using a Diverse Dataset☆22Oct 1, 2024Updated last year
- GCP + Kaggle Docker + VSCode☆15Feb 28, 2022Updated 4 years ago
- [ECCV 2024] EgoPoseFormer: A Simple Baseline for Stereo Egocentric 3D Human Pose Estimation☆20Mar 6, 2026Updated 2 weeks ago
- The repo for ACL2021 findings paper - Don't Miss the Labels: Label-semantic Argumented Meta-Learner for Few-Shot Text Classification☆15Mar 24, 2022Updated 4 years ago
- Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".☆16Dec 7, 2021Updated 4 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Oct 30, 2020Updated 5 years ago
- 基于预训练BERT和GAT的剧本角色情绪识别研究☆13Dec 15, 2023Updated 2 years ago
- ☆11Jun 28, 2020Updated 5 years ago
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)☆43Oct 20, 2022Updated 3 years ago
- [ECCV 2020] Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes☆12Dec 11, 2020Updated 5 years ago
- 🚀🚀🚀 [Journal Pre-print] Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review☆15Oct 13, 2025Updated 5 months ago
- ☆33Apr 12, 2021Updated 4 years ago
- [ACL‘20] Highway Transformer: A Gated Transformer.☆33Dec 5, 2021Updated 4 years ago
- Implementation of a Systolic Array based sorting engine on an FPGA using Verilog☆11May 11, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.☆13Jun 1, 2023Updated 2 years ago
- datetime模块的C语言实现,《奔跑吧,Python君》系列相关代码☆10Apr 30, 2023Updated 2 years ago
- Pytorch implementations of GMM - HMM☆11Dec 28, 2020Updated 5 years ago
- dracut module using vdfuse to loop mount☆11Mar 21, 2021Updated 5 years ago
- A numpy deep learning framework☆19Feb 11, 2022Updated 4 years ago
- A new benchmark of 118 ICPC problems for evaluating LLM reasoning in competitive coding, featuring realistic ICPC competition scenario, r…☆16May 18, 2025Updated 10 months ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model☆59Aug 3, 2020Updated 5 years ago
- Spectral RNNs with adaptive window learning in TensorFlow, ICANN 2020.☆10Sep 20, 2021Updated 4 years ago
- ☆11Nov 27, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- AAAI 2024, "Working Memory Capacity of ChatGPT: An Empirical Study".☆15Feb 10, 2025Updated last year
- 基于 树莓派 的项目,天气实况、天气预报,实时温度、湿度、空气污染指数,自带中文语音播报,根据思科 EA 系列路由器,实现自动门禁功能。☆11Dec 24, 2015Updated 10 years ago
- 中国人民大学 YOJ 题库☆11Jun 9, 2022Updated 3 years ago
- SGQuant: Squeezing the Last Bit on Graph Neural Networks with Specialized Quantization☆11Aug 12, 2020Updated 5 years ago
- Implementation of the Winograd algorithm.☆24Nov 6, 2018Updated 7 years ago
- State-Regularized Recurrent Neural Networks☆11Sep 20, 2019Updated 6 years ago
- Convert Wolfram Mathematica notebooks to markdown files☆14Nov 14, 2017Updated 8 years ago