Sparse Attention with Linear Units
☆20Apr 21, 2021Updated 5 years ago
Alternatives and similar repositories for rectified-linear-attention
Users that are interested in rectified-linear-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning☆20Feb 4, 2022Updated 4 years ago
- Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?☆17Jan 6, 2021Updated 5 years ago
- This is a sample implementation of "Robust Graph Convolutional Networks Against Adversarial Attacks", KDD 2019.☆10Dec 8, 2020Updated 5 years ago
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆15Jul 4, 2025Updated 10 months ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆28Aug 13, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Research project for task-oriented dialogue system with jointly training multi-intent classification and slot filling☆10Sep 11, 2023Updated 2 years ago
- ☆12Apr 1, 2025Updated last year
- ☆13Mar 30, 2022Updated 4 years ago
- RADLADS training code☆43May 7, 2025Updated last year
- Custom Keras layers for implementing multi-dimensional recurrent neural networks (MDRNNs) described in Alex Graves's paper https://arxiv.…☆10Apr 27, 2020Updated 6 years ago
- Fixed version of https://github.com/tomguluson92/PRNet_PyTorch☆10Mar 30, 2020Updated 6 years ago
- EMNLP 2021: A Label-Aware BERT Attention Network for Zero-Shot Multi-Intent Detection in Spoken Language Understanding☆10Apr 8, 2022Updated 4 years ago
- Gradually Updated Neural Networks for Large-Scale Image Recognition at ICML 2018☆10Jun 25, 2018Updated 7 years ago
- [ECCV 2024] EgoPoseFormer: A Simple Baseline for Stereo Egocentric 3D Human Pose Estimation☆24Mar 6, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The repo for ACL2021 findings paper - Don't Miss the Labels: Label-semantic Argumented Meta-Learner for Few-Shot Text Classification☆15Mar 24, 2022Updated 4 years ago
- A Chrome Plugin to enhance your LeetCoding Experience☆11Jan 31, 2021Updated 5 years ago
- This is the official Python implementation repository for a paper entitled "Resolving Camera Position for a Practical Application of Gaz…☆12Jan 11, 2022Updated 4 years ago
- Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".☆16Dec 7, 2021Updated 4 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆34Oct 30, 2020Updated 5 years ago
- 基于预训练BERT和GAT的剧本角色情绪识别研究☆13Dec 15, 2023Updated 2 years ago
- ☆11Jun 28, 2020Updated 5 years ago
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020)☆43Oct 20, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ECCV 2020] Learning to Separate: Detecting Heavily-Occluded Objects in Urban Scenes☆12Dec 11, 2020Updated 5 years ago
- Code of our IJCAI2021 paper: "Learning Class-Transductive Intent Representations for Zero-shot Intent Detection"☆15Sep 10, 2021Updated 4 years ago
- 🚀🚀🚀 [Journal Pre-print] Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review☆15Oct 13, 2025Updated 7 months ago
- ☆33Apr 12, 2021Updated 5 years ago
- [ACL‘20] Highway Transformer: A Gated Transformer.☆33Dec 5, 2021Updated 4 years ago
- [ECCV 2022] ST-P3, an end-to-end vision-based autonomous driving framework via spatial-temporal feature learning.☆13Jun 1, 2023Updated 2 years ago
- NLP 相关岗位 笔试面试资源汇总☆16Jun 17, 2021Updated 4 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆114Jun 10, 2021Updated 4 years ago
- LGEB: Benchmark of Language Generation Evaluation☆16Oct 21, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Pytorch implementations of GMM - HMM☆10Dec 28, 2020Updated 5 years ago
- A new benchmark of 118 ICPC problems for evaluating LLM reasoning in competitive coding, featuring realistic ICPC competition scenario, r…☆17May 18, 2025Updated last year
- A numpy deep learning framework☆19Feb 11, 2022Updated 4 years ago
- dracut module using vdfuse to loop mount☆11Mar 21, 2021Updated 5 years ago
- ☆16May 22, 2023Updated 3 years ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model☆59Aug 3, 2020Updated 5 years ago
- AAAI 2024, "Working Memory Capacity of ChatGPT: An Empirical Study".☆15Feb 10, 2025Updated last year