Code for Explicit Sparse Transformer
☆60 · Jul 21, 2023 · Updated 2 years ago
Alternatives and similar repositories for Explicit-Sparse-Transformer
Users that are interested in Explicit-Sparse-Transformer are comparing it to the libraries listed below.
- Semi-MoreGAN: A Semi-supervised Image Mixture of Rain Removal Network ☆16 · Jul 1, 2025 · Updated 11 months ago
- Codes for paper "LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification" ☆16 · Oct 30, 2019 · Updated 6 years ago
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022 ☆16 · Jun 22, 2022 · Updated 3 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit ☆28 · Aug 19, 2022 · Updated 3 years ago
- Uncertainty Guided Multi-Scale Residual Learning-using a Cycle Spinning CNN for Single Image De-Raining ☆24 · Mar 17, 2022 · Updated 4 years ago
- Sparse Attention with Linear Units ☆20 · Apr 21, 2021 · Updated 5 years ago
- ☆13 · Nov 23, 2019 · Updated 6 years ago
- ☆10 · Jul 5, 2023 · Updated 2 years ago
- The entmax mapping and its loss, a family of sparse softmax alternatives. ☆474 · Jun 22, 2024 · Updated last year
- Implementation of RealFormer using pytorch ☆101 · Dec 27, 2020 · Updated 5 years ago
- Implement attention model to LSTM using TensorFlow ☆10 · Jul 3, 2018 · Updated 7 years ago
- Datasets, more results and implementation details of the paper "Frame-Consistent Recurrent Video Deraining with Dual-Level Flow" in CVPR-… ☆25 · May 3, 2021 · Updated 5 years ago
- Official repository for the paper "Stochastic Window Transformer for Image Restoration". ☆32 · Dec 7, 2022 · Updated 3 years ago
- ☆23 · Mar 18, 2024 · Updated 2 years ago
- The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020). ☆12 · May 14, 2020 · Updated 6 years ago
- Neural Text Generation with Unlikelihood Training ☆311 · Aug 31, 2021 · Updated 4 years ago
- Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring official code ☆11 · Jul 17, 2023 · Updated 2 years ago
- Spectral Attention Autoregressive Model (SAAM) ☆17 · Oct 27, 2022 · Updated 3 years ago
- Subgraph-augmented Path Embedding for Semantic User Search on Heterogeneous Social Network ☆13 · Feb 19, 2018 · Updated 8 years ago
- ☆14 · Jan 5, 2022 · Updated 4 years ago
- Contrastive evaluation of pronoun translation in neural machine translation ☆26 · Aug 22, 2019 · Updated 6 years ago
- Data and code used in our NAACL'19 paper "Selective Attention for Context-aware Neural Machine Translation" ☆30 · Apr 12, 2020 · Updated 6 years ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024) ☆25 · Jun 6, 2024 · Updated 2 years ago
- Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization. ☆10 · Sep 28, 2021 · Updated 4 years ago
- DeepFM: A Factorization-Machine based Neural Network for CTR Prediction / xDeepFM: Combining Explicit and Implicit Feature Interactions f… ☆15 · Oct 8, 2019 · Updated 6 years ago
- SXL: Spatially explicit learning of geographic processes with auxiliary tasks ☆15 · Nov 26, 2021 · Updated 4 years ago
- FLASHQuad_pytorch ☆68 · Apr 1, 2022 · Updated 4 years ago
- Code for "Understanding and Improving Layer Normalization" ☆46 · Dec 8, 2019 · Updated 6 years ago
- Unpaired Image Captioning ☆36 · Mar 25, 2021 · Updated 5 years ago
- Implementation of "Removing Raindrops and Rain Streaks in One Go" ☆41 · Jul 19, 2021 · Updated 4 years ago
- Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025 ☆17 · Dec 25, 2025 · Updated 5 months ago
- Jump to better conclusions: SCAN both left and right ☆11 · Jan 24, 2019 · Updated 7 years ago
- Legal Judgment Prediction (LJP) with BiLSTM and Attention ☆13 · Jan 10, 2019 · Updated 7 years ago
- Code for the ACL'18 paper: A Neural Approach to Pun Generation ☆18 · Jan 13, 2020 · Updated 6 years ago
- A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral). ☆11 · Jul 18, 2022 · Updated 3 years ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers" ☆1,616 · Aug 12, 2020 · Updated 5 years ago
- Understanding the Difficulty of Training Transformers ☆332 · May 31, 2022 · Updated 4 years ago
- Implementation for paper "A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation" ☆24 · Mar 1, 2020 · Updated 6 years ago
- ☆17 · Mar 3, 2025 · Updated last year