code for Explicit Sparse Transformer
☆61Jul 21, 2023Updated 2 years ago
Alternatives and similar repositories for Explicit-Sparse-Transformer
Users that are interested in Explicit-Sparse-Transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)☆14Apr 16, 2019Updated 6 years ago
- Semi-MoreGAN: A Semi-supervised Image Mixture of Rain Removal Network☆16Jul 1, 2025Updated 9 months ago
- Codes for paper "LexicalAT: Lexical-Based Adversarial Reinforcement Training for Robust Sentiment Classification"☆16Oct 30, 2019Updated 6 years ago
- The Repo of Semi-supervised Single Image Deraining☆12Nov 5, 2023Updated 2 years ago
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022☆16Jun 22, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Unsupervised Image Deraining: Optimization Model Driven Deep CNN☆16Apr 25, 2022Updated 3 years ago
- Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"☆147Jun 10, 2019Updated 6 years ago
- Tool for Evaluating Adversarial Perturbations on Text☆61Feb 27, 2022Updated 4 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Aug 19, 2022Updated 3 years ago
- Sparse Attention with Linear Units☆20Apr 21, 2021Updated 4 years ago
- The entmax mapping and its loss, a family of sparse softmax alternatives.☆468Jun 22, 2024Updated last year
- Implementation of RealFormer using pytorch☆101Dec 27, 2020Updated 5 years ago
- Leveraging Local and Global Patterns for Self-Attention Networks☆12Jun 3, 2019Updated 6 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆34Jun 11, 2025Updated 10 months ago
- Datasets, more results and implementation details of the paper "Frame-Consistent Recurrent Video Deraining with Dual-Level Flow" in CVPR-…☆25May 3, 2021Updated 4 years ago
- Official repository for the paper "Stochastic Window Transformer for Image Restoration".☆31Dec 7, 2022Updated 3 years ago
- 对卷积神经网络提取的每一层特征用t-SNE进行降维可视化☆22Dec 13, 2021Updated 4 years ago
- Data and code for paper "Review-Driven Multi-Label Music Style Classification by Exploiting Style Correlations"☆17Jun 30, 2019Updated 6 years ago
- The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020).☆12May 14, 2020Updated 5 years ago
- Neural Text Generation with Unlikelihood Training☆311Aug 31, 2021Updated 4 years ago
- Subgraph-augmented Path Embedding for Semantic User Search on Heterogeneous Social Network☆13Feb 19, 2018Updated 8 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Contrastive evaluation of pronoun translation in neural machine translation☆26Aug 22, 2019Updated 6 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- ☆13May 19, 2021Updated 4 years ago
- Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.☆10Sep 28, 2021Updated 4 years ago
- DeepFM: A Factorization-Machine based Neural Network for CTR Prediction / xDeepFM: Combining Explicit and Implicit Feature Interactions f…☆15Oct 8, 2019Updated 6 years ago
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…☆69Sep 19, 2021Updated 4 years ago
- SXL: Spatially explicit learning of geographic processes with auxiliary tasks☆15Nov 26, 2021Updated 4 years ago
- Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering☆18Oct 30, 2019Updated 6 years ago
- Official implementation of paper "Vision Graph Prompting via Semantic Low-Rank Decomposition", ICML 2025☆16Dec 25, 2025Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- FLASHQuad_pytorch☆68Apr 1, 2022Updated 4 years ago
- Code for "Understanding and Improving Layer Normalization"☆46Dec 8, 2019Updated 6 years ago
- Home Repository for the CfRR website☆14Apr 2, 2026Updated last week
- Unpaired Image Captioning☆36Mar 25, 2021Updated 5 years ago
- NLSTM Nested LSTM in Pytorch☆17Apr 4, 2018Updated 8 years ago
- Implementation of "Removing Raindrops and Rain Streaks in One Go"☆41Jul 19, 2021Updated 4 years ago
- Legal Juegment Prediction (LJP) with BiLSTM and Attention☆13Jan 10, 2019Updated 7 years ago