code for Explicit Sparse Transformer
☆61Jul 21, 2023Updated 2 years ago
Alternatives and similar repositories for Explicit-Sparse-Transformer
Users that are interested in Explicit-Sparse-Transformer are comparing it to the libraries listed below
- Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)☆14Apr 16, 2019Updated 6 years ago
- Semi-MoreGAN: A Semi-supervised Image Mixture of Rain Removal Network☆16Jul 1, 2025Updated 8 months ago
- Code associated with the paper **SkipBERT: Efficient Inference with Shallow Layer Skipping**, at ACL 2022☆16Jun 22, 2022Updated 3 years ago
- Codes for "Understanding and Improving Transformer From a Multi-Particle Dynamic System Point of View"☆147Jun 10, 2019Updated 6 years ago
- Tool for Evaluating Adversarial Perturbations on Text☆61Feb 27, 2022Updated 4 years ago
- ☆13Nov 23, 2019Updated 6 years ago
- ☆10Jul 5, 2023Updated 2 years ago
- The entmax mapping and its loss, a family of sparse softmax alternatives.☆464Jun 22, 2024Updated last year
- Implementation of RealFormer using pytorch☆101Dec 27, 2020Updated 5 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆34Jun 11, 2025Updated 9 months ago
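- Visualization of the features extracted by each layer of a convolutional neural network, using t-SNE for dimensionality reduction☆22Dec 13, 2021Updated 4 years ago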
- Subgraph-augmented Path Embedding for Semantic User Search on Heterogeneous Social Network☆13Feb 19, 2018Updated 8 years ago
- ☆14Jan 5, 2022Updated 4 years ago
- This repo contains all the codes for SEScore implementation☆15Mar 3, 2025Updated last year
- Contrastive evaluation of pronoun translation in neural machine translation☆26Aug 22, 2019Updated 6 years ago
- Data and code used in our NAACL'19 paper "Selective Attention for Context-aware Neural Machine Translation"☆30Apr 12, 2020Updated 5 years ago
- Cross-modal Coherence Modeling for Caption Generation☆11Jul 24, 2020Updated 5 years ago
- ☆11Dec 27, 2022Updated 3 years ago
- DeepFM: A Factorization-Machine based Neural Network for CTR Prediction / xDeepFM: Combining Explicit and Implicit Feature Interactions f…☆15Oct 8, 2019Updated 6 years ago
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…☆69Sep 19, 2021Updated 4 years ago
- Unofficial reimplementation of Dynamic Fusion with Intra- and Inter-modality Attention Flow for Visual Question Answering☆18Oct 30, 2019Updated 6 years ago
- ☆17Jul 6, 2023Updated 2 years ago
- FLASHQuad_pytorch☆68Apr 1, 2022Updated 3 years ago
- Unpaired Image Captioning☆36Mar 25, 2021Updated 4 years ago
- NLSTM Nested LSTM in Pytorch☆17Apr 4, 2018Updated 7 years ago
- Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025☆15Dec 25, 2025Updated 2 months ago
- Jump to better conclusions: SCAN both left and right☆11Jan 24, 2019Updated 7 years ago
- Code for the ACL'18 paper: A Neural Approach to Pun Generation☆18Jan 13, 2020Updated 6 years ago
- Legal Judgment Prediction (LJP) with BiLSTM and Attention☆13Jan 10, 2019Updated 7 years ago
- Understanding the Difficulty of Training Transformers☆332May 31, 2022Updated 3 years ago
- code for RecSys'18 paper: "Exploring Recommendations Under User-Controlled Data Filtering"☆23Oct 16, 2018Updated 7 years ago
- ☆36Oct 3, 2018Updated 7 years ago
- Codes for reproducing the adversarial attacks on image captioning systems in “Attacking Visual Language Grounding with Adversarial Examp…☆39Feb 18, 2022Updated 4 years ago
- Course repository for the Fall 2021 COMP790 course "Information Theory" at UNC☆11Aug 24, 2021Updated 4 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…☆30Mar 17, 2020Updated 6 years ago
- ☆33Jul 25, 2024Updated last year
- ☆11Jan 2, 2022Updated 4 years ago
- Deep Networks Grok All the Time and Here is Why☆38May 18, 2024Updated last year
- lstm with attention to deal with qa☆34Apr 15, 2017Updated 8 years ago