lancopku/Explicit-Sparse-Transformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lancopku/Explicit-Sparse-Transformer)

lancopku / Explicit-Sparse-Transformer

code for Explicit Sparse Transformer

☆61

Alternatives and similar repositories for Explicit-Sparse-Transformer

Users that are interested in Explicit-Sparse-Transformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lancopku / SACT
View on GitHub
Code for the article "Automatic Temperature Control for Neural Machine Translation" (EMNLP 2018)
☆14Apr 16, 2019Updated 7 years ago
ChangfengYu-Hust / UDGNet
View on GitHub
Unsupervised Image Deraining: Optimization Model Driven Deep CNN
☆16Apr 25, 2022Updated 4 years ago
cuiyixin555 / SSID-KD
View on GitHub
The Repo of Semi-supervised Single Image Deraining
☆12Nov 5, 2023Updated 2 years ago
lancopku / MUKI
View on GitHub
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
☆19Mar 16, 2023Updated 3 years ago
LeeSureman / Sequence-Labeling-Early-Exit
View on GitHub
Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit
☆28Aug 19, 2022Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
wubaoyuan / adversarial-attack-to-caption
View on GitHub
☆13Nov 23, 2019Updated 6 years ago
fengjun321 / Transformer_count
View on GitHub
使用多头注意力机制实现数字预测
☆10May 10, 2022Updated 4 years ago
windg / AttentionLSTM
View on GitHub
Implement attention model to LSTM using TensorFlow
☆10Jul 3, 2018Updated 8 years ago
lancopku / RMSC
View on GitHub
Data and code for paper "Review-Driven Multi-Label Music Style Classification by Exploiting Style Correlations"
☆17Jun 30, 2019Updated 7 years ago
lancopku / Augmented_Data_for_FST
View on GitHub
The augmented data of the paper "Parallel Data Augmentation for Formality Style Transfer" (ACL 2020).
☆12May 14, 2020Updated 6 years ago
maroo-sky / FSD
View on GitHub
Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring official code
☆11Jul 17, 2023Updated 3 years ago
divelab / MFA
View on GitHub
Official code repository of paper Equivariance via Minimal Frame Averaging for More Symmetries and Efficiency.
☆20Jan 18, 2025Updated last year
ZurichNLP / ContraPro
View on GitHub
Contrastive evaluation of pronoun translation in neural machine translation
☆26Aug 22, 2019Updated 6 years ago
karthiTox / deepnet.js
View on GitHub
Auto-differentiation library for javascript
☆12Mar 4, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
shvdiwnkozbw / Video-Representation-via-Multi-level-Optimization
View on GitHub
Code for Enhancing Self-supervised Video Representation Learning via Multi-level Feature Optimization.
☆10Sep 28, 2021Updated 4 years ago
xu1998hz / SEScore
View on GitHub
This repo contains all the codes for SEScore implementation
☆15Mar 3, 2025Updated last year
JinyongJeong / DeeplabV3_Apolloscape_and_CityScape
View on GitHub
☆11Dec 27, 2022Updated 3 years ago
malihealikhani / Cross-modal_Coherence_Modeling
View on GitHub
Cross-modal Coherence Modeling for Caption Generation
☆11Jul 24, 2020Updated 6 years ago
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
TeddyChe2003 / XJTU-SY-bearings-dataset-preparation
View on GitHub
☆15Aug 2, 2025Updated 11 months ago
ag1988 / top_k_attention
View on GitHub
The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…
☆70Sep 19, 2021Updated 4 years ago
lancopku / AdaNorm
View on GitHub
Code for "Understanding and Improving Layer Normalization"
☆46Dec 8, 2019Updated 6 years ago
TsinghuaC3I / Fourier-Position-Embedding
View on GitHub
[ICML 2025] Fourier Position Embedding: Enhancing Attention’s Periodic Extension for Length Generalization
☆118Jun 2, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
RobertCsordas / ndr
View on GitHub
The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".
☆34Jun 11, 2025Updated last year
abduallahmohamed / Nested-LSTM-NLSTM-Pytorch
View on GitHub
NLSTM Nested LSTM in Pytorch
☆17Apr 4, 2018Updated 8 years ago
facebookarchive / NACS
View on GitHub
Jump to better conclusions: SCAN both left and right
☆11Jan 24, 2019Updated 7 years ago
ok858ok / CP-ViT
View on GitHub
Code for "CP-ViT: Cascade Vision Transformer Pruning via Progressive Sparsity Prediction" on CIFAR-10/100.
☆14Dec 10, 2021Updated 4 years ago
dalinvip / Legal_Judgment_Prediction_BiLSTM_ATT
View on GitHub
Legal Juegment Prediction (LJP) with BiLSTM and Attention
☆13Jan 10, 2019Updated 7 years ago
LiyuanLucasLiu / Transformer-Clinic
View on GitHub
Understanding the Difficulty of Training Transformers
☆332May 31, 2022Updated 4 years ago
ezeli / InSentiCap_model
View on GitHub
A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral).
☆11Jul 18, 2022Updated 4 years ago
JianGuanTHU / CommonsenseStoryGen
View on GitHub
Implementation for paper "A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation"
☆24Mar 1, 2020Updated 6 years ago
craffel / comp790-information-theory-fall-2021
View on GitHub
Course repository for the Fall 2021 COMP790 course "Information Theory" at UNC
☆11Aug 24, 2021Updated 4 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
lucidrains / AoA-pytorch
View on GitHub
A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering
☆43Nov 8, 2020Updated 5 years ago
hrlics / HoPE
View on GitHub
[NeurIPS 2025] HoPE: Hybrid of Position Embedding for Long Context Vision-Language Models
☆29Feb 19, 2026Updated 5 months ago
dykang / PASTEL
View on GitHub
Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…
☆30Mar 17, 2020Updated 6 years ago
person-lee / QA_LSTM_ATTENTION
View on GitHub
lstm with attention to deal with qa
☆34Apr 15, 2017Updated 9 years ago
ictnlp / NMLA-NAT
View on GitHub
Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"
☆20Nov 16, 2022Updated 3 years ago
swiseman / neighbor-splicing
View on GitHub
☆11Jan 2, 2022Updated 4 years ago
JianmingGuo / gateway-IDS
View on GitHub
In-vehicle gateway intrusion detection based on ARM
☆13Jan 26, 2021Updated 5 years ago