jaketae / realformer
PyTorch implementation of RealFormer: Transformer Likes Residual Attention
☆11Updated 4 years ago
Alternatives and similar repositories for realformer
Users that are interested in realformer are comparing it to the libraries listed below
Sorting:
- ☆17Updated 2 years ago
- ☆19Updated 3 years ago
- compare the performance of cross entropy, focal loss, and dice loss in solving the problem of data imbalance☆9Updated 3 years ago
- Second-Order Pooling for Graph Neural Networks☆16Updated 4 years ago
- Source code for "Distilling Knowledge From Graph Convolutional Networks", CVPR'20☆57Updated 2 years ago
- LowFER: Low-rank Bilinear Pooling for Link Prediction (ICML 2020)☆13Updated 2 years ago
- ☆14Updated 3 years ago
- AdaTask: A Task-Aware Adaptive Learning Rate Approach to Multi-Task Learning. AAAI, 2023.☆28Updated last year
- Implementation of Mogrifier LSTM in PyTorch☆35Updated 5 years ago
- code for Explicit Sparse Transformer☆62Updated last year
- ☆22Updated 2 years ago
- A collection of graph contrastive learning methods.☆18Updated 3 years ago
- Source code for the paper Residual Enhanced Multi-Hypergraph Neural Network (ICIP 2021).☆18Updated 3 years ago
- ☆11Updated last year
- WSDM2022 Challenge - Large scale temporal graph link prediction☆38Updated 3 years ago
- The code of "Inductive Unsupervised Domain Adaptation for Few-Shot Classification via Clustering", ECML-PKDD 2020.☆21Updated 2 years ago
- MetaBalance algorithm for multi-task learning☆58Updated 3 years ago
- My implementation of the gMLP model from the paper "Pay Attention to MLPs".☆25Updated 3 years ago
- Official implementation of "Non-Local Graph Neural Networks" [TPAMI]☆22Updated 2 years ago
- Official code for ICLR 2023 paper "ContraNorm: A Contrastive Learning Perspective on Oversmoothing and Beyond "☆35Updated 2 years ago
- Energy-based Out-of-distribution Detection☆15Updated 4 years ago
- AAAI 2022 papers with code☆36Updated 3 years ago
- PyTorch Implementation of the Multi-gate Mixture-of-Experts with Exclusivity (MMoEEx)☆32Updated 3 years ago
- Code for Unsupervised Multi-Target Domain Adaptation: An Information Theoretic Approach☆14Updated 4 years ago
- ruizhang-ai / HIRS-Detecting_Arbitrary_Order_Beneficial_Feature_Interactions_for_Recommender_SystemsDetecting Arbitrary Order Beneficial Feature Interactions for Recommender Systems, KDD 2022☆24Updated 2 years ago
- ☆42Updated 4 years ago
- Implementation of the paper NAST: Non-Autoregressive Spatial-Temporal Transformer for Time Series Forecasting.☆76Updated 4 years ago
- ☆35Updated 2 years ago
- AutoGCL: Automated Graph Contrastive Learning via Learnable View Generators (AAAI 2022)☆46Updated 3 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Updated 4 years ago