My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT pretrained models.
☆44Dec 12, 2024Updated last year
Alternatives and similar repositories for Rethinking-attention
Users that are interested in Rethinking-attention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Mar 5, 2026Updated 2 months ago
- Image to LaTeX pytorch model☆14Jul 6, 2023Updated 2 years ago
- ☆31Apr 17, 2023Updated 3 years ago
- ☆12Jun 26, 2023Updated 2 years ago
- ☆15Apr 8, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆29May 4, 2024Updated 2 years ago
- Atomic Structure Generation from Reconstructing Structural Fingerprints☆15Oct 6, 2022Updated 3 years ago
- This repository contains the entire pipline (including data preprocessing, training, testing, evaluation and visualization) for the Shear…☆10Dec 3, 2019Updated 6 years ago
- [ICCV2025] CoopTrack: Exploring End-to-End Learning for Efficient Cooperative Sequential Perception☆56Sep 2, 2025Updated 8 months ago
- pytorch实现MOAT,可以在ImageNet或自己的数据集上训练,支持apex混合精度,各种图像增强技术☆34Oct 21, 2022Updated 3 years ago
- This is the source code of CubicGAN generating cubic crystal structures using improved WGAN.☆10Jun 6, 2022Updated 3 years ago
- [CVPR2024] Multi-agent Collaborative Perception via Motion-aware Robust Communication Network☆30Mar 23, 2024Updated 2 years ago
- ☆11Mar 8, 2024Updated 2 years ago
- A PyTorch implementation of the shearlet transform.☆13Oct 9, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for AAAI 2023 accepted paper titled "Knowledge-Bridged Causal Interaction Network for Causal Emotion Entailment"☆14May 6, 2023Updated 3 years ago
- ICPR2022: Dynamic Data Augmentation with Gating Networks for Time Series Recognition☆11Jul 28, 2022Updated 3 years ago
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆10Mar 13, 2023Updated 3 years ago
- Python image tiling library for image processing, object detection, etc.☆12Jul 25, 2024Updated last year
- Replicated paper: END-TO-END TRAINED CNN ENCODER-DECODER NETWORKS FOR IMAGE STEGANOGRAPHY☆10Dec 19, 2019Updated 6 years ago
- ☆13Mar 28, 2024Updated 2 years ago
- Behavior-Contextualized Item Preference Network for Multi-Behavior Recommendation☆16Nov 8, 2024Updated last year
- [IJCV 2023] FlowNAS: Neural Architecture Search for Optical Flow Estimation☆15Feb 21, 2024Updated 2 years ago
- The repo for reproducing the main results in TSMixer: An all-MLP Architecture for Time Series Forecasting.☆10Jun 15, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- An implementation of LazyLLM token pruning for LLaMa 2 model family.☆13Jan 6, 2025Updated last year
- [IJCAI'2023] "DSL: Denoised Self-Augmented Learning for Social Recommendation"☆32Aug 1, 2024Updated last year
- CNN+KAN architecture on MNIST (Val Acc: 96%)☆13May 4, 2024Updated 2 years ago
- Improving Recommendation Fairness via Data Augmentation-WWW23☆15Jun 6, 2023Updated 2 years ago
- Multistep univariate time series forecasting using Gated Recurrent Unit.☆16Jan 18, 2020Updated 6 years ago
- ☆16Nov 16, 2025Updated 5 months ago
- Unofficial implementation of paper : Exploring the Space of Key-Value-Query Models with Intention☆12May 24, 2023Updated 2 years ago
- Time Series Representation Models☆13Jul 17, 2025Updated 9 months ago
- This repository is an official PyTorch implementation of our paper "Feature Distillation Interaction Weighting Network for Lightweight Im…☆13May 6, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- repository for "Adaptive Disentangled Transformer for Sequential Recommendation"☆15Jun 6, 2023Updated 2 years ago
- LIVW-Localization: A Multi-modal Information Fused Vehicle Localization method for Complex, Large-Scale and GNSS-Denied Environments.☆14Jan 19, 2026Updated 3 months ago
- DAWN: Direction-aware Attention Wavelet Network for Image Deraining☆11Jan 7, 2024Updated 2 years ago
- [WSDM 2024 Oral] This is our Pytorch implementation for the paper: "Intent Contrastive Learning with Cross Subsequences for Sequential Re…☆41Jan 7, 2024Updated 2 years ago
- ☆13May 12, 2025Updated 11 months ago
- Adaptive Hardness Negative Sampling for Collaborative Filtering, AAAI2024☆12Dec 13, 2023Updated 2 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago