My implementation of the original Transformer model (Vaswani et al.). I've also included a playground.py file for visualizing concepts that can otherwise seem hard. IWSLT pretrained models are currently included.
☆44 · Dec 12, 2024 · Updated last year
Alternatives and similar repositories for Rethinking-attention
Users who are interested in Rethinking-attention are comparing it to the libraries listed below.
- ☆21 · Feb 23, 2023 · Updated 3 years ago
- ☆29 · Apr 17, 2023 · Updated 2 years ago
- Automatically extract executable programs from pruned mechanistic circuits, extending OpenAI's Sparse Circuits ☆64 · Nov 23, 2025 · Updated 3 months ago
- The code for WWW2024 paper "Rethinking Cross-Domain Sequential Recommendation under Open-World Assumptions". ☆35 · Aug 12, 2024 · Updated last year
- [IJCAI'2023] "DSL: Denoised Self-Augmented Learning for Social Recommendation" ☆33 · Aug 1, 2024 · Updated last year
- ☆10 · Aug 9, 2023 · Updated 2 years ago
- The official implementation of "Unlocking the Potential of Unlabeled Data in Semi-Supervised Domain Generalization" (CVPR 2025) ☆14 · Nov 20, 2025 · Updated 3 months ago
- ☆13 · Mar 7, 2025 · Updated 11 months ago
- [WSDM 2024 Oral] This is our Pytorch implementation for the paper: "Intent Contrastive Learning with Cross Subsequences for Sequential Re… ☆39 · Jan 7, 2024 · Updated 2 years ago
- LMTuner: Make the LLM Better for Everyone ☆38 · Sep 21, 2023 · Updated 2 years ago
- [ICCV 2023] HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative perception with vision transformer ☆39 · Jul 15, 2024 · Updated last year
- Direct transcription of an optimal control problem and resolution ☆12 · Updated this week
- Spark projects. Learning book "Machine Learning with Spark" ☆10 · Jun 3, 2017 · Updated 8 years ago
- A scalable MPI library for computing fast Fourier transforms in python. ☆11 · Sep 11, 2025 · Updated 5 months ago
- The code will come soon. ☆15 · Sep 12, 2025 · Updated 5 months ago
- Python platform for parallel Surrogate-Based Optimization ☆12 · Nov 27, 2024 · Updated last year
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models. ☆25 · Oct 20, 2025 · Updated 4 months ago
- Matlab codes for PAT image reconstruction from subsampled data based on a novel regularisation term (Hessian Schatten-norm of the filtere… ☆10 · Aug 21, 2019 · Updated 6 years ago
- ☆21 · Updated this week
- Cordova plugin for jitsi meet react native sdk ☆10 · Jun 7, 2019 · Updated 6 years ago
- ☆24 · Feb 18, 2026 · Updated last week
- My blogs and code for machine learning. http://cnblogs.com/pinard ☆13 · Jul 12, 2019 · Updated 6 years ago
- ☆12 · Jun 19, 2024 · Updated last year
- Study notes on the survey book "Large Language Models" (《大语言模型》) ☆13 · Aug 2, 2024 · Updated last year
- Machine Learning for Mathematical Formalization ☆11 · Jul 20, 2024 · Updated last year
- News classification & recommendation in Keras ☆13 · Jun 15, 2020 · Updated 5 years ago
- The code of AAAI'24 paper GLRec: Exploring Large Language Model for Graph Data Understanding in Online Job Recommendations ☆38 · Dec 21, 2023 · Updated 2 years ago
- Automated bottleneck detection and solution orchestration ☆19 · Feb 24, 2026 · Updated last week
- Given a Substack newsletter, save the contents into an sqlite db and format it as an epub ☆13 · Jan 11, 2024 · Updated 2 years ago
- Provides really comfortable generation of phpDocumentor doc blocks for PHP4 & 5. ☆25 · Aug 3, 2019 · Updated 6 years ago
- DAWN: Direction-aware Attention Wavelet Network for Image Deraining ☆11 · Jan 7, 2024 · Updated 2 years ago
- A dataset for Chinese open-domain information extraction ☆14 · Nov 15, 2021 · Updated 4 years ago
- Manual Baseline Models ☆10 · Nov 7, 2024 · Updated last year
- A counterfactual collaborative session-based recommender system. WWW'23. ☆10 · Nov 10, 2023 · Updated 2 years ago
- [WWW '24] UnifiedSSR: A Unified Framework of Sequential Search and Recommendation ☆12 · Feb 16, 2024 · Updated 2 years ago
- Brute forces an md5 string ☆10 · Nov 25, 2020 · Updated 5 years ago
- This repository is the official implementation of our paper Robust Diffusion Model-Generated Image Detection with CLIP, accepted by MIPR … ☆10 · Jun 13, 2024 · Updated last year
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization. ☆12 · Nov 4, 2025 · Updated 3 months ago
- Node.js server to receive events from a Janus server. ☆10 · Sep 17, 2024 · Updated last year