A pytorch &keras implementation and demo of Fastformer.
☆192Sep 22, 2022Updated 3 years ago
Alternatives and similar repositories for Fastformer
Users that are interested in Fastformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Fast Transformer in Pytorch☆176Aug 26, 2021Updated 4 years ago
- Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."☆131Sep 6, 2021Updated 4 years ago
- ☆58May 12, 2022Updated 4 years ago
- A new NRMS model for the MIcrosoft News Dataset(MIND)☆22Jan 19, 2024Updated 2 years ago
- 2020 MIND news recomendation first place solution☆93Mar 10, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆32Jun 14, 2023Updated 3 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 5 years ago
- [WWW'22] Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation☆22Apr 11, 2022Updated 4 years ago
- ☆121Updated this week
- FairSeq repo with Apollo optimizer☆113Dec 20, 2023Updated 2 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Faster, more accurate and entirely open source method for predicting contacts in proteins☆12May 21, 2018Updated 8 years ago
- The dataset for paper "Why Do We Click: Visual Impression-aware News Recommendation", ACM MM 2021☆15Feb 24, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Open source data and code of the MGNM☆25Sep 15, 2022Updated 3 years ago
- Deep neural network codes for ctr/cvr prediction task in ranking process implemented by Tensorflow (1.14/2.4.1 version), using tf.estimat…☆11Apr 21, 2021Updated 5 years ago
- a python package for stat,caculate woe,iv, ks,auc,roc,psi and plot.☆18Jul 17, 2023Updated 2 years ago
- A tensorflow implementation of the Forward-Forward Algorithm from NeurIPS '22.☆10May 10, 2023Updated 3 years ago
- ☆18Apr 6, 2023Updated 3 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 3 years ago
- Source code for "N-ary Constituent Tree Parsing with Recursive Semi-Markov Model" published at ACL 2021☆10May 27, 2021Updated 5 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Jun 19, 2023Updated 3 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆120Aug 4, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A method to mine beyond-pairwise relationships using Min-Hashing for large-scale pattern discovery☆27Oct 10, 2021Updated 4 years ago
- Resources for the paper "NPA: News Recommendation with Personalized Attention"☆48Sep 22, 2022Updated 3 years ago
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆30Oct 29, 2025Updated 8 months ago
- transformers go brrr...☆148Feb 15, 2022Updated 4 years ago
- ☆257Oct 4, 2022Updated 3 years ago
- zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago
- Implementations of some methods in news recommendation.☆259Oct 8, 2022Updated 3 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆415Jul 25, 2024Updated last year
- Pytorch library for fast transformer implementations☆1,772Mar 23, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Oct 8, 2022Updated 3 years ago
- Source code of FedPrompt☆16May 4, 2022Updated 4 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- Cross Sentence Neural Machine Translation☆10Mar 26, 2018Updated 8 years ago
- Materials for ACL-2022 tutorial: A Gentle Introduction to Deep Nets and Opportunities for the Future☆17May 24, 2022Updated 4 years ago
- Rationales for Sequential Predictions☆39Mar 10, 2022Updated 4 years ago
- Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation☆58Apr 28, 2024Updated 2 years ago