A PyTorch & Keras implementation and demo of Fastformer.
☆192Sep 22, 2022Updated 3 years ago
Alternatives and similar repositories for Fastformer
Users who are interested in Fastformer are comparing it to the libraries listed below.
- Unofficial PyTorch implementation of Fastformer, based on the paper "Fastformer: Additive Attention Can Be All You Need".☆131Sep 6, 2021Updated 4 years ago
- A new NRMS model for the Microsoft News Dataset (MIND)☆21Jan 19, 2024Updated 2 years ago
- Implementation of the Remixer Block from the Remixer paper, in PyTorch☆36Sep 27, 2021Updated 4 years ago
- Depict the GPU memory footprint during DNN training in PyTorch☆11Nov 17, 2022Updated 3 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Jun 14, 2023Updated 2 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- [WWW'22] Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation☆22Apr 11, 2022Updated 4 years ago
- ☆120Apr 1, 2026Updated last month
- FairSeq repo with Apollo optimizer☆113Dec 20, 2023Updated 2 years ago
- A Julia IO type that facilitates width-limited printing☆12Mar 21, 2023Updated 3 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Faster, more accurate and entirely open source method for predicting contacts in proteins☆12May 21, 2018Updated 7 years ago
- The dataset for paper "Why Do We Click: Visual Impression-aware News Recommendation", ACM MM 2021☆15Feb 24, 2022Updated 4 years ago
- Implementation of Hierarchical Transformer Memory (HTM) for PyTorch☆76Sep 15, 2021Updated 4 years ago
- Presentation for JuliaCon 2022 on precompilation☆15Aug 18, 2023Updated 2 years ago
- A Python package for statistics: calculate WOE, IV, KS, AUC, ROC, PSI, and plot.☆18Jul 17, 2023Updated 2 years ago
- A TensorFlow implementation of the Forward-Forward Algorithm from NeurIPS '22.☆10May 10, 2023Updated 2 years ago
- ☆17Apr 6, 2023Updated 3 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 2 years ago
- Source code for "N-ary Constituent Tree Parsing with Recursive Semi-Markov Model" published at ACL 2021☆10May 27, 2021Updated 4 years ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in PyTorch☆120Aug 4, 2021Updated 4 years ago
- ☆45Oct 14, 2021Updated 4 years ago
- Punctuation restoration in ASR text☆33Jul 1, 2019Updated 6 years ago
- ☆257Oct 4, 2022Updated 3 years ago
- Zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago
- Implementations of some methods in news recommendation.☆261Oct 8, 2022Updated 3 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆415Jul 25, 2024Updated last year
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Oct 8, 2022Updated 3 years ago
- Source code of FedPrompt☆16May 4, 2022Updated 4 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- Materials for ACL-2022 tutorial: A Gentle Introduction to Deep Nets and Opportunities for the Future☆17May 24, 2022Updated 3 years ago
- Rationales for Sequential Predictions☆40Mar 10, 2022Updated 4 years ago
- PyTorch-Lightning Library for Neural News Recommendation☆61Jun 6, 2025Updated 10 months ago
- Transformer based on a variant of attention that has linear complexity with respect to sequence length☆829May 5, 2024Updated last year
- Implementation of Multistream Transformers in PyTorch☆54Jul 31, 2021Updated 4 years ago
- ☆47Jan 8, 2021Updated 5 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago