A PyTorch & Keras implementation and demo of Fastformer.
☆192Sep 22, 2022Updated 3 years ago
Alternatives and similar repositories for Fastformer
Users that are interested in Fastformer are comparing it to the libraries listed below.
- Unofficial PyTorch implementation of Fastformer based on the paper "Fastformer: Additive Attention Can Be All You Need".☆132Sep 6, 2021Updated 4 years ago
- ☆58May 12, 2022Updated 3 years ago
- 2020 MIND news recommendation first place solution☆94Mar 10, 2021Updated 5 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Jun 14, 2023Updated 2 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- [WWW'22] Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation☆22Apr 11, 2022Updated 3 years ago
- FairSeq repo with Apollo optimizer☆114Dec 20, 2023Updated 2 years ago
- A Julia IO type that facilitates width-limited printing☆12Mar 21, 2023Updated 3 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Open source data and code of the MGNM☆25Sep 15, 2022Updated 3 years ago
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning.☆15Jan 14, 2026Updated 2 months ago
- Deep neural network codes for ctr/cvr prediction task in ranking process implemented by Tensorflow (1.14/2.4.1 version), using tf.estimat…☆11Apr 21, 2021Updated 4 years ago
- Presentation for JuliaCon 2022 on precompilation☆15Aug 18, 2023Updated 2 years ago
- A tensorflow implementation of the Forward-Forward Algorithm from NeurIPS '22.☆10May 10, 2023Updated 2 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 2 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Jun 19, 2023Updated 2 years ago
- Punctuation restoration in ASR text☆33Jul 1, 2019Updated 6 years ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- ☆23Jul 17, 2023Updated 2 years ago
- ☆255Oct 4, 2022Updated 3 years ago
- 🗣️ NALP is a library that covers Natural Adversarial Language Processing.☆23Jan 1, 2026Updated 2 months ago
- zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago
- PyTorch library for fast transformer implementations☆1,763Mar 23, 2023Updated 3 years ago
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Oct 8, 2022Updated 3 years ago
- Source code of FedPrompt☆16May 4, 2022Updated 3 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- Cross Sentence Neural Machine Translation☆11Mar 26, 2018Updated 7 years ago
- Materials for ACL-2022 tutorial: A Gentle Introduction to Deep Nets and Opportunities for the Future☆17May 24, 2022Updated 3 years ago
- Code accompanying paper "Set Norm and Equivariant Skip Connections: Putting the Deep in Deep Sets."☆31Oct 11, 2022Updated 3 years ago
- Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation☆58Apr 28, 2024Updated last year
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821☆3,643Oct 16, 2024Updated last year
- ☆11Oct 11, 2023Updated 2 years ago
- ☆46Jan 8, 2021Updated 5 years ago
- Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture. Training an MDM using GPT with this repo!☆35Jun 23, 2025Updated 9 months ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago
- Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).☆17Nov 18, 2021Updated 4 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling☆30Feb 25, 2021Updated 5 years ago
- Implementation of Hinton's forward-forward (FF) algorithm in tensorflow - an alternative to back-propagation☆35Apr 3, 2023Updated 2 years ago
- Sequence modeling with Mega.☆303Jan 28, 2023Updated 3 years ago