A PyTorch & Keras implementation and demo of Fastformer.
☆192Sep 22, 2022Updated 3 years ago
Alternatives and similar repositories for Fastformer
Users that are interested in Fastformer are comparing it to the libraries listed below.
- A new NRMS model for the Microsoft News Dataset (MIND)☆22Jan 19, 2024Updated 2 years ago
- Official PyTorch implementation of "Attention-Free Keyword Spotting", Mashrur. M. Morshed & Ahmad Omar Ahsan, PML4DC @ ICLR 2022.☆15Nov 5, 2022Updated 3 years ago
- 2020 MIND news recommendation first place solution☆93Mar 10, 2021Updated 5 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Sep 27, 2021Updated 4 years ago
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆32Jun 14, 2023Updated 3 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 5 years ago
- [WWW'22] Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation☆22Apr 11, 2022Updated 4 years ago
- ☆121Jun 1, 2026Updated last week
- FairSeq repo with Apollo optimizer☆113Dec 20, 2023Updated 2 years ago
- A Julia IO type that facilitates width-limited printing☆12Mar 21, 2023Updated 3 years ago
- Open source data and code of the MGNM☆25Sep 15, 2022Updated 3 years ago
- Implementation of Hierarchical Transformer Memory (HTM) for Pytorch☆76Sep 15, 2021Updated 4 years ago
- Deep neural network codes for ctr/cvr prediction task in ranking process implemented by Tensorflow (1.14/2.4.1 version), using tf.estimat…☆11Apr 21, 2021Updated 5 years ago
- Presentation for JuliaCon 2022 on precompilation☆15Aug 18, 2023Updated 2 years ago
- A tensorflow implementation of the Forward-Forward Algorithm from NeurIPS '22.☆10May 10, 2023Updated 3 years ago
- ☆18Apr 6, 2023Updated 3 years ago
- Source code for "N-ary Constituent Tree Parsing with Recursive Semi-Markov Model" published at ACL 2021☆10May 27, 2021Updated 5 years ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- Implementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch☆120Aug 4, 2021Updated 4 years ago
- ☆23Jul 17, 2023Updated 2 years ago
- A method to mine beyond-pairwise relationships using Min-Hashing for large-scale pattern discovery☆27Oct 10, 2021Updated 4 years ago
- Punctuation restoration in ASR text☆33Jul 1, 2019Updated 6 years ago
- Resources for the paper "NPA: News Recommendation with Personalized Attention"☆48Sep 22, 2022Updated 3 years ago
- Code for MME-SID accepted to CIKM 2025 Full Research track.☆29Oct 29, 2025Updated 7 months ago
- ☆257Oct 4, 2022Updated 3 years ago
- 🗣️ NALP is a library that covers Natural Adversarial Language Processing.☆24Jan 1, 2026Updated 5 months ago
- zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆24Sep 21, 2025Updated 8 months ago
- Implementations of some methods in news recommendation.☆260Oct 8, 2022Updated 3 years ago
- Official PyTorch Repo for "ReZero is All You Need: Fast Convergence at Large Depth"☆415Jul 25, 2024Updated last year
- Pytorch library for fast transformer implementations☆1,771Mar 23, 2023Updated 3 years ago
- Code of CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping☆17Oct 8, 2022Updated 3 years ago
- Source code of FedPrompt☆16May 4, 2022Updated 4 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- Cross Sentence Neural Machine Translation☆11Mar 26, 2018Updated 8 years ago
- Materials for ACL-2022 tutorial: A Gentle Introduction to Deep Nets and Opportunities for the Future☆17May 24, 2022Updated 4 years ago
- Rationales for Sequential Predictions☆40Mar 10, 2022Updated 4 years ago
- Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation☆58Apr 28, 2024Updated 2 years ago