A PyTorch & Keras implementation and demo of Fastformer.
☆192 Sep 22, 2022 Updated 3 years ago
Alternatives and similar repositories for Fastformer
Users that are interested in Fastformer are comparing it to the libraries listed below
- ☆58 May 12, 2022 Updated 3 years ago
- Depict the GPU memory footprint during DNN training in PyTorch ☆11 Nov 17, 2022 Updated 3 years ago
- 2020 MIND news recommendation first place solution ☆94 Mar 10, 2021 Updated 4 years ago
- A new NRMS model for the Microsoft News Dataset (MIND) ☆19 Jan 19, 2024 Updated 2 years ago
- FairSeq repo with Apollo optimizer ☆114 Dec 20, 2023 Updated 2 years ago
- Deep neural network code for CTR/CVR prediction tasks in the ranking process, implemented in TensorFlow (1.14/2.4.1 version), using tf.estimat… ☆11 Apr 21, 2021 Updated 4 years ago
- ☆14 Mar 20, 2025 Updated 11 months ago
- Rationales for Sequential Predictions ☆40 Mar 10, 2022 Updated 3 years ago
- [WWW'22] Deep Interest Highlight Network for Click-Through Rate Prediction in Trigger-Induced Recommendation ☆22 Apr 11, 2022 Updated 3 years ago
- Large-scale topic discovery with Sampled-MinHashing ☆10 Jul 3, 2019 Updated 6 years ago
- Cross Sentence Neural Machine Translation ☆11 Mar 26, 2018 Updated 7 years ago
- Source code for "N-ary Constituent Tree Parsing with Recursive Semi-Markov Model" published at ACL 2021 ☆10 May 27, 2021 Updated 4 years ago
- Code for MME-SID accepted to CIKM 2025 Full Research track. ☆27 Oct 29, 2025 Updated 4 months ago
- The implementation of LLMTreeRec ☆14 Dec 9, 2024 Updated last year
- A Julia IO type that facilitates width-limited printing ☆12 Mar 21, 2023 Updated 2 years ago
- ☆254 Oct 4, 2022 Updated 3 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees ☆24 Jun 19, 2023 Updated 2 years ago
- Sequence modeling with Mega. ☆303 Jan 28, 2023 Updated 3 years ago
- Cross-domain data integration for named entity disambiguation in biomedical text ☆11 Dec 15, 2021 Updated 4 years ago
- Implementation of Nested Named Entity Recognition using Flair ☆24 Oct 29, 2021 Updated 4 years ago
- TVMScript kernel for deformable attention ☆25 Dec 15, 2021 Updated 4 years ago
- Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference" ☆48 May 25, 2022 Updated 3 years ago
- Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling ☆30 Feb 25, 2021 Updated 5 years ago
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP ☆16 Apr 17, 2025 Updated 10 months ago
- ☆19 Jun 4, 2025 Updated 8 months ago
- Faster, more accurate and entirely open source method for predicting contacts in proteins ☆12 May 21, 2018 Updated 7 years ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆35 Oct 3, 2024 Updated last year
- ☆21 Jul 21, 2025 Updated 7 months ago
- Resources for the paper "NPA: News Recommendation with Personalized Attention" ☆48 Sep 22, 2022 Updated 3 years ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C… ☆17 May 27, 2024 Updated last year
- ☆13 Dec 4, 2017 Updated 8 years ago
- Using Huggingface to generate relation expressions ☆15 Jan 15, 2021 Updated 5 years ago
- ACL 2021: HiTransformer ☆13 May 29, 2021 Updated 4 years ago
- ☆13 Jan 14, 2022 Updated 4 years ago
- ☆13 Nov 7, 2021 Updated 4 years ago
- ☆13 Jan 27, 2019 Updated 7 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821 ☆3,641 Oct 16, 2024 Updated last year
- Recent Advances in MLP-based Models (MLP is all you need!) ☆116 Dec 13, 2022 Updated 3 years ago
- Rectified Rotary Position Embeddings ☆389 May 20, 2024 Updated last year