Implementation of RealFormer using PyTorch
☆101 · Dec 27, 2020 · Updated 5 years ago
Alternatives and similar repositories for realformer-pytorch
Users that are interested in realformer-pytorch are comparing it to the libraries listed below.
- ☆42 · Jan 22, 2021 · Updated 5 years ago
- 7th place solution to RecSys Challenge 2023 by Corca ☆10 · Jan 8, 2024 · Updated 2 years ago
- Github repository for Zero Shot Visual Storytelling ☆15 · Dec 6, 2021 · Updated 4 years ago
- BERT Baseline for the Natural Questions ☆11 · Jan 24, 2019 · Updated 7 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆147 · Jul 26, 2021 · Updated 4 years ago
- [ICLR 2020] Lite Transformer with Long-Short Range Attention ☆610 · Jul 11, 2024 · Updated last year
- A PyTorch implementation of the paper - "Synthesizer: Rethinking Self-Attention in Transformer Models" ☆75 · Dec 8, 2022 · Updated 3 years ago
- [ACL 2020] DeFormer: Decomposing Pre-trained Transformers for Faster Question Answering ☆120 · May 22, 2023 · Updated 3 years ago
- ☆10 · Apr 2, 2022 · Updated 4 years ago
- Streamlit, but better. ☆16 · Feb 5, 2024 · Updated 2 years ago
- Implementation of Feedback Transformer in Pytorch ☆108 · Mar 2, 2021 · Updated 5 years ago
- codes and pre-trained models of paper "Segatron: Segment-aware Transformer for Language Modeling and Understanding" ☆18 · Oct 25, 2022 · Updated 3 years ago
- LGEB: Benchmark of Language Generation Evaluation ☆16 · Oct 21, 2022 · Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾 ☆35 · Oct 15, 2025 · Updated 8 months ago
- A template for starting an allennlp project using a python script instead of config files ☆27 · Mar 20, 2024 · Updated 2 years ago
- ☆22 · Dec 26, 2020 · Updated 5 years ago
- ☆19 · Jul 1, 2020 · Updated 6 years ago
- ☆18 · Jun 15, 2023 · Updated 3 years ago
- code for Explicit Sparse Transformer ☆60 · Jul 21, 2023 · Updated 2 years ago
- ☆11 · Aug 10, 2021 · Updated 4 years ago
- NER Task with CNN + BiLSTM + CRF (with Naver NLP Challenge dataset) with Pytorch ☆31 · Jul 25, 2024 · Updated last year
- Comparing attention-based convolutional and recurrent neural networks under adversarial attacks to investigate their success and limitati… ☆10 · Aug 24, 2018 · Updated 7 years ago
- Visual Transformers with Primal Object Queries for Multi-Label Image Classification ☆12 · May 17, 2022 · Updated 4 years ago
- Unofficial Implementation of MagicMix ☆98 · Nov 3, 2022 · Updated 3 years ago
- M3TR: Multi-modal Multi-label Recognition with Transformer. ACM MM 2021 ☆16 · Oct 27, 2021 · Updated 4 years ago
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆91 · Aug 24, 2021 · Updated 4 years ago
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning ☆48 · Sep 3, 2023 · Updated 2 years ago
- ☆220 · Jun 8, 2020 · Updated 6 years ago
- Encode-attend-navigate unofficial Pytorch implementation ☆12 · Oct 1, 2024 · Updated last year
- Code for the paper "Adaptive Transformers for Learning Multimodal Representations" (ACL SRW 2020) ☆43 · Oct 20, 2022 · Updated 3 years ago
- ☆13 · Jan 24, 2018 · Updated 8 years ago
- [ACL'21 Findings] Why Machine Reading Comprehension Models Learn Shortcuts? ☆16 · Aug 8, 2023 · Updated 2 years ago
- An implementation of Performer, a linear attention-based transformer, in Pytorch ☆1,176 · Feb 2, 2022 · Updated 4 years ago
- ☆33 · Apr 12, 2021 · Updated 5 years ago
- My take on a practical implementation of Linformer for Pytorch. ☆424 · Jul 27, 2022 · Updated 3 years ago
- Apollo: An Adaptive Parameter-wise Diagonal Quasi-Newton Method for Nonconvex Stochastic Optimization ☆181 · Nov 21, 2021 · Updated 4 years ago
- Sparse Attention with Linear Units ☆20 · Apr 21, 2021 · Updated 5 years ago
- ☆22 · Oct 14, 2021 · Updated 4 years ago
- ☆20 · Jun 7, 2020 · Updated 6 years ago