Trains Transformer model variants. Data isn't shuffled between batches.
☆143Oct 5, 2022Updated 3 years ago
Alternatives and similar repositories for transformer-sequential
Users that are interested in transformer-sequential are comparing it to the libraries listed below
Sorting:
- Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).☆228Apr 18, 2022Updated 3 years ago
- My implementation of DeepMind's Perceiver☆63Apr 23, 2021Updated 4 years ago
- Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch☆42Apr 14, 2021Updated 4 years ago
- Expressive Power of Invariant and Equivariant Graph Neural Networks (ICLR 2021)☆41Aug 25, 2023Updated 2 years ago
- Estimating Example Difficulty using Variance of Gradients☆64Jan 10, 2023Updated 3 years ago
- GAN models implemented with Pytorch Lightning and Hydra configuration☆33Jun 5, 2022Updated 3 years ago
- ☆30Jan 17, 2022Updated 4 years ago
- Sequence modeling with Mega.☆303Jan 28, 2023Updated 3 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.☆245Feb 16, 2026Updated last week
- Collection of machine learning research paper references☆26Feb 23, 2025Updated last year
- lanmt ebm☆12Jun 19, 2020Updated 5 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Aug 17, 2022Updated 3 years ago
- ☆100Dec 8, 2021Updated 4 years ago
- ☆388Oct 18, 2023Updated 2 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering☆175Jun 6, 2021Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Jul 26, 2021Updated 4 years ago
- http://nlp.seas.harvard.edu/2018/04/03/attention.html☆63May 20, 2021Updated 4 years ago
- A library for evaluating representations.☆77Nov 21, 2021Updated 4 years ago
- Official code Cross-Covariance Image Transformer (XCiT)☆674Sep 28, 2021Updated 4 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆359Feb 22, 2022Updated 4 years ago
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".☆14May 23, 2021Updated 4 years ago
- Euclidean Wasserstein-2 optimal transportation☆46Aug 19, 2023Updated 2 years ago
- Efficient, check-pointed data loading for deep learning with massive data sets.☆211Jun 12, 2023Updated 2 years ago
- Lightweight Cluster/Cloud VM Job Management 🚀☆42Aug 27, 2024Updated last year
- Unofficial PyTorch implementation of Fastformer based on paper "Fastformer: Additive Attention Can Be All You Need"."☆132Sep 6, 2021Updated 4 years ago
- ☆13Jun 18, 2021Updated 4 years ago
- Neural Text Generation with Unlikelihood Training☆310Aug 31, 2021Updated 4 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Dec 5, 2022Updated 3 years ago
- Course notes and notebooks to teach the fundamentals of how deep learning works; uses PyTorch.☆80Feb 16, 2021Updated 5 years ago
- A Python library for mathematical optimization☆141Sep 27, 2024Updated last year
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Dec 6, 2024Updated last year
- PyTorch original implementation of Cross-lingual Language Model Pretraining.☆2,924Feb 14, 2023Updated 3 years ago
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)☆63Apr 19, 2022Updated 3 years ago
- (K3IM) Keras 3 Image Models☆20Feb 22, 2024Updated 2 years ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆606Jun 15, 2022Updated 3 years ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"☆20May 19, 2022Updated 3 years ago
- ☆21Dec 8, 2022Updated 3 years ago