facebookresearch/transformer-sequential

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/facebookresearch/transformer-sequential)

facebookresearch / transformer-sequential

Trains Transformer model variants. Data isn't shuffled between batches.

☆147

Alternatives and similar repositories for transformer-sequential

Users that are interested in transformer-sequential are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

automl / nes
View on GitHub
Neural Ensemble Search for Uncertainty Estimation and Dataset Shift
☆35Jan 10, 2026Updated 6 months ago
timy90022 / DropLoss
View on GitHub
Implementation of DropLoss for Long-Tail Instance Segmentation in Pytorch
☆44Apr 14, 2021Updated 5 years ago
NVIDIA / transformer-ls
View on GitHub
Official PyTorch Implementation of Long-Short Transformer (NeurIPS 2021).
☆228Apr 18, 2022Updated 4 years ago
mlelarge / graph_neural_net
View on GitHub
Expressive Power of Invariant and Equivariant Graph Neural Networks (ICLR 2021)
☆43Aug 25, 2023Updated 2 years ago
louislva / deepmind-perceiver
View on GitHub
My implementation of DeepMind's Perceiver
☆65Apr 23, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ebartrum / lightning_gan_zoo
View on GitHub
GAN models implemented with Pytorch Lightning and Hydra configuration
☆33Jun 5, 2022Updated 4 years ago
chirag-agarwall / VOG
View on GitHub
Estimating Example Difficulty using Variance of Gradients
☆66Jan 10, 2023Updated 3 years ago
anguyen8 / sam
View on GitHub
Code for the CVPR 2020 [ORAL] paper "SAM: The Sensitivity of Attribution Methods to Hyperparameters"
☆29Dec 8, 2022Updated 3 years ago
facebookresearch / mega
View on GitHub
Sequence modeling with Mega.
☆303Jan 28, 2023Updated 3 years ago
donutloop / machine-learning-research-papers
View on GitHub
Collection of machine learning research paper references
☆26Updated this week
willwhitney / reprieve
View on GitHub
A library for evaluating representations.
☆81Nov 21, 2021Updated 4 years ago
mle-infrastructure / mle-scheduler
View on GitHub
Lightweight Cluster/Cloud VM Job Management 🚀
☆44Aug 27, 2024Updated last year
microsoft / fastseq
View on GitHub
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…
☆433Aug 17, 2022Updated 3 years ago
facebookresearch / w2ot
View on GitHub
Euclidean Wasserstein-2 optimal transportation
☆48Aug 19, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zomux / lanmt-ebm
View on GitHub
lanmt ebm
☆12Jun 19, 2020Updated 6 years ago
nttcslab / Generalized-Domain-Adaptation
View on GitHub
☆12Jun 18, 2021Updated 5 years ago
facebookresearch / xcit
View on GitHub
Official code Cross-Covariance Image Transformer (XCiT)
☆681Sep 28, 2021Updated 4 years ago
parrt / fundamentals-of-deep-learning
View on GitHub
Course notes and notebooks to teach the fundamentals of how deep learning works; uses PyTorch.
☆84Feb 16, 2021Updated 5 years ago
rish-16 / aft-pytorch
View on GitHub
Unofficial PyTorch implementation of Attention Free Transformer (AFT) layers by Apple Inc.
☆246Feb 16, 2026Updated 5 months ago
Mehooz / BIRD_code
View on GitHub
Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".
☆14May 23, 2021Updated 5 years ago
google-research / dice_rl
View on GitHub
☆114Jul 3, 2026Updated 2 weeks ago
PAL-ML / PEARL_v1
View on GitHub
☆30Jan 17, 2022Updated 4 years ago
roeeaharoni / string-to-tree-nmt
View on GitHub
Source code and data for the paper "Towards String-to-Tree Neural Machine Translation"
☆16Dec 31, 2017Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
s-nlp / annotated-transformer
View on GitHub
http://nlp.seas.harvard.edu/2018/04/03/attention.html
☆63May 20, 2021Updated 5 years ago
openopt / copt
View on GitHub
A Python library for mathematical optimization
☆143Sep 27, 2024Updated last year
lucidrains / feedback-transformer-pytorch
View on GitHub
Implementation of Feedback Transformer in Pytorch
☆108Mar 2, 2021Updated 5 years ago
lucidrains / g-mlp-gpt
View on GitHub
GPT, but made only out of MLPs
☆89May 25, 2021Updated 5 years ago
facebookresearch / SentAugment
View on GitHub
SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…
☆359Feb 22, 2022Updated 4 years ago
facebookresearch / reconsider
View on GitHub
ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…
☆50Apr 26, 2021Updated 5 years ago
ahmetustun / udapter
View on GitHub
UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…
☆31Dec 5, 2022Updated 3 years ago
seloufian / Deep-Learning-Computer-Vision
View on GitHub
My assignment solutions for Stanford’s CS231n (CNNs for Visual Recognition) and Michigan’s EECS 498-007/598-005 (Deep Learning for Comput…
☆135Apr 4, 2021Updated 5 years ago
ofirpress / shortformer
View on GitHub
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
☆147Jul 26, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
studio-ousia / bpr
View on GitHub
Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering
☆175Jun 6, 2021Updated 5 years ago
HazyResearch / embroid
View on GitHub
Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification
☆11Aug 12, 2023Updated 2 years ago
facebookresearch / unlikelihood_training
View on GitHub
Neural Text Generation with Unlikelihood Training
☆311Aug 31, 2021Updated 4 years ago
mle-infrastructure / mle-logging
View on GitHub
Lightweight ML Experiment Logging 📖
☆83Aug 26, 2024Updated last year
lucidrains / memformer
View on GitHub
Implementation of Memformer, a Memory-augmented Transformer, in Pytorch
☆126Nov 13, 2020Updated 5 years ago
lyeoni / gpt-pytorch
View on GitHub
PyTorch Implementation of OpenAI GPT
☆131Jun 28, 2023Updated 3 years ago
PruneTruong / GOCor
View on GitHub
Official Pytorch implementation of 'GOCor: Bringing Globally Optimized Correspondence Volumes into Your Neural Network' (NeurIPS 2020)
☆76Feb 23, 2023Updated 3 years ago