Training Transformer-XL on 128 GPUs
☆141 · Jun 11, 2020 · Updated 5 years ago
Alternatives and similar repositories for transformer-xl
Users that are interested in transformer-xl are comparing it to the libraries listed below.
- Lightweight interface to AWS ☆47 · Oct 15, 2019 · Updated 6 years ago
- Transformer training code for sequential tasks ☆609 · Sep 14, 2021 · Updated 4 years ago
- NLP library designed for reproducible experimentation management ☆294 · Jul 25, 2024 · Updated last year
- Implements an infinite sum of poisson-weighted convolutions ☆27 · Aug 22, 2018 · Updated 7 years ago
- Implementation of Adversarial Variational Optimization in PyTorch ☆42 · Aug 7, 2018 · Updated 7 years ago
- Implementation of https://arxiv.org/abs/1904.00962 ☆377 · Dec 9, 2020 · Updated 5 years ago
- A curated list of papers exploring the limits of deep learning for NLP ☆24 · Mar 20, 2018 · Updated 8 years ago
- PyTorch implementation of the NIPS'17 paper Training Deep Networks without Learning Rates Through Coin Betting. ☆38 · May 15, 2018 · Updated 7 years ago
- Code for "EigenDamage: Structured Pruning in the Kronecker-Factored Eigenbasis" https://arxiv.org/abs/1905.05934 ☆113 · Mar 3, 2020 · Updated 6 years ago
- Low precision Torch nn library using uint8_t GEMM (experiment) ☆19 · Mar 13, 2016 · Updated 10 years ago
- ☆3,697 · Sep 21, 2022 · Updated 3 years ago
- PyTorch Implementation of "Non-Autoregressive Neural Machine Translation" ☆271 · Feb 12, 2022 · Updated 4 years ago
- Run Pytorch graphs inside Theano graph (and pytorch wrapper for AIS for generative models). ☆18 · Oct 19, 2017 · Updated 8 years ago
- Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CAS… ☆746 · Apr 14, 2022 · Updated 3 years ago
- Using Fastai library to classify Twitter jokes in Spanish ☆12 · Jul 4, 2019 · Updated 6 years ago
- A neural assembly compiler for pyTorch based on adaptive-neural-compilation ☆27 · Mar 27, 2018 · Updated 8 years ago
- .NET bindings for the Pytorch engine ☆17 · Oct 26, 2019 · Updated 6 years ago
- PhD thesis (updating) of Jiatao Gu from HKU ☆19 · Aug 10, 2018 · Updated 7 years ago
- train on AWS ☆76 · Sep 21, 2018 · Updated 7 years ago
- ☆37 · Mar 27, 2019 · Updated 7 years ago
- Repository of code for the tutorial on Transfer Learning in NLP held at NAACL 2019 in Minneapolis, MN, USA ☆722 · Oct 16, 2019 · Updated 6 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,927 · Feb 14, 2023 · Updated 3 years ago
- Original PyTorch implementation of the Leap meta-learner (https://arxiv.org/abs/1812.01054) along with code for running the Omniglot expe… ☆146 · Apr 10, 2023 · Updated 2 years ago
- A Chainer implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI ☆28 · Jun 20, 2018 · Updated 7 years ago
- ☆48 · Apr 26, 2018 · Updated 7 years ago
- Implementation of the pseudo-reference generation algorithm proposed in EMNLP 2018 paper: Multi-Reference Training with Pseudo-References… ☆11 · Oct 15, 2018 · Updated 7 years ago
- LibreOffice Neural Machine Translation ☆72 · Nov 4, 2020 · Updated 5 years ago
- Neural networks training pipeline based on PyTorch ☆312 · Jun 1, 2020 · Updated 5 years ago
- Supporting example for "A Rust SentencePiece implementation" ☆20 · Jun 7, 2020 · Updated 5 years ago
- LSTM and QRNN Language Model Toolkit for PyTorch ☆1,990 · Feb 12, 2022 · Updated 4 years ago
- Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755) ☆2,112 · Jan 4, 2022 · Updated 4 years ago
- higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr… ☆1,629 · Mar 25, 2022 · Updated 4 years ago
- A faster python zipfile. ☆31 · Feb 22, 2018 · Updated 8 years ago
- Code for the Eager Translation Model from the paper You May Not Need Attention ☆294 · Dec 17, 2018 · Updated 7 years ago
- Cascaded Text Generation with Markov Transformers ☆130 · Mar 20, 2023 · Updated 3 years ago
- Unsupervised text tokenizer focused on computational efficiency ☆977 · Mar 29, 2024 · Updated 2 years ago
- lua apply function for cutorch ☆17 · Jan 5, 2017 · Updated 9 years ago
- Nervana Neon kernels in Torch ☆19 · Nov 5, 2015 · Updated 10 years ago
- Write PyTorch code at the level of individual examples, then run it efficiently on minibatches. ☆485 · Feb 12, 2022 · Updated 4 years ago