TimDettmers/transformer-xl

☆65

Alternatives and similar repositories for transformer-xl

Users who are interested in transformer-xl are comparing it to the libraries listed below.

harvardnlp / cascaded-generation
Cascaded Text Generation with Markov Transformers
☆130 · Mar 20, 2023 · Updated 3 years ago
timvieira / rl
Reference implementation of algorithms for reinforcement learning and Markov decision processes.
☆12 · Jan 28, 2021 · Updated 5 years ago
spyysalo / wiki-bert-pipeline
Generate BERT vocabularies and pretraining examples from Wikipedias
☆17 · May 11, 2020 · Updated 6 years ago
mhagiwara / nanigonet
NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks
☆71 · May 22, 2023 · Updated 3 years ago
nttcslab-nlp / doc_lm
☆11 · Jan 9, 2019 · Updated 7 years ago
facebookresearch / unlikelihood_training
Neural Text Generation with Unlikelihood Training
☆311 · Aug 31, 2021 · Updated 4 years ago
ShaojieJiang / tldr
Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
☆10 · Aug 11, 2023 · Updated 2 years ago
Holmeswww / PPOGAN
☆25 · May 3, 2024 · Updated 2 years ago
jungokasai / deep-shallow
☆43 · Sep 16, 2020 · Updated 5 years ago
srush / learns-dex
☆33 · Jan 14, 2021 · Updated 5 years ago
Roxot / mbr-nmt
Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation
☆16 · Oct 14, 2022 · Updated 3 years ago
XMUDeepLIT / VarNDRR
Code for "Variational Neural Discourse Relation Recognizer" (EMNLP 2016)
☆16 · Dec 29, 2017 · Updated 8 years ago
microsoft / conservative-uncertainty-estimation-random-priors
Source code for paper Conservative Uncertainty Estimation By Fitting Prior Networks (ICLR 2020)
☆22 · Nov 28, 2022 · Updated 3 years ago
zhaoyanpeng / cpcfg
Fast and Modularized CFG-focused Models
☆23 · Nov 8, 2023 · Updated 2 years ago
microsoft / xtreme-distil-transformers
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆157 · Dec 20, 2023 · Updated 2 years ago
cognitiveailab / tg2021task
Participant Kit for the TextGraphs-15 Shared Task on Explanation Regeneration
☆18 · Nov 8, 2021 · Updated 4 years ago
nng555 / ssmba
☆61 · Apr 19, 2022 · Updated 4 years ago
allenai / tpu_pretrain
LM Pretraining with PyTorch/TPU
☆137 · Oct 24, 2019 · Updated 6 years ago
jwieting / paraphrastic-representations-at-scale
☆74 · Jul 2, 2021 · Updated 5 years ago
MadryLab / dataset-replication-analysis
☆25 · May 20, 2020 · Updated 6 years ago
tacchinotacchi / distil-bilstm
Scripts to train a bidirectional LSTM with knowledge distillation from BERT
☆159 · Nov 21, 2019 · Updated 6 years ago
Kyubyong / cjk_trans
Pre-trained Machine Translation Models of Korean from/to ECJ
☆28 · Jul 15, 2019 · Updated 7 years ago
zxie / vae
Variational autoencoder in Theano
☆11 · Sep 14, 2017 · Updated 8 years ago
yoonkim / neural-qcfg
☆45 · Oct 11, 2021 · Updated 4 years ago
MSR-LIT / XtremeDistil
☆16 · Jun 12, 2023 · Updated 3 years ago
srush / awesome-ml-tracking
☆105 · Jan 14, 2021 · Updated 5 years ago
kweonwooj / papers
summary of ML papers I've read
☆323 · Jul 27, 2018 · Updated 8 years ago
allenai / sledgehammer
☆48 · Jun 8, 2020 · Updated 6 years ago
fajri91 / RSTExtractor
☆11 · Dec 8, 2022 · Updated 3 years ago
cdg720 / emnlp2016
☆47 · May 22, 2017 · Updated 9 years ago
sjchoi86 / tf_practice
TensorFlow 1.x Practice
☆15 · Oct 2, 2020 · Updated 5 years ago
locuslab / monotone_op_net
Monotone operator equilibrium networks
☆53 · Jun 22, 2020 · Updated 6 years ago
ltgoslo / simple_elmo_training
Minimal code to train ELMo models in recent versions of TensorFlow
☆14 · Jun 16, 2026 · Updated last month
shijie-wu / crosslingual-nlp
This repo supports various cross-lingual transfer learning & multilingual NLP models.
☆92 · Sep 13, 2023 · Updated 2 years ago
curto2 / mckernel
McKernel: A Library for Approximate Kernel Expansions in Log-linear Time.
☆14 · Sep 3, 2022 · Updated 3 years ago
boschresearch / adversarial_meta_embeddings
Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"
☆13 · Dec 14, 2021 · Updated 4 years ago
midas-research / bhaav
Dataset of sentences from Hindi stories tagged with different emotion tags
☆11 · Nov 26, 2019 · Updated 6 years ago
edwardjhu / improved_wasserstein
Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"
☆14 · Apr 28, 2020 · Updated 6 years ago
yinwenpeng / SciTail
This released code is for our ACL2018 paper "End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions". …
☆15 · May 28, 2018 · Updated 8 years ago