kefirski / attentive-translation

PyTorch Transformer model with byte-pair encoding

☆11

Alternatives and similar repositories for attentive-translation

Users who are interested in attentive-translation are comparing it to the libraries listed below.


lucidrains / distilled-retriever-pytorch
Implementation of the retriever distillation procedure as outlined in the paper "Distilling Knowledge from Reader to Retriever"
☆32 Updated 4 years ago
MultiPath / Efficient-Neural-Machine-Translation
PhD thesis (updating) of Jiatao Gu from HKU
☆19 Updated 6 years ago
ofirpress / PartialShuffle
☆14 Updated 6 years ago
lioutasb / TaLKConvolutions
Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)
☆29 Updated 4 years ago
bzhangGo / lrn
Source code for "A Lightweight Recurrent Network for Sequence Modeling"
☆26 Updated 2 years ago
zbloss / reformer_lm
A PyTorch implementation of the Reformer Network (https://openreview.net/pdf?id=rkgNKkHtvB)
☆53 Updated 2 years ago
RemiLeblond / SeaRNN-open
Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)
☆48 Updated 7 years ago
jiesutd / PyTorchSequence
☆9 Updated 7 years ago
yzh119 / segtree-transformer-v0
Code for SegTree Transformer (ICLR-RLGM 2019).
☆27 Updated 5 years ago
agadetsky / pytorch-definitions
[ACL 2018] Conditional Generators of Words Definitions
☆33 Updated 6 years ago
uralik / beamdream
☆28 Updated 3 years ago
demelin / Noise-Contrastive-Estimation-NCE-for-pyTorch
Re-implementation of the Noise Contrastive Estimation algorithm for pyTorch, following "Noise-contrastive estimation: A new estimation pr…
☆44 Updated 5 years ago
yandex-research / graph-glove
PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning"
☆41 Updated 4 years ago
cgraywang / transformer-on-diet
Code repo for "Transformer on a Diet" paper
☆31 Updated 5 years ago
ishalyminov / babi_tools
Augmentation scripts for the bAbI Dialog Tasks dataset
☆13 Updated 6 years ago
seraphlabs-ca / MIM
Code for "MIM: Mutual Information Machine" paper.
☆16 Updated 2 years ago
Sandeep42 / anuvada
Interpretable Models for NLP using PyTorch
☆18 Updated 7 years ago
OliverRichter / normalized-attention
Code publication to the paper "Normalized Attention Without Probability Cage"
☆16 Updated 3 years ago
cambridgeltl / adversarial-postspec
Auxiliary GAN for WE post-specialisation
☆24 Updated 6 years ago
ItzikMalkiel / MTAdam
MTAdam: Automatic Balancing of Multiple Training Loss Terms
☆36 Updated 4 years ago
rtmdrr / replicability-analysis-NLP
☆15 Updated 4 years ago
vyraun / long-tailed
Code for "On Long-Tailed Phenomena in NMT".
☆10 Updated 4 years ago
cindyxinyiwang / SDE
Source code for the paper "Multilingual Neural Machine Translation with Soft Decoupled Encoding"
☆29 Updated 4 years ago
JACKHAHA363 / VQVAE
A replication of https://arxiv.org/pdf/1711.00937.pdf
☆16 Updated 7 years ago
Noahs-ARK / rational-recurrences
Implementation for "Rational Recurrences", Peng et al., EMNLP 2018.
☆28 Updated 3 years ago
giannisdaras / smyrf
[NeurIPS 2020] Official Implementation: "SMYRF: Efficient Attention using Asymmetric Clustering".
☆50 Updated last year
pytorch-tpu / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆22 Updated 2 years ago
HA-Transformer / MAT
The implementation of multi-branch attentive Transformer (MAT).
☆33 Updated 4 years ago
ofirpress / sandwich_transformer
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …
☆55 Updated 4 years ago
jmhessel / multi-retrieval
Code for Unsupervised Discovery of Multimodal Links in Multi-Image/Multi-Sentence Documents
☆30 Updated 4 years ago