Implementation of Transformer Model in Tensorflow
☆483Mar 25, 2023Updated 2 years ago
Alternatives and similar repositories for transformer-tensorflow
Users that are interested in transformer-tensorflow are comparing it to the libraries listed below
Sorting:
- A TensorFlow Implementation of the Transformer: Attention Is All You Need☆4,455May 21, 2023Updated 2 years ago
- TensorFlow implementation of 'Attention Is All You Need (2017. 6)'☆349Apr 30, 2018Updated 7 years ago
- Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.☆17,025Jun 2, 2023Updated 2 years ago
- Sequence-to-Sequence Model for User Simulation☆10Feb 6, 2017Updated 9 years ago
- A PyTorch implementation of the Transformer model in "Attention is All You Need".☆9,634Apr 16, 2024Updated last year
- TensorFlow code and pre-trained models for BERT☆39,879Jul 23, 2024Updated last year
- ☆3,686Sep 21, 2022Updated 3 years ago
- An annotated implementation of the Transformer paper.☆7,058Apr 7, 2024Updated last year
- Conversational Word Embedding for Retrieval-based Dialog System (ACL2020)☆30Sep 2, 2020Updated 5 years ago
- MT Tutorial for the JSALT 2019 Summer School☆48Jun 24, 2019Updated 6 years ago
- TensorFlow Neural Machine Translation Tutorial☆6,466Oct 9, 2022Updated 3 years ago
- A repository containing tutorials for practical NLP using PyTorch☆537Sep 14, 2019Updated 6 years ago
- a simple yet complete implementation of the popular BERT model☆128Mar 19, 2020Updated 5 years ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 3 months ago
- Recurrent Discounted Attention unit (RDA) for Tensorflow☆22Mar 12, 2018Updated 7 years ago
- PyTorch implementation of context2vec from Melamud et al., CoNLL 2016☆19Sep 25, 2018Updated 7 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,170Sep 30, 2025Updated 5 months ago
- Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"☆1,611Aug 12, 2020Updated 5 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding☆6,176May 28, 2023Updated 2 years ago
- Code for EMNLP 2018 paper "Auto-Encoding Dictionary Definitions into Consistent Word Embeddings"☆36Aug 22, 2018Updated 7 years ago
- lattice lstm cell implementation with tensorflow☆30Aug 3, 2018Updated 7 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆541May 30, 2020Updated 5 years ago
- Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆2,197Jun 16, 2022Updated 3 years ago
- Model implementation for the contextual embeddings project☆41Jun 2, 2025Updated 9 months ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation☆1,122Nov 28, 2022Updated 3 years ago
- 🐥A PyTorch implementation of OpenAI's finetuned transformer language model with a script to import the weights pre-trained by OpenAI☆1,523Aug 9, 2021Updated 4 years ago
- pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"☆915Jan 23, 2023Updated 3 years ago
- 夸夸机器人☆20Dec 23, 2021Updated 4 years ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- Software library RLCM (recursively low-rank compressed matrices)☆14Apr 15, 2021Updated 4 years ago
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the mo…☆22,981Jul 28, 2024Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"☆6,489Jan 14, 2026Updated last month
- Reliable Uncertainty Estimates in Deep Neural Networks using Noise Contrastive Priors☆62Apr 8, 2020Updated 5 years ago
- Python3 ROS Interface to Rethink Sawyer Robots with OpenAI Gym Compatibility☆62Apr 13, 2019Updated 6 years ago
- Lingvo☆2,857Feb 20, 2026Updated 2 weeks ago
- some attention implements☆1,452Nov 20, 2019Updated 6 years ago
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…☆157,462Updated this week
- 深度学习相关的模型训练、评估和预测相关代码☆1,040Jul 26, 2021Updated 4 years ago
- Dynamic seq2seq in TensorFlow, step by step☆994Aug 20, 2017Updated 8 years ago