lsdefine / attention-is-all-you-need-keras
A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need
☆713 · Sep 24, 2021 · Updated 4 years ago
Alternatives and similar repositories for attention-is-all-you-need-keras
Users who are interested in attention-is-all-you-need-keras compare it to the libraries listed below.
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models ☆541 · May 30, 2020 · Updated 5 years ago
- Keras implementation of BERT with pre-trained weights ☆816 · Jul 26, 2019 · Updated 6 years ago
- Keras Attention Layer (Luong and Bahdanau scores). ☆2,816 · Nov 17, 2023 · Updated 2 years ago
- Transformer implemented in Keras ☆369 · Jan 22, 2022 · Updated 4 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction ☆2,428 · Jan 22, 2022 · Updated 4 years ago
- Some attention implementations ☆1,452 · Nov 20, 2019 · Updated 6 years ago
- Visualizing RNNs using the attention mechanism ☆751 · Jun 25, 2019 · Updated 6 years ago
- A TensorFlow Implementation of the Transformer: Attention Is All You Need ☆4,453 · May 21, 2023 · Updated 2 years ago
- Attention mechanism for processing sequential data that considers the context for each timestamp. ☆657 · Jan 22, 2022 · Updated 4 years ago
- Neural Machine Translation with Keras ☆530 · Jul 30, 2021 · Updated 4 years ago
- Sequence to Sequence Learning with Keras ☆3,177 · Aug 20, 2022 · Updated 3 years ago
- Implementation of the Transformer architecture described by Vaswani et al. in "Attention Is All You Need" ☆28 · Apr 21, 2019 · Updated 6 years ago
- How to use ELMo embeddings in Keras with Tensorflow Hub ☆260 · Dec 18, 2018 · Updated 7 years ago
- Using Keras + TensorFlow to implement the Transformer model from the paper "Attention Is All You Need". Implements the paper "Attention Is All …" with keras+tensorflow ☆34 · Jan 9, 2019 · Updated 7 years ago
- Collection of custom layers and utility functions for Keras which are missing in the main framework. ☆62 · May 25, 2020 · Updated 5 years ago
- Re-implementation of ELMo on Keras ☆135 · Mar 25, 2023 · Updated 2 years ago
- Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research. ☆16,997 · Jun 2, 2023 · Updated 2 years ago
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ☆6,177 · May 28, 2023 · Updated 2 years ago
- Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN ☆968 · Jan 1, 2019 · Updated 7 years ago
- A PyTorch implementation of the Transformer model in "Attention is All You Need". ☆9,629 · Apr 16, 2024 · Updated last year
- Position embedding layers in Keras ☆58 · Jan 22, 2022 · Updated 4 years ago
- Keras community contributions ☆1,585 · Oct 21, 2022 · Updated 3 years ago
- Load GPT-2 checkpoint and generate texts ☆127 · Jan 22, 2022 · Updated 4 years ago
- An easy-to-use BERT in keras via tf-hub. ☆11 · May 23, 2019 · Updated 6 years ago
- Text classifier for Hierarchical Attention Networks for Document Classification ☆1,081 · Sep 16, 2021 · Updated 4 years ago
- Implementation of Simple Recurrent Unit in Keras ☆90 · Nov 9, 2017 · Updated 8 years ago
- An example attention network with simple dataset. ☆228 · Mar 5, 2019 · Updated 6 years ago
- TensorFlow implementation of 'Attention Is All You Need (2017. 6)' ☆349 · Apr 30, 2018 · Updated 7 years ago
- Framework for building complex recurrent neural networks with Keras ☆768 · Oct 29, 2022 · Updated 3 years ago
- Implementation of Hierarchical Attention Networks as presented in https://www.cs.cmu.edu/~diyiy/docs/naacl16.pdf ☆57 · Mar 21, 2018 · Updated 7 years ago
- Facilitating the design, comparison and sharing of deep text matching models. ☆3,854 · Aug 2, 2024 · Updated last year
- Convolutional Neural Networks for Sentence Classification in Keras ☆596 · Nov 13, 2018 · Updated 7 years ago
- ☆536 · Dec 7, 2018 · Updated 7 years ago
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP ☆12,819 · Jan 23, 2024 · Updated 2 years ago
- Keras implementation of Graph Convolutional Networks ☆794 · Apr 19, 2021 · Updated 4 years ago
- QANet in keras (with Cove) ☆66 · May 13, 2019 · Updated 6 years ago
- Keras implementation of Transformers for humans ☆5,420 · Nov 11, 2024 · Updated last year
- Code of Directional Self-Attention Network (DiSAN) ☆311 · May 8, 2018 · Updated 7 years ago
- all kinds of text classification models and more with deep learning ☆7,951 · Sep 28, 2023 · Updated 2 years ago