A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need
☆714 · Sep 24, 2021 · Updated 4 years ago
Alternatives and similar repositories for attention-is-all-you-need-keras
Users that are interested in attention-is-all-you-need-keras are comparing it to the libraries listed below.
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models ☆541 · May 30, 2020 · Updated 5 years ago
- Keras implementation of BERT with pre-trained weights ☆815 · Jul 26, 2019 · Updated 6 years ago
- Keras Attention Layer (Luong and Bahdanau scores). ☆2,813 · Mar 12, 2026 · Updated 2 weeks ago
- Transformer implemented in Keras ☆369 · Jan 22, 2022 · Updated 4 years ago
- Some attention implementations ☆1,452 · Nov 20, 2019 · Updated 6 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction ☆2,426 · Jan 22, 2022 · Updated 4 years ago
- Visualizing RNNs using the attention mechanism ☆750 · Jun 25, 2019 · Updated 6 years ago
- A TensorFlow Implementation of the Transformer: Attention Is All You Need ☆4,460 · May 21, 2023 · Updated 2 years ago
- Implementation of the Transformer architecture described by Vaswani et al. in "Attention Is All You Need" ☆28 · Apr 21, 2019 · Updated 6 years ago
- Attention mechanism for processing sequential data that considers the context for each timestamp. ☆657 · Jan 22, 2022 · Updated 4 years ago
- Neural Machine Translation with Keras ☆532 · Jul 30, 2021 · Updated 4 years ago
- Collection of custom layers and utility functions for Keras which are missing in the main framework. ☆62 · May 25, 2020 · Updated 5 years ago
- Sequence to Sequence Learning with Keras ☆3,176 · Aug 20, 2022 · Updated 3 years ago
- Using Keras + TensorFlow to implement the Transformer model from the paper "Attention Is All You Need". Using keras+tensorflow to implement the paper "Attention Is All … ☆34 · Jan 9, 2019 · Updated 7 years ago
- How to use ELMo embeddings in Keras with Tensorflow Hub ☆260 · Dec 18, 2018 · Updated 7 years ago
- Re-implementation of ELMo on Keras ☆135 · Mar 25, 2023 · Updated 3 years ago
- Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research. ☆17,110 · Jun 2, 2023 · Updated 2 years ago
- An easy-to-use BERT in keras via tf-hub. ☆11 · May 23, 2019 · Updated 6 years ago
- A PyTorch implementation of the Transformer model in "Attention is All You Need". ☆9,661 · Apr 16, 2024 · Updated last year
- XLNet: Generalized Autoregressive Pretraining for Language Understanding ☆6,178 · May 28, 2023 · Updated 2 years ago
- Keras community contributions ☆1,584 · Oct 21, 2022 · Updated 3 years ago
- Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN ☆967 · Jan 1, 2019 · Updated 7 years ago
- Position embedding layers in Keras ☆58 · Jan 22, 2022 · Updated 4 years ago
- TensorFlow implementation of 'Attention Is All You Need (2017. 6)' ☆349 · Apr 30, 2018 · Updated 7 years ago
- QANet in keras (with Cove) ☆66 · May 13, 2019 · Updated 6 years ago
- Implementation of Hierarchical Attention Networks as presented in https://www.cs.cmu.edu/~diyiy/docs/naacl16.pdf ☆57 · Mar 21, 2018 · Updated 8 years ago
- An example attention network with simple dataset. ☆228 · Mar 5, 2019 · Updated 7 years ago
- Load GPT-2 checkpoint and generate texts ☆127 · Jan 22, 2022 · Updated 4 years ago
- Keras implementation of Graph Convolutional Networks ☆794 · Apr 19, 2021 · Updated 4 years ago
- Text classifier for Hierarchical Attention Networks for Document Classification ☆1,080 · Sep 16, 2021 · Updated 4 years ago
- Implementation of Simple Recurrent Unit in Keras ☆90 · Nov 9, 2017 · Updated 8 years ago
- Keras Layer implementation of Attention for Sequential models ☆444 · Mar 25, 2023 · Updated 3 years ago
- Framework for building complex recurrent neural networks with Keras ☆768 · Oct 29, 2022 · Updated 3 years ago
- ☆536 · Dec 7, 2018 · Updated 7 years ago
- Code of Directional Self-Attention Network (DiSAN) ☆311 · May 8, 2018 · Updated 7 years ago
- A wrapper layer for stacking layers horizontally ☆228 · Jan 22, 2022 · Updated 4 years ago
- Keras implementation of transformers for humans ☆5,424 · Nov 11, 2024 · Updated last year
- 🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP ☆12,829 · Jan 23, 2024 · Updated 2 years ago
- Facilitating the design, comparison and sharing of deep text matching models. ☆3,853 · Aug 2, 2024 · Updated last year