CyberZHG / keras-multi-head
A wrapper layer for stacking layers horizontally
☆228Updated 3 years ago
Alternatives and similar repositories for keras-multi-head
Users that are interested in keras-multi-head are comparing it to the libraries listed below
- Attention mechanism for processing sequential data that considers the context for each timestamp.☆655Updated 3 years ago
- Transformer implemented in Keras☆371Updated 3 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆537Updated 5 years ago
- A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need☆712Updated 3 years ago
- Keras Layer implementation of Attention for Sequential models☆441Updated 2 years ago
- An example attention network with simple dataset.☆229Updated 6 years ago
- Layer normalization implemented in Keras☆60Updated 3 years ago
- Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with Ten…☆360Updated last year
- Attention-based LSTM/Dense implemented in Keras☆299Updated 7 years ago
- Self-attention and text classification☆119Updated 6 years ago
- Re-implementation of ELMo on Keras☆133Updated 2 years ago
- Attention-based bidirectional LSTM for Classification Task (ICASSP)☆115Updated 2 years ago
- An Attention Layer in Keras☆43Updated 6 years ago
- Binary and Categorical Focal loss implementation in Keras.☆278Updated 6 months ago
- TensorFlow implementation of focal loss☆189Updated 4 years ago
- Position embedding layers in Keras☆58Updated 3 years ago
- Document classification using LSTM + self-attention☆112Updated 5 years ago
- Implementation of the Transformer architecture described by Vaswani et al. in "Attention Is All You Need"☆28Updated 6 years ago
- A bidirectional LSTM with attention for multiclass/multilabel text classification.☆173Updated 9 months ago
- Visualizing RNNs using the attention mechanism☆750Updated 5 years ago
- This repository contains various types of attention mechanisms, such as Bahdanau, Soft Attention, Additive Attention, Hierarchical Attention…☆126Updated 3 years ago
- Transformer-based models implemented in TensorFlow 2.x (using Keras).☆75Updated 3 years ago
- PyTorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.☆230Updated 5 years ago
- Implementation of XLNet that can load pretrained checkpoints☆171Updated 3 years ago
- Keras implementation of BERT with pre-trained weights☆814Updated 5 years ago
- Collection of custom layers and utility functions for Keras which are missing in the main framework.☆62Updated 5 years ago
- Capsule network text classification models built with Keras, including RNN, CNN, HAN, etc.; keras_utils contains Keras implementations of the capsule layer and the attention layer☆77Updated 6 years ago
- How to use ELMo embeddings in Keras with TensorFlow Hub☆259Updated 6 years ago
- A PyTorch implementation of Fairseq Convolutional Sequence to Sequence Learning (Gehring et al. 2017)☆46Updated 6 years ago
- Transformer-XL with checkpoint loader☆68Updated 3 years ago