CyberZHG / keras-multi-head
A wrapper layer for stacking layers horizontally
☆227Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for keras-multi-head
- Attention mechanism for processing sequential data that considers the context for each timestamp.☆656Updated 2 years ago
- Transformer implemented in Keras☆370Updated 2 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆533Updated 4 years ago
- A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need☆708Updated 3 years ago
- Keras Layer implementation of Attention for Sequential models☆444Updated last year
- Layer normalization implemented in Keras☆60Updated 2 years ago
- Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with Ten…☆345Updated 9 months ago
- An example attention network with simple dataset.☆230Updated 5 years ago
- attention-based LSTM/Dense implemented by Keras☆294Updated 6 years ago
- Attention-based bidirectional LSTM for Classification Task (ICASSP)☆108Updated last year
- Collection of custom layers and utility functions for Keras which are missing in the main framework.☆62Updated 4 years ago
- 自注意力与文本分类☆119Updated 6 years ago
- TensorFlow implementation of focal loss☆186Updated 3 years ago
- Implementation of the Transformer architecture described by Vaswani et al. in "Attention Is All You Need"☆28Updated 5 years ago
- Binary and Categorical Focal loss implementation in Keras.☆278Updated last year
- An Attention Layer in Keras☆43Updated 5 years ago
- Pytorch implementation of R-Transformer. Some parts of the code are adapted from the implementation of TCN and Transformer.☆224Updated 5 years ago
- Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers☆167Updated 2 years ago
- This repository contain various types of attention mechanism like Bahdanau , Soft attention , Additive Attention , Hierarchical Attention…☆122Updated 3 years ago
- Transformer-based models implemented in tensorflow 2.x(using keras).☆75Updated 2 years ago
- Re-implementation of ELMo on Keras☆135Updated last year
- An implementation of a sequence to sequence neural network using an encoder-decoder☆208Updated 5 years ago
- Position embedding layers in Keras☆59Updated 2 years ago
- A pytorch implementation of Fairseq Convolutional Sequence to Sequence Learning(Gehring et al. 2017)☆44Updated 5 years ago
- A simple technique to integrate BERT from tf hub to keras☆258Updated last year
- An LSTM in PyTorch with best practices (weight dropout, forget bias, etc.) built-in. Fully compatible with PyTorch LSTM.☆133Updated 4 years ago
- Transformer-XL with checkpoint loader☆68Updated 2 years ago
- document classification using LSTM + self attention☆112Updated 5 years ago
- A Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction☆111Updated 4 months ago
- A bidirectional LSTM with attention for multiclass/multilabel text classification.☆172Updated 2 months ago