CyberZHG / keras-multi-headView external linksLinks
A wrapper layer for stacking layers horizontally
☆228Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for keras-multi-head
Users that are interested in keras-multi-head are comparing it to the libraries listed below
Sorting:
- Attention mechanism for processing sequential data that considers the context for each timestamp.☆657Jan 22, 2022Updated 4 years ago
- Transformer implemented in Keras☆369Jan 22, 2022Updated 4 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆541May 30, 2020Updated 5 years ago
- Transformer-XL with checkpoint loader☆68Jan 22, 2022Updated 4 years ago
- Layer normalization implemented in Keras☆60Jan 22, 2022Updated 4 years ago
- Position embedding layers in Keras☆58Jan 22, 2022Updated 4 years ago
- Keras Attention Layer (Luong and Bahdanau scores).☆2,816Nov 17, 2023Updated 2 years ago
- Adaptive embedding and softmax☆17Jan 22, 2022Updated 4 years ago
- Lookahead mechanism for optimizers in Keras.☆50Jun 24, 2021Updated 4 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction☆2,428Jan 22, 2022Updated 4 years ago
- SNAIL Attention Block for Keras.☆17Mar 30, 2020Updated 5 years ago
- Tensorflow port implementation of Single Headed Attention RNN☆16Feb 1, 2020Updated 6 years ago
- An Attention Layer in Keras☆43Apr 23, 2019Updated 6 years ago
- Keras Layer implementation of Attention for Sequential models☆444Mar 25, 2023Updated 2 years ago
- 数据预处理——插值法填补缺失值,并且标记填充位置☆10Apr 19, 2019Updated 6 years ago
- Gradient accumulation for Keras☆35Jun 27, 2021Updated 4 years ago
- Collection of custom layers and utility functions for Keras which are missing in the main framework.☆62May 25, 2020Updated 5 years ago
- AdaBound optimizer in Keras☆56Jul 11, 2020Updated 5 years ago
- a simple implementation of self attention layer that outputs flattened sentence embedding matrix, with the Frobenius norm penalty☆16Sep 14, 2018Updated 7 years ago
- DropBlock implemented in Keras☆26Jan 22, 2022Updated 4 years ago
- Contains an implementation of the attention mechanism and a keras text classifier wrapper.☆29Sep 18, 2018Updated 7 years ago
- Tensorflow NCE loss in Keras☆34Oct 6, 2018Updated 7 years ago
- Keras implementation of BERT with pre-trained weights☆816Jul 26, 2019Updated 6 years ago
- Keras implementation of AdaBound☆130Nov 4, 2019Updated 6 years ago
- Keras community contributions☆1,586Oct 21, 2022Updated 3 years ago
- 自注意力与文本分类☆119Nov 3, 2018Updated 7 years ago
- Effective sampling methods within TensorFlow input functions.☆10Mar 24, 2023Updated 2 years ago
- 层次注意力机制用于文本分类☆10Mar 17, 2020Updated 5 years ago
- keras implement of transformers for humans☆5,421Nov 11, 2024Updated last year
- A Lite BERT☆60Oct 28, 2019Updated 6 years ago
- Sample data science projects (machine learning, optimization, business intelligence)☆28Aug 12, 2018Updated 7 years ago
- A simple technique to integrate BERT from tf hub to keras☆258Feb 15, 2023Updated 2 years ago
- 12th place solution for Kaggle Corporación Favorita Grocery Sales Forecasting☆15Jan 29, 2018Updated 8 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 7 years ago
- This is a drop-in Keras layer for ELMo embeddings.☆47Dec 29, 2018Updated 7 years ago
- RAdam implemented in Keras & TensorFlow☆325Jan 22, 2022Updated 4 years ago
- Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with Ten…☆362Feb 6, 2024Updated 2 years ago
- Neural Deconvolutions in Tensorflow☆12May 18, 2020Updated 5 years ago
- ☆11Jun 13, 2017Updated 8 years ago