A wrapper layer for stacking layers horizontally
☆228Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for keras-multi-head
Users that are interested in keras-multi-head are comparing it to the libraries listed below
Sorting:
- Attention mechanism for processing sequential data that considers the context for each timestamp.☆657Jan 22, 2022Updated 4 years ago
- Transformer implemented in Keras☆369Jan 22, 2022Updated 4 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆541May 30, 2020Updated 5 years ago
- Transformer-XL with checkpoint loader☆67Jan 22, 2022Updated 4 years ago
- Layer normalization implemented in Keras☆60Jan 22, 2022Updated 4 years ago
- Position embedding layers in Keras☆58Jan 22, 2022Updated 4 years ago
- Keras Attention Layer (Luong and Bahdanau scores).☆2,816Nov 17, 2023Updated 2 years ago
- Adaptive embedding and softmax☆17Jan 22, 2022Updated 4 years ago
- Lookahead mechanism for optimizers in Keras.☆50Jun 24, 2021Updated 4 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction☆2,425Jan 22, 2022Updated 4 years ago
- SNAIL Attention Block for Keras.☆17Mar 30, 2020Updated 5 years ago
- Tensorflow port implementation of Single Headed Attention RNN☆16Feb 1, 2020Updated 6 years ago
- Gradient accumulation for Keras☆35Jun 27, 2021Updated 4 years ago
- Collection of custom layers and utility functions for Keras which are missing in the main framework.☆62May 25, 2020Updated 5 years ago
- How to use ELMo embeddings in Keras with Tensorflow Hub☆260Dec 18, 2018Updated 7 years ago
- AdaBound optimizer in Keras☆56Jul 11, 2020Updated 5 years ago
- Binary and Categorical Focal loss implementation in Keras.☆281Dec 20, 2024Updated last year
- Contains an implementation of the attention mechanism and a keras text classifier wrapper.☆29Sep 18, 2018Updated 7 years ago
- Keras Temporal Convolutional Network. Supports Python and R.☆2,000Apr 8, 2025Updated 10 months ago
- Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers☆169Jan 6, 2022Updated 4 years ago
- some attention implements☆1,452Nov 20, 2019Updated 6 years ago
- Keras implementation of BERT with pre-trained weights☆815Jul 26, 2019Updated 6 years ago
- Keras implementation of AdaBound☆130Nov 4, 2019Updated 6 years ago
- A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.☆808Jan 13, 2023Updated 3 years ago
- Keras community contributions☆1,585Oct 21, 2022Updated 3 years ago
- 自注意力与文本分类☆119Nov 3, 2018Updated 7 years ago
- Effective sampling methods within TensorFlow input functions.☆10Mar 24, 2023Updated 2 years ago
- 层次注意力机制用于文本分类☆10Mar 17, 2020Updated 5 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Apr 30, 2021Updated 4 years ago
- keras implement of transformers for humans☆5,424Nov 11, 2024Updated last year
- A Lite BERT☆60Oct 28, 2019Updated 6 years ago
- Keras implementation of ABCNN by Yin & Schütze (WIP)☆23Jun 16, 2020Updated 5 years ago
- Sample data science projects (machine learning, optimization, business intelligence)☆28Aug 12, 2018Updated 7 years ago
- A Hyperparameter Tuning Library for Keras☆2,918Dec 1, 2025Updated 3 months ago
- A simple technique to integrate BERT from tf hub to keras☆258Feb 15, 2023Updated 3 years ago
- 12th place solution for Kaggle Corporación Favorita Grocery Sales Forecasting☆15Jan 29, 2018Updated 8 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 7 years ago
- A more elegant and convenient CRF built on tensorflow-addons.☆28Sep 18, 2021Updated 4 years ago
- This is a drop-in Keras layer for ELMo embeddings.☆47Dec 29, 2018Updated 7 years ago