A wrapper layer for stacking layers horizontally
☆228Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for keras-multi-head
Users that are interested in keras-multi-head are comparing it to the libraries listed below.
- Attention mechanism for processing sequential data that considers the context for each timestamp.☆657Jan 22, 2022Updated 4 years ago
- Transformer implemented in Keras☆368Jan 22, 2022Updated 4 years ago
- Layer normalization implemented in Keras☆60Jan 22, 2022Updated 4 years ago
- Transformer-XL with checkpoint loader☆67Jan 22, 2022Updated 4 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆541May 30, 2020Updated 5 years ago
- Position embedding layers in Keras☆58Jan 22, 2022Updated 4 years ago
- Keras Attention Layer (Luong and Bahdanau scores).☆2,812Mar 12, 2026Updated 2 months ago
- Lookahead mechanism for optimizers in Keras.☆50Jun 24, 2021Updated 4 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction☆2,424Jan 22, 2022Updated 4 years ago
- DropBlock implemented in Keras☆26Jan 22, 2022Updated 4 years ago
- Keras Layer implementation of Attention for Sequential models☆444Mar 25, 2023Updated 3 years ago
- AdaBound optimizer in Keras☆56Jul 11, 2020Updated 5 years ago
- How to use ELMo embeddings in Keras with Tensorflow Hub☆259Dec 18, 2018Updated 7 years ago
- A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need☆718Sep 24, 2021Updated 4 years ago
- Binary and Categorical Focal loss implementation in Keras.☆279Dec 20, 2024Updated last year
- SNAIL Attention Block for Keras.☆17Mar 30, 2020Updated 6 years ago
- some attention implements☆1,450Nov 20, 2019Updated 6 years ago
- An implementation of Bisecting KMeans Clustering which is a kind of Hierarchical Clustering algorithm on Spark☆12Dec 2, 2015Updated 10 years ago
- Keras implementation of BERT with pre-trained weights☆815Jul 26, 2019Updated 6 years ago
- A simple technique to integrate BERT from tf hub to keras☆258Feb 15, 2023Updated 3 years ago
- Self-attention and text classification☆119Nov 3, 2018Updated 7 years ago
- This is a drop-in Keras layer for ELMo embeddings.☆47Dec 29, 2018Updated 7 years ago
- SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution☆23Aug 15, 2019Updated 6 years ago
- keras implement of transformers for humans☆5,419Nov 11, 2024Updated last year
- A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.☆807Jan 13, 2023Updated 3 years ago
- 12th place solution for Kaggle Corporación Favorita Grocery Sales Forecasting☆15Jan 29, 2018Updated 8 years ago
- Implementation of Rectified Adam in Keras☆70Aug 24, 2019Updated 6 years ago
- ☆11Jun 13, 2017Updated 8 years ago
- Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with Ten…☆380Feb 6, 2024Updated 2 years ago
- RAdam implemented in Keras & TensorFlow☆324Jan 22, 2022Updated 4 years ago
- Keras community contributions☆1,584Oct 21, 2022Updated 3 years ago
- Implementation for QANet using Keras with Tensorflow backend☆12Dec 13, 2018Updated 7 years ago
- Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers☆169Jan 6, 2022Updated 4 years ago
- Keras Bi-LSTM-CRF for sequence tagging☆34Aug 6, 2018Updated 7 years ago
- Layer-wise Adaptive Moments optimizer for Batch training☆15Apr 3, 2019Updated 7 years ago
- Keras implementation of AdaBound☆130Nov 4, 2019Updated 6 years ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".☆16Aug 14, 2020Updated 5 years ago
- A Hyperparameter Tuning Library for Keras☆2,924Dec 1, 2025Updated 5 months ago
- Temporal Pattern Attention for Multivariate Time Series Forecasting☆734Nov 29, 2018Updated 7 years ago