A wrapper layer for stacking layers horizontally
☆228Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for keras-multi-head
Users that are interested in keras-multi-head are comparing it to the libraries listed below.
- Attention mechanism for processing sequential data that considers the context for each timestamp.☆657Jan 22, 2022Updated 4 years ago
- Transformer implemented in Keras☆368Jan 22, 2022Updated 4 years ago
- Transformer-XL with checkpoint loader☆67Jan 22, 2022Updated 4 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆541May 30, 2020Updated 6 years ago
- Keras Attention Layer (Luong and Bahdanau scores).☆2,811Mar 12, 2026Updated 3 months ago
- Adaptive embedding and softmax☆17Jan 22, 2022Updated 4 years ago
- Lookahead mechanism for optimizers in Keras.☆50Jun 24, 2021Updated 4 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction☆2,421Jan 22, 2022Updated 4 years ago
- Tensorflow port implementation of Single Headed Attention RNN☆16Feb 1, 2020Updated 6 years ago
- Keras Layer implementation of Attention for Sequential models☆444Mar 25, 2023Updated 3 years ago
- Data preprocessing: fill missing values by interpolation and mark the filled positions☆10Apr 19, 2019Updated 7 years ago
- A Keras+TensorFlow Implementation of the Transformer: Attention Is All You Need☆719Sep 24, 2021Updated 4 years ago
- Collection of custom layers and utility functions for Keras which are missing in the main framework.☆62May 25, 2020Updated 6 years ago
- An Attention Layer in Keras☆43Apr 23, 2019Updated 7 years ago
- SNAIL Attention Block for Keras.☆17Mar 30, 2020Updated 6 years ago
- Contains an implementation of the attention mechanism and a keras text classifier wrapper.☆29Sep 18, 2018Updated 7 years ago
- Some attention implementations☆1,450Nov 20, 2019Updated 6 years ago
- A Transformer implementation in Keras' Imperative (Subclassing) API for TensorFlow.☆55Aug 16, 2019Updated 6 years ago
- Keras Temporal Convolutional Network. Supports Python and R.☆2,008Mar 11, 2026Updated 3 months ago
- Keras implementation of BERT with pre-trained weights☆813Jul 26, 2019Updated 6 years ago
- A simple technique to integrate BERT from tf hub to keras☆258Feb 15, 2023Updated 3 years ago
- This is a drop-in Keras layer for ELMo embeddings.☆47Dec 29, 2018Updated 7 years ago
- TensorFlow implementation of several popular Graph Neural Network layers, wrapped with tf.keras.layers.Layer.☆20Aug 25, 2019Updated 6 years ago
- Keras implementation of transformers for humans☆5,418Nov 11, 2024Updated last year
- A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.☆807Jan 13, 2023Updated 3 years ago
- 12th place solution for Kaggle Corporación Favorita Grocery Sales Forecasting☆15Jan 29, 2018Updated 8 years ago
- Graph convolutional layers☆62Jan 22, 2022Updated 4 years ago
- Implementations for a family of attention mechanisms, suitable for all kinds of natural language processing tasks and compatible with Ten…☆381Feb 6, 2024Updated 2 years ago
- RAdam implemented in Keras & TensorFlow☆324Jan 22, 2022Updated 4 years ago
- Keras community contributions☆1,585Oct 21, 2022Updated 3 years ago
- A simple implementation of Transformer Encoder in keras. This repository also includes an example of Transformer as a classifier and its …☆16Apr 9, 2019Updated 7 years ago
- Keras/TF implementation of AdamW, SGDW, NadamW, Warm Restarts, and Learning Rate multipliers☆168Jan 6, 2022Updated 4 years ago
- Distributed Tensorflow best practices template using Tensorflow Estimator API☆17Mar 19, 2019Updated 7 years ago
- Keras Bi-LSTM-CRF for sequence tagging☆34Aug 6, 2018Updated 7 years ago
- Layer-wise Adaptive Moments optimizer for Batch training☆15Apr 3, 2019Updated 7 years ago
- ☆11May 5, 2023Updated 3 years ago
- Keras implementation of AdaBound☆130Nov 4, 2019Updated 6 years ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".☆16Aug 14, 2020Updated 5 years ago
- A Hyperparameter Tuning Library for Keras☆2,924Dec 1, 2025Updated 6 months ago