Transformer-XL with checkpoint loader
☆67Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for keras-transformer-xl
Users that are interested in keras-transformer-xl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of XLNet that can load pretrained checkpoints☆169Jan 22, 2022Updated 4 years ago
- Gradient accumulation for Keras☆35Jun 27, 2021Updated 4 years ago
- Ordered Neurons LSTM☆30Jan 22, 2022Updated 4 years ago
- Lookahead mechanism for optimizers in Keras.☆50Jun 24, 2021Updated 4 years ago
- Transformer implemented in Keras☆368Jan 22, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Load GPT-2 checkpoint and generate texts☆127Jan 22, 2022Updated 4 years ago
- ☆11Sep 3, 2021Updated 4 years ago
- A wrapper layer for stacking layers horizontally☆228Jan 22, 2022Updated 4 years ago
- Layer-wise Adaptive Moments optimizer for Batch training☆15Apr 3, 2019Updated 7 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆541May 30, 2020Updated 5 years ago
- Learning rate multiplier☆46Jun 22, 2021Updated 4 years ago
- SNAIL Attention Block for Keras.☆17Mar 30, 2020Updated 6 years ago
- Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2☆63Nov 21, 2022Updated 3 years ago
- An Attention Layer in Keras☆43Apr 23, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Layer normalization implemented in Keras☆60Jan 22, 2022Updated 4 years ago
- Kashgari 框架的中文文档☆22Sep 11, 2020Updated 5 years ago
- lookahead optimizer for keras☆169Oct 14, 2019Updated 6 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction☆2,426Jan 22, 2022Updated 4 years ago
- 提取出判决书中的金额项和金额数。☆11Apr 8, 2016Updated 10 years ago
- Attention mechanism for processing sequential data that considers the context for each timestamp.☆657Jan 22, 2022Updated 4 years ago
- Position embedding layers in Keras☆58Jan 22, 2022Updated 4 years ago
- A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.☆808Jan 13, 2023Updated 3 years ago
- Remove and restore masks for layers that do not support masking☆16Jan 22, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- keras encoder-decoder☆17Apr 3, 2018Updated 8 years ago
- RAdam implemented in Keras & TensorFlow☆324Jan 22, 2022Updated 4 years ago
- ☆3,702Sep 21, 2022Updated 3 years ago
- ☆12Nov 15, 2018Updated 7 years ago
- attention block for keras Functional Model with only tensorflow backend☆26Apr 13, 2019Updated 7 years ago
- AdaBound optimizer in Keras☆56Jul 11, 2020Updated 5 years ago
- Calculate similarity with embedding☆11Jan 22, 2022Updated 4 years ago
- wrapping a keras optimizer to implement gradient accumulation☆119Aug 29, 2020Updated 5 years ago
- Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includ…☆2,384Sep 3, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Keras implementation of BERT with pre-trained weights☆815Jul 26, 2019Updated 6 years ago
- A minimal Typescript library for converting a json-rules-engine condition to a JsonLogic rule specification.☆11Jul 6, 2022Updated 3 years ago
- Documentation for Chatstack: A Full Pipeline UI for building Chinese NLU System☆18Sep 7, 2019Updated 6 years ago
- Central repository for QA-SRL data.☆21Feb 13, 2021Updated 5 years ago
- Graph convolutional layers☆62Jan 22, 2022Updated 4 years ago
- Implementation of Rectified Adam in Keras☆70Aug 24, 2019Updated 6 years ago
- Collection of custom layers and utility functions for Keras which are missing in the main framework.☆62May 25, 2020Updated 5 years ago