Transformer-XL with checkpoint loader
☆67Jan 22, 2022Updated 4 years ago
Alternatives and similar repositories for keras-transformer-xl
Users who are interested in keras-transformer-xl are comparing it to the libraries listed below.
- Adaptive embedding and softmax☆17Jan 22, 2022Updated 4 years ago
- Implementation of XLNet that can load pretrained checkpoints☆169Jan 22, 2022Updated 4 years ago
- Gradient accumulation for Keras☆35Jun 27, 2021Updated 4 years ago
- Ordered Neurons LSTM☆30Jan 22, 2022Updated 4 years ago
- Transformer implemented in Keras☆369Jan 22, 2022Updated 4 years ago
- ☆11Sep 3, 2021Updated 4 years ago
- A wrapper layer for stacking layers horizontally☆228Jan 22, 2022Updated 4 years ago
- Layer-wise Adaptive Moments optimizer for Batch training☆15Apr 3, 2019Updated 6 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆541May 30, 2020Updated 5 years ago
- Learning rate multiplier☆46Jun 22, 2021Updated 4 years ago
- Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2☆63Nov 21, 2022Updated 3 years ago
- An Attention Layer in Keras☆43Apr 23, 2019Updated 6 years ago
- Layer normalization implemented in Keras☆60Jan 22, 2022Updated 4 years ago
- Chinese documentation for the Kashgari framework☆22Sep 11, 2020Updated 5 years ago
- lookahead optimizer for keras☆169Oct 14, 2019Updated 6 years ago
- Implementation of BERT that could load official pre-trained models for feature extraction and prediction☆2,426Jan 22, 2022Updated 4 years ago
- Extracts monetary items and amounts from court judgments.☆11Apr 8, 2016Updated 9 years ago
- Attention mechanism for processing sequential data that considers the context for each timestamp.☆657Jan 22, 2022Updated 4 years ago
- Position embedding layers in Keras☆58Jan 22, 2022Updated 4 years ago
- A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.☆808Jan 13, 2023Updated 3 years ago
- Remove and restore masks for layers that do not support masking☆16Jan 22, 2022Updated 4 years ago
- keras encoder-decoder☆17Apr 3, 2018Updated 7 years ago
- Using Spatial Transformer Layer with keras (theano backend).☆12Jun 7, 2016Updated 9 years ago
- RAdam implemented in Keras & TensorFlow☆324Jan 22, 2022Updated 4 years ago
- Accelerate Transformers pipelines using ONNX Runtime.☆10Jun 5, 2020Updated 5 years ago
- attention block for keras Functional Model with only tensorflow backend☆26Apr 13, 2019Updated 6 years ago
- Dataset Catalogue Homepage for Indonesian Languages☆10Feb 19, 2024Updated 2 years ago
- AdaBound optimizer in Keras☆56Jul 11, 2020Updated 5 years ago
- ☆47Apr 12, 2019Updated 6 years ago
- Converts Baidu's ERNIE PaddlePaddle model to a TensorFlow model☆179Oct 12, 2019Updated 6 years ago
- Solution for the OGeek algorithm challenge☆21Dec 4, 2018Updated 7 years ago
- Calculate similarity with embedding☆11Jan 22, 2022Updated 4 years ago
- wrapping a keras optimizer to implement gradient accumulation☆119Aug 29, 2020Updated 5 years ago
- Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includ…☆2,387Sep 3, 2024Updated last year
- Keras implementation of BERT with pre-trained weights☆815Jul 26, 2019Updated 6 years ago
- Documentation for Chatstack: A Full Pipeline UI for building Chinese NLU System☆18Sep 7, 2019Updated 6 years ago
- Central repository for QA-SRL data.☆21Feb 13, 2021Updated 5 years ago
- Graph convolutional layers☆62Jan 22, 2022Updated 4 years ago
- Implementation of Rectified Adam in Keras☆70Aug 24, 2019Updated 6 years ago