CyberZHG/keras-transformer-xl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CyberZHG/keras-transformer-xl)

CyberZHG / keras-transformer-xl

Transformer-XL with checkpoint loader

☆67

Alternatives and similar repositories for keras-transformer-xl

Users that are interested in keras-transformer-xl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CyberZHG / keras-adaptive-softmax
View on GitHub
Adaptive embedding and softmax
☆17Jan 22, 2022Updated 4 years ago
CyberZHG / keras-xlnet
View on GitHub
Implementation of XLNet that can load pretrained checkpoints
☆169Jan 22, 2022Updated 4 years ago
CyberZHG / keras-gradient-accumulation
View on GitHub
Gradient accumulation for Keras
☆35Jun 27, 2021Updated 5 years ago
CyberZHG / keras-ordered-neurons
View on GitHub
Ordered Neurons LSTM
☆30Jan 22, 2022Updated 4 years ago
CyberZHG / keras-lookahead
View on GitHub
Lookahead mechanism for optimizers in Keras.
☆50Jun 24, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CyberZHG / keras-transformer
View on GitHub
Transformer implemented in Keras
☆369Jan 22, 2022Updated 4 years ago
CyberZHG / keras-gpt-2
View on GitHub
Load GPT-2 checkpoint and generate texts
☆127Jan 22, 2022Updated 4 years ago
iezhuozhuo / PaperReading
View on GitHub
☆11Sep 3, 2021Updated 4 years ago
CyberZHG / keras-multi-head
View on GitHub
A wrapper layer for stacking layers horizontally
☆228Jan 22, 2022Updated 4 years ago
CyberZHG / keras-lamb
View on GitHub
Layer-wise Adaptive Moments optimizer for Batch training
☆15Apr 3, 2019Updated 7 years ago
kpot / keras-transformer
View on GitHub
Keras library for building (Universal) Transformers, facilitating BERT and GPT models
☆540May 30, 2020Updated 6 years ago
jonashao / written_judgement
View on GitHub
提取出判决书中的金额项和金额数。
☆11Apr 8, 2016Updated 10 years ago
CyberZHG / keras-lr-multiplier
View on GitHub
Learning rate multiplier
☆46Jun 22, 2021Updated 5 years ago
ParikhKadam / bidaf-keras
View on GitHub
Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2
☆63Nov 21, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
philipperemy / keras-snail-attention
View on GitHub
SNAIL Attention Block for Keras.
☆17Mar 30, 2020Updated 6 years ago
CyberZHG / keras-layer-normalization
View on GitHub
Layer normalization implemented in Keras
☆60Jan 22, 2022Updated 4 years ago
BrikerMan / Kashgari-doc-zh
View on GitHub
Kashgari 框架的中文文档
☆22Sep 11, 2020Updated 5 years ago
bojone / keras_lookahead
View on GitHub
lookahead optimizer for keras
☆168Oct 14, 2019Updated 6 years ago
CyberZHG / keras-bert
View on GitHub
Implementation of BERT that could load official pre-trained models for feature extraction and prediction
☆2,420Jan 22, 2022Updated 4 years ago
CyberZHG / keras-self-attention
View on GitHub
Attention mechanism for processing sequential data that considers the context for each timestamp.
☆657Jan 22, 2022Updated 4 years ago
CyberZHG / keras-pos-embd
View on GitHub
Position embedding layers in Keras
☆58Jan 22, 2022Updated 4 years ago
kpe / bert-for-tf2
View on GitHub
A Keras TensorFlow 2.0 implementation of BERT, ALBERT and adapter-BERT.
☆807Jan 13, 2023Updated 3 years ago
CyberZHG / keras-trans-mask
View on GitHub
Remove and restore masks for layers that do not support masking
☆16Jan 22, 2022Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
tatsuokun / seq2seq
View on GitHub
keras encoder-decoder
☆17Apr 3, 2018Updated 8 years ago
CyberZHG / keras-radam
View on GitHub
RAdam implemented in Keras & TensorFlow
☆324Jan 22, 2022Updated 4 years ago
kimiyoung / transformer-xl
View on GitHub
☆3,711Sep 21, 2022Updated 3 years ago
enningxie / transformers-with-onnx
View on GitHub
Accelerate Transformers pipelines using ONNX Runtime.
☆10Jun 5, 2020Updated 6 years ago
NLP-Deeplearning-Club / keras_attention_block
View on GitHub
attention block for keras Functional Model with only tensorflow backend
☆26Apr 13, 2019Updated 7 years ago
CyberZHG / keras-adabound
View on GitHub
AdaBound optimizer in Keras
☆56Jul 11, 2020Updated 6 years ago
benkrause / dynamiceval-transformer
View on GitHub
☆47Apr 12, 2019Updated 7 years ago
YunaQiu / ogeek
View on GitHub
ogeek算法挑战赛方案
☆22Dec 4, 2018Updated 7 years ago
bojone / accum_optimizer_for_keras
View on GitHub
wrapping a keras optimizer to implement gradient accumulation
☆118Aug 29, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
BrikerMan / Kashgari
View on GitHub
Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includ…
☆2,382Sep 3, 2024Updated last year
Separius / BERT-keras
View on GitHub
Keras implementation of BERT with pre-trained weights
☆813Jul 26, 2019Updated 7 years ago
uwnlp / qasrl-bank
View on GitHub
Central repository for QA-SRL data.
☆21Feb 13, 2021Updated 5 years ago
chatstack-ai / Chatstack-Doc
View on GitHub
Documentation for Chatstack: A Full Pipeline UI for building Chinese NLU System
☆18Sep 7, 2019Updated 6 years ago
CyberZHG / keras-gcn
View on GitHub
Graph convolutional layers
☆62Jan 22, 2022Updated 4 years ago
titu1994 / keras_rectified_adam
View on GitHub
Implementation of Rectified Adam in Keras
☆70Aug 24, 2019Updated 6 years ago
Holmes-Alan / SR-VAE
View on GitHub
SR-VAE
☆10Jul 26, 2021Updated 5 years ago