bojone / keras_radam
RAdam optimizer for keras
☆71Updated 5 years ago
Alternatives and similar repositories for keras_radam:
Users that are interested in keras_radam are comparing it to the libraries listed below
- Keras implementation of Lazy optimizer☆21Updated 5 years ago
- wrapping a keras optimizer to implement gradient accumulation☆119Updated 4 years ago
- AI Challenger competition 1: Fine-grained Sentiment Analysis of User Reviews☆18Updated 6 years ago
- lookahead optimizer for keras☆170Updated 5 years ago
- High-performance small model evaluation: Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆58Updated 4 years ago
- Position embedding layers in Keras☆58Updated 3 years ago
- Ordered Neurons LSTM☆30Updated 3 years ago
- machine reading comprehension with deep learning☆20Updated 7 years ago
- Try to use tf.estimator and tf.data together to train a cnn model.☆79Updated 6 years ago
- This is our solution for WSDM - DiggSci 2020. We implemented a simple yet robust search pipeline which ranked 2nd in the validation set a…☆63Updated 4 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 4 years ago
- bert on Jigsaw Unintended Bias in Toxicity Classification☆50Updated 6 years ago
- ☆12Updated 7 years ago
- A Tensorflow implementation of Yin Wenpeng's recent paper on TACL "Attentive Convolution"☆33Updated 6 years ago
- adafactor optimizer for keras☆20Updated 3 years ago
- Transformer-XL with checkpoint loader☆68Updated 3 years ago
- 2019 Daguan Cup entity recognition☆19Updated 5 years ago
- ☆86Updated 2 years ago
- Adaptive embedding and softmax☆17Updated 3 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- Official code of our work, Robust, Transferable Sentence Representations for Text Classification [Arxiv 2018].☆21Updated 6 years ago
- My 1st place solution at WSDM 2019 cup for fake news classification☆44Updated 5 years ago
- kaggle-petfinder-adoption-prediction-10th-solution 10/2023☆13Updated 6 years ago
- Linear chain conditional random fields are implemented using Numpy and Mxnet/Gluon, and batch training is supported, not limited to train…☆23Updated 6 years ago
- A TensorFlow implementation of "QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension"☆31Updated 6 years ago
- saving memory by recomputing for keras☆37Updated 5 years ago
- Layer normalization implemented in Keras☆60Updated 3 years ago
- BERT Extension in TensorFlow☆30Updated 5 years ago
- Lookahead mechanism for optimizers in Keras.☆49Updated 3 years ago
- OGeek algorithm challenge solution☆21Updated 6 years ago