bojone / keras_radam
RAdam optimizer for keras
☆71Updated 5 years ago
Alternatives and similar repositories for keras_radam:
Users who are interested in keras_radam are comparing it to the libraries listed below
- wrapping a keras optimizer to implement gradient accumulation☆119Updated 4 years ago
- Keras implementation of Lazy optimizer☆21Updated 5 years ago
- Ordered Neurons LSTM☆30Updated 3 years ago
- High-performance small model evaluation: Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆58Updated 4 years ago
- This is our solution for WSDM - DiggSci 2020. We implemented a simple yet robust search pipeline which ranked 2nd in the validation set a…☆62Updated 4 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- An attempt to use tf.estimator and tf.data together to train a CNN model.☆79Updated 6 years ago
- machine reading comprehension with deep learning☆20Updated 7 years ago
- bert on Jigsaw Unintended Bias in Toxicity Classification☆50Updated 5 years ago
- lookahead optimizer for keras☆170Updated 5 years ago
- Transformer-XL with checkpoint loader☆68Updated 3 years ago
- Part of the 7th solution of the Kaggle Tweet Sentiment Extraction competition☆23Updated 4 years ago
- Chinese Natural Language Correction via Language Model☆14Updated 7 years ago
- 2019 Daguan Cup entity recognition☆19Updated 5 years ago
- Official code of our work, Robust, Transferable Sentence Representations for Text Classification [Arxiv 2018].☆21Updated 6 years ago
- Stochastic Weight Averaging in Keras☆10Updated 6 years ago
- Multithreading inference in Tensorflow Estimators. This is a ServiceNow Research project that was started at Element AI.☆57Updated 2 years ago
- adafactor optimizer for keras☆20Updated 3 years ago
- Position embedding layers in Keras☆58Updated 3 years ago
- A Tensorflow implementation of Yin Wenpeng's TACL paper "Attentive Convolution"☆33Updated 6 years ago
- AI Challenger Competitions 1: Fine-grained Sentiment Analysis of User Reviews☆18Updated 6 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 4 years ago
- Keras Conv+BiLSTM for Named Entity Recognition☆24Updated 7 years ago
- [ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.☆75Updated 5 years ago
- Simple Tensorflow Implementation of "A Structured Self-attentive Sentence Embedding" (ICLR 2017)☆91Updated 6 years ago
- Tensorflow version implementation of focal loss for binary and multi classification☆110Updated 6 years ago