bojone / keras_radamLinks
RAdam optimizer for keras
☆71Updated 5 years ago
Alternatives and similar repositories for keras_radam
Users that are interested in keras_radam are comparing it to the libraries listed below
Sorting:
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆58Updated 5 years ago
- Ordered Neurons LSTM☆30Updated 3 years ago
- wrapping a keras optimizer to implement gradient accumulation☆119Updated 4 years ago
- This is our solution for WSDM - DiggSci 2020. We implemented a simple yet robust search pipeline which ranked 2nd in the validation set a…☆63Updated 5 years ago
- bert on Jigsaw Unintended Bias in Toxicity Classification☆50Updated 6 years ago
- Keras implement of Lazy optimizer☆21Updated 5 years ago
- Try to use tf.estimator and tf.data together to train a cnn model.☆79Updated 7 years ago
- ☆12Updated 7 years ago
- Source code for "Training Generative Adversarial Networks Via Turing Test".☆13Updated 5 years ago
- A Tensorflow implementation of Yin Wenpeng's recent paper on TACL "Attentive Convolution"☆33Updated 6 years ago
- Chinese Natural Language Correction via Language Model☆14Updated 7 years ago
- Transformer-XL with checkpoint loader☆68Updated 3 years ago
- BERT Extension in TensorFlow☆30Updated 5 years ago
- lookahead optimizer for keras☆170Updated 5 years ago
- Position embedding layers in Keras☆58Updated 3 years ago
- bert-of-theseus via bert4keras☆31Updated 4 years ago
- ai challenger Competitions 1: Fine-grained Sentiment Analysis of User Reviews☆18Updated 6 years ago
- Adaptive embedding and softmax☆17Updated 3 years ago
- Text classification models: cnn, self-attention, cnn-rnf, rnn-att, capsule-net. TensorFlow. Single GPU or multi GPU☆19Updated 5 years ago
- Official code of our work, Robust, Transferable Sentence Representations for Text Classification [Arxiv 2018].☆21Updated 6 years ago
- machine reading comprehension with deep learning☆20Updated 7 years ago
- Multithreading inference in Tensorflow Estimators. This is a ServiceNow Research project that was started at Element AI.☆57Updated 3 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 6 years ago
- codes for ai challenger 2018 machine reading comprehension☆27Updated 6 years ago
- saving memory by recomputing for keras☆37Updated 5 years ago
- 2019达观杯实体识别☆19Updated 5 years ago
- Kaggle Competition: Using deep learning to solve quora's question pairs problem☆54Updated 8 years ago
- 一些不同的Attention机制代码☆20Updated 5 years ago
- QANet in keras (with Cove)☆66Updated 6 years ago
- Tensorflow Implementation of Densely Connected Bidirectional LSTM with Applications to Sentence Classification☆47Updated 7 years ago