bluecamel / best_checkpoint_copier
Tensorflow Exporter that copies the best checkpoints
☆26Updated 2 years ago
Alternatives and similar repositories for best_checkpoint_copier:
Users that are interested in best_checkpoint_copier are comparing it to the libraries listed below
- PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset☆122Updated 5 years ago
- A lightweight class for saving the best Tensorflow checkpoints.☆103Updated 6 years ago
- LAMB Optimizer for Large Batch Training (TensorFlow version)☆120Updated 5 years ago
- Cyclic learning rate TensorFlow implementation.☆66Updated 5 years ago
- TensorFlow code and pre-trained models for BERT☆114Updated 4 years ago
- ☆58Updated 5 years ago
- Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling☆146Updated 4 years ago
- Sampled Softmax Implementation for PyTorch☆43Updated 6 years ago
- Corrupted labels and label smoothing☆128Updated 7 years ago
- Simple Tensorflow Implementation of "A Structured Self-attentive Sentence Embedding" (ICLR 2017)☆91Updated 6 years ago
- Multithreading inference in Tensorflow Estimators. This is a ServiceNow Research project that was started at Element AI.☆57Updated 2 years ago
- Implementation of the LAMB optimizer for Keras from the paper "Reducing BERT Pre-Training Time from 3 Days to 76 Minutes"☆75Updated 5 years ago
- Adaptive Softmax implementation for PyTorch☆80Updated 5 years ago
- Try to use tf.estimator and tf.data together to train a cnn model.☆79Updated 6 years ago
- Transformer-XL with checkpoint loader☆68Updated 3 years ago
- Learning rate multiplier☆46Updated 3 years ago
- Pytorch implementation of OpenAI-GPT for ROC stories☆51Updated 5 years ago
- ☆113Updated 7 years ago
- Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2☆64Updated 2 years ago
- local-context-unit☆55Updated 7 years ago
- Reproducing Densely Interactive Inference Network in Keras☆74Updated 7 years ago
- wrapping a keras optimizer to implement gradient accumulation☆119Updated 4 years ago
- 高性能小模型测评 Shared Tasks in NLPCC 2020. Task 1 - Light Pre-Training Chinese Language Model for NLP Task☆57Updated 4 years ago
- a simple yet complete implementation of the popular BERT model☆127Updated 4 years ago
- ☆24Updated 4 years ago
- The experiment result of LSTM language models on PTB (Penn Treebank) and GBW (Google Billion Word) using AdaptiveSoftmax on TensorFlow.☆100Updated 6 years ago
- An Implementation of Bidirectional Attention Flow☆40Updated 7 years ago
- QANet in keras (with Cove)☆66Updated 5 years ago
- Inference with state-of-the-art models (pre-trained by LD-Net / AutoNER / VanillaNER / ...)☆115Updated 6 years ago
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆106Updated 2 years ago