Lisennlp / distributed_train_pytorchLinks
pytorch分布式训练,支持多机多卡,单机多卡。
☆41Updated 3 years ago
Alternatives and similar repositories for distributed_train_pytorch
Users that are interested in distributed_train_pytorch are comparing it to the libraries listed below
Sorting:
- lightweighted deep learning inference service framework☆39Updated 3 years ago
- 论文模型复现☆42Updated 2 years ago
- 基于BERT的预训练语言模型实现,分为两步:预训练和微调。目前已包括BERT、Roberta、ALbert三个模型,且皆可支持Whole Word Mask模式。☆16Updated 5 years ago
- RoFormer升级版☆152Updated 2 years ago
- intent detection and slot filling 意图识别与槽填充联合模型☆39Updated 2 years ago
- 不用tensorflow estimator,分别采用字mask和wwm mask在中文领域内finetune bert模型☆23Updated 5 years ago
- FLASHQuad_pytorch☆67Updated 3 years ago
- A text classification example using ddp horovod and accelerate☆33Updated 3 years ago
- using lear to do ner extraction☆29Updated 3 years ago
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆118Updated last year
- ☆90Updated last year
- Apply the Circular to the Pretraining Model☆37Updated 3 years ago
- 基于seq2edit (Gector) 的中文文本纠错。☆28Updated 2 years ago
- This is the official code for paper titled "Exploration and Exploitation: Two Ways to Improve Chinese Spelling Correction Models".☆68Updated 4 years ago
- ☆25Updated 2 years ago
- This repo contains some experiments of text matching on Chinese dataset LCQMC☆27Updated 5 years ago
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆72Updated last year
- 适用于常见的NLP任务的模板☆34Updated 2 years ago
- ☆67Updated 2 years ago
- BLOOM 模型的指令微调☆24Updated last year
- R-Drop方法在中文任务上的简单实验☆91Updated 3 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Updated 2 years ago
- 文本智能校对大赛(Chinese Text Correction)的baseline☆67Updated 2 years ago
- Text Matching Based on LCQMC: A Large-scale Chinese Question Matching Corpus☆15Updated 4 years ago
- 简洁易用版TinyBert:基于Bert进行知识蒸馏的预训练语言模型☆265Updated 4 years ago
- 基于英雄联盟知识图谱的问答☆9Updated 2 years ago
- 音乐类语料的意图识别填槽以及槽值纠错模型☆16Updated 2 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆68Updated last year
- 在苏剑林老师的代码上改了一下,改成了python3.6,基于膨胀卷积,字词混合向量,radam梯度优化算法,百度百科词向量的阅读理解模型☆23Updated 5 years ago
- this is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large☆65Updated 5 years ago