Mryangkaitong / big_dataloaderLinks
☆37Updated 3 years ago
Alternatives and similar repositories for big_dataloader
Users that are interested in big_dataloader are comparing it to the libraries listed below
Sorting:
- 文本智能校对大赛(Chinese Text Correction)的baseline☆68Updated 3 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Updated 2 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆69Updated 2 years ago
- RoFormer升级版☆154Updated 3 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆123Updated 6 months ago
- 基于DPO算法微调语言大模型,简单好上手。☆49Updated last year
- 科大讯飞低资源多语种文本翻译挑战赛获奖方案☆27Updated 2 years ago
- Apply the Circular to the Pretraining Model☆38Updated 3 years ago
- Python下shuffle几百G文件☆33Updated 4 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆98Updated 2 years ago
- 我的数据竞赛方案总结☆72Updated 2 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Updated 2 years ago
- 怎么训练一个LLM分词器☆154Updated 2 years ago
- [TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining☆62Updated last year
- CTC2021-中文文本纠错大赛的SOTA方案及在线演示☆73Updated 2 years ago
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆96Updated 10 months ago
- A text classification example using ddp horovod and accelerate☆33Updated 3 years ago
- pytorch分布式训练,支持多机多卡,单机多卡。☆43Updated 4 years ago
- 论文模型复现☆43Updated 3 years ago
- 一个多模态内容理解算法框架,其中包含数据处理、预训练模型、常见模型以及模型加速等模块。☆323Updated 4 years ago
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆120Updated 2 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Updated 2 years ago
- FLASHQuad_pytorch☆68Updated 3 years ago
- lightweighted deep learning inference service framework☆39Updated 4 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC中文语法纠错语料及STG模型☆120Updated last year
- ☆84Updated 2 years ago
- easy-bert是一个中文NLP工具,提供诸多bert变体调用和调参方法,极速上手;清晰的设计和代码注释,也很适合学习☆83Updated 3 years ago
- llama,chatglm 等模型的微调☆91Updated last year
- ChatGPT相关资源汇总☆56Updated 2 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Updated 2 years ago