Mryangkaitong / big_dataloader
☆37Updated 3 years ago
Alternatives and similar repositories for big_dataloader
Users who are interested in big_dataloader are comparing it to the libraries listed below
- PyTorch distributed training, supporting multi-node multi-GPU and single-node multi-GPU setups.☆43Updated 4 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆69Updated 2 years ago
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆118Updated 4 months ago
- Transformer model based on the Gated Attention Unit (preview version)☆98Updated 2 years ago
- Code implementation of Baichuan's Dynamic NTK-ALiBi: inference on longer texts without fine-tuning☆49Updated 2 years ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Updated last year
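- Baseline for the Intelligent Text Proofreading Competition (Chinese Text Correction)☆68Updated 3 years ago
- Award-winning solution for the iFLYTEK Low-Resource Multilingual Text Translation Challenge☆29Updated 2 years ago
- Train a Chinese vocabulary with BPE in sentencepiece and use it in transformers.☆119Updated 2 years ago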
- ICLR2023 - Tailoring Language Generation Models under Total Variation Distance☆21Updated 2 years ago
- Shuffle files of several hundred GB in Python☆33Updated 4 years ago
- A text classification example using DDP, Horovod, and Accelerate☆33Updated 3 years ago
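- How to train an LLM tokenizer☆153Updated 2 years ago
- LoRA fine-tuning of BLOOMZ, following BELLE☆25Updated 2 years ago
- A collection of ChatGPT-related resources☆56Updated 2 years ago
- Upgraded version of RoFormer☆154Updated 3 years ago
- SuperCLUE-Math6: exploring a new-generation, natively Chinese, multi-turn, multi-step mathematical reasoning dataset☆60Updated last year
- Implementation of the DPO algorithm☆47Updated last year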
- [TALLIP] General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining☆59Updated last year
- Reproductions of models from papers☆43Updated 3 years ago
- Implementation of some unbalanced losses such as focal_loss, dice_loss, DSC Loss, GHM Loss, etc.☆267Updated 2 years ago
- The Corpus & Code for EMNLP 2022 paper "FCGEC: Fine-Grained Corpus for Chinese Grammatical Error Correction" | FCGEC Chinese grammatical error correction corpus and STG model☆119Updated 10 months ago
- Instruction fine-tuning of the BLOOM model☆24Updated 2 years ago
- make LLM easier to use☆59Updated 2 years ago
- A simple implementation of text classification using Qwen2ForSequenceClassification.☆82Updated last year
- baichuan and baichuan2 finetuning and alpaca finetuning☆33Updated 7 months ago
- Apply the Circular to the Pretraining Model☆38Updated 3 years ago
- Lion and Adam optimization comparison☆64Updated 2 years ago
- (NBCE) Naive Bayes-based Context Extension on ChatGLM-6b☆15Updated 2 years ago
- Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.☆98Updated last year