thisiszhou / Transformer-Translate-DemoLinks
A translation model with Transformer, implement by pytorch, which is for learning Transformer.
☆50Updated 4 years ago
Alternatives and similar repositories for Transformer-Translate-Demo
Users that are interested in Transformer-Translate-Demo are comparing it to the libraries listed below
Sorting:
- pytorch分布式训练☆66Updated last year
- dpo算法实现☆38Updated 11 months ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆153Updated 7 months ago
- Inference code for LLaMA models☆121Updated last year
- DeepSpeed Tutorial☆97Updated 9 months ago
- Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型,经过这些年的大量的工业使用和论文验证,在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例,来解释Tran…☆258Updated last year
- RoFormer V1 & V2 pytorch☆498Updated 3 years ago
- ☆90Updated last year
- NumPy实现类PyTorch的动态计算图和神经网络框架(MLP, CNN, RNN, Transformer)☆81Updated 11 months ago
- A Transformer Framework Based Translation Task☆151Updated 3 months ago
- ☆79Updated last year
- 更纯粹、更高压缩率的Tokenizer☆481Updated 6 months ago
- ☆141Updated 3 years ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆100Updated last year
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆98Updated 2 years ago
- 简洁易用版TinyBert:基于Bert进行知识蒸馏的预训练语言模型☆265Updated 4 years ago
- ☆110Updated 11 months ago
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆118Updated last year
- 一个多模态内容理解算法框架,其中包含数据处理、预训练模型、常见模型以及模型加速等模块。☆315Updated 3 years ago
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆436Updated last month
- qwen models finetuning☆98Updated 2 months ago
- All about DeepLearning: 推荐系统、自然语言处理、Tensorflow、Pytorch等☆360Updated 3 weeks ago
- 包含程序员面试大厂面试题和面试经验☆132Updated 2 weeks ago
- 《ChatGPT原理与实战:大型语言模型的算法、技术和私有化》☆357Updated last year
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆164Updated last year
- A text classification example using ddp horovod and accelerate☆33Updated 3 years ago
- 2023全球智能汽车AI挑战赛——赛道一:AI大模型检索问答, 75+ baseline☆57Updated last year
- ☆71Updated 2 weeks ago
- 对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF☆194Updated 2 years ago
- ☆34Updated 5 months ago