WGS-note / transformer-noteLinks
☆17Updated 3 years ago
Alternatives and similar repositories for transformer-note
Users that are interested in transformer-note are comparing it to the libraries listed below
Sorting:
- An implementation of the BERT model and its related downstream tasks based on the PyTorch framework. @月来客栈☆607Updated 2 months ago
- A Transformer Framework Based Translation Task☆155Updated 4 months ago
- Chinese-Text-Classification Project including bert-classification, textCNN and so on.☆160Updated 3 years ago
- 超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新☆417Updated 3 years ago
- Bert源码阅读与讲解(Pytorch版本)-以BERT文本分类代码为例子☆658Updated 3 years ago
- This is a repository for a few projects built in torch.☆44Updated 2 months ago
- OpenAI GPT的简单复现☆21Updated 10 months ago
- personal chatgpt☆386Updated 10 months ago
- ☆163Updated 2 years ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆178Updated 2 years ago
- 使用LoRA对ChatGLM进行微调。☆49Updated 2 years ago
- ☆391Updated 8 months ago
- SimCSE中文语义 相似度对比学习模型☆89Updated 3 years ago
- 学习深度学习不如边写代码边学习,实际操作一遍才能理解数据的变换过程,参数的训练过程,这里整合了B站的jupter代码,可以结合着B站的视频边看边练,希望能对大家有帮助。☆138Updated 3 years ago
- RoFormer V1 & V2 pytorch☆512Updated 3 years ago
- Transformer在CV和NLP领域的变体模型的从零解读:Transformer;VIT;Swin Transformer☆336Updated 3 years ago
- DeepSpeed Tutorial☆102Updated last year
- 基于pytorch+bert的中文文本分类☆90Updated 2 years ago
- 对llama3进行全参微调、lora微调以及qlora微调。☆210Updated last year
- Inference code for LLaMA models☆125Updated 2 years ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆159Updated last year
- ☆75Updated 4 months ago
- pytorch分布式训练☆70Updated 2 years ago
- seq_2_seq text generation based on transformers☆22Updated 4 years ago
- 该仓库主要记录 NLP 算法工程师相关的顶会论文研读笔记【Bert篇】☆13Updated 2 years ago
- Huggingface transformers的中文文档☆273Updated last year
- Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型,经过这些年的大量的工业使用和论文验证,在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例,来解释Tran…☆281Updated last year
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆122Updated last year
- A PyTorch implementation of a BiLSTM \ BERT \ Roberta (+ BiLSTM + CRF) model for Chinese Word Segmentation (中文分词) .☆214Updated 3 years ago
- 《跟我一起深度学习》@月来客栈 出品☆230Updated 2 months ago