WangHuiNEU / Transformer_Knowlegde
从底层机理了解Transformer
☆25Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for Transformer_Knowlegde
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆97Updated last year
- A light-weight script for maintaining a LOT of machine learning experiments.☆90Updated 2 years ago
- ☆155Updated last month
- The Roadmap for LLMs☆84Updated last year
- ☆51Updated last year
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆118Updated last year
- Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models☆53Updated 5 months ago
- A paper list about diffusion models for natural language processing.☆174Updated last year
- A Tight-fisted Optimizer☆47Updated last year
- The pure and clear PyTorch Distributed Training Framework.☆275Updated 10 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆52Updated 7 months ago
- Lion and Adam optimization comparison☆56Updated last year
- Yet another PyTorch Trainer and some core components for deep learning.☆206Updated 6 months ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆147Updated last month
- 实现了Transformer中的几种位置编码方案☆38Updated 3 years ago
- NTK scaled version of ALiBi position encoding in Transformer.☆67Updated last year
- ☆85Updated 2 weeks ago
- RoFormer V1 & V2 pytorch☆474Updated 2 years ago
- FLASHQuad_pytorch☆66Updated 2 years ago
- 使用sentencepiece中BPE训练中文词表,并在transformers中进行使用。☆111Updated last year
- The official repo of INF-34B models trained by INF Technology.☆34Updated 4 months ago
- ☆17Updated last year
- A paper list of pre-trained language models (PLMs).☆79Updated 2 years ago
- Rectified Rotary Position Embeddings☆341Updated 6 months ago
- QQ浏览器2021AI算法大赛赛道一 第1名 方案☆264Updated 2 years ago
- 更纯粹、更高压缩率的Tokenizer☆454Updated 7 months ago
- ☆30Updated this week
- ☆147Updated 4 months ago
- ☆82Updated last year
- ☆64Updated 2 years ago