ZouJiu1 / numpy_transformer
A Transformer implemented in NumPy: a Vision Transformer (ViT) that reaches 97.2% precision on the MNIST test set, with multi-head attention, patch embedding, position embedding, fully connected layers, convolution, etc. It trains normally and supports saving and restoring the model.
☆12Updated 8 months ago
Alternatives and similar repositories for numpy_transformer
Users who are interested in numpy_transformer are comparing it to the libraries listed below.
- Train a LLaVA model with better Chinese support, with open-sourced training code and data.☆79Updated last year
- LLM Tokenizer with BPE algorithm☆47Updated last year
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆36Updated 6 months ago
- Music large model based on InternLM2-chat.☆23Updated last year
- ☆136Updated last year
- paper-read-notes☆13Updated last year
- A Flask version of the model visualization tool netron☆19Updated 3 years ago
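- 🍏A repo prepared specifically for the 2024 InternLM (书生·浦语) Large Model Challenge (Spring Season)🍎, containing fine-tuning source code related to Holo (赫萝)☆12Updated last year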
- NVIDIA TensorRT Hackathon 2023 second-round topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆43Updated 2 years ago
- ☆68Updated last year
- A beginner's introductory tutorial on model compression☆22Updated last year
- ☆61Updated 6 months ago
- ☆31Updated last year
- Splices the vision head of SmolVLM2 onto the Qwen3-0.6B model and fine-tunes the combined model☆526Updated 5 months ago
- Train GRPO with zero dataset and low resources; 8-bit/4-bit/LoRA/QLoRA supported, multi-GPU supported ...☆79Updated 9 months ago
- A beginner's introductory tutorial on model compression; PDF download: https://github.com/datawhalechina/awesome-compression/releases☆356Updated 2 months ago
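- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆100Updated last year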
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆200Updated 6 months ago
- An AI assistant for Black Myth: Wukong based on InterLM; learn more of the stories behind the game (videos being updated)☆34Updated last year
- A from-scratch (0 to 1) VLM fine-tuning implementation that is not based on any framework (including pre-training and SFT)☆35Updated 5 months ago
- ☆47Updated last year
- Train an LLM from scratch using a single 24 GB GPU☆56Updated 7 months ago
- Implementation of FlashAttention in PyTorch☆180Updated last year
- Notes on learning reinforcement learning through animations☆65Updated 11 months ago
- Run ChatGLM2-6B on BM1684X☆49Updated last year
- Boost segmentation model mIoU/Dice instantly WITHOUT retraining. A plug-and-play, training-free optimization module. Published in NeurIPS…☆49Updated last week
- For training a Chinese mini large language model from scratch that can hold basic conversations; the model size depends on the hardware at hand☆65Updated last year
- Implement a miniLLM from zero to one (hands-on LLM learning)☆77Updated last year
- A minimal PyTorch re-implementation of Qwen3 VL with a fancy CLI☆316Updated 2 months ago
- ☆134Updated 11 months ago