ZouJiu1 / numpy_transformerLinks
transformer which using numpy,vision transformer of VIT, MNIST testset precision = 97.2%,mutil-attention, patch embed, position embed, full connect, convolution, etc. train normally, save model, restore model
☆10Updated 4 months ago
Alternatives and similar repositories for numpy_transformer
Users that are interested in numpy_transformer are comparing it to the libraries listed below
Sorting:
- ☆59Updated last month
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆34Updated 2 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆72Updated last year
- 一个PyTorch实现的五子棋AI项目☆21Updated last month
- Implementation of FlashAttention in PyTorch☆170Updated 8 months ago
- ☆45Updated last year
- A minimal, easy-to-read PyTorch reimplementation of the Qwen3 and Qwen2.5 VL with a fancy CLI☆146Updated 3 weeks ago
- LLM101n: Let's build a Storyteller 中文版☆132Updated last year
- Parse LaTeX math expressions☆33Updated 2 months ago
- Music large model based on InternLM2-chat.☆22Updated 9 months ago
- 模型压缩的小白入门教程☆22Updated last year
- Tiny C++ LLM inference implementation from scratch☆66Updated 2 weeks ago
- 模型可视化工具netron的Flask版本☆18Updated 3 years ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆42Updated last year
- 一本系统地教你将深度学习模型的性能最大化的战术手册。☆20Updated 2 years ago
- Fairy±i (iFairy): Complex-valued Quantization Framework for Large Language Models☆103Updated 2 weeks ago
- LLM Tokenizer with BPE algorithm☆39Updated last year
- Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...☆78Updated 4 months ago
- Download the source latex code of multiple arXiv paper with one click☆111Updated last year
- The repository supports TensorRT, QNN platform inference, 2D obstacle detection yolo series (yolov5, yolov8, yolo11, yolox), semantic seg…☆20Updated 4 months ago
- 从0开始,将chatgpt的技术路线跑一遍。☆258Updated last year
- ☆30Updated last year
- 🍏专门为 2024 书 生·浦语大模型挑战赛 (春季赛) 准备的 Repo🍎收录了赫萝相关的微调源码☆11Updated last year
- ☆110Updated 11 months ago
- 关于书籍CUDA Programming使用了pycuda模块的Python版本的示例代码☆257Updated 5 years ago
- TensorRT简明教程☆26Updated 4 years ago
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆50Updated last year
- run ChatGLM2-6B in BM1684X☆49Updated last year
- 跨平台的容器化Linux桌面环境☆73Updated 7 months ago
- PaddlePaddle Code Convert Toolkit. 『飞桨』深度学习代码转换工具☆112Updated last week