lvyufeng / cybertron-aiLinks
mindspore implementation of transformers
☆67Updated 2 years ago
Alternatives and similar repositories for cybertron-ai
Users that are interested in cybertron-ai are comparing it to the libraries listed below
Sorting:
- Natural Language Processing Tutorial for MindSpore Users☆142Updated last year
- ☆18Updated 2 years ago
- MindSpore implementations of Generative Adversarial Networks.☆22Updated 2 years ago
- 《动手学深度学习》的MindSpore实现。供MindSpore学习者配合李沐老师课程使用。☆116Updated last year
- ☆137Updated last month
- 一个用于学习的仿Pytorch纯Python实现的自动求导工具。☆51Updated last year
- qwen-nsa☆67Updated 2 months ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆98Updated 2 years ago
- ☆34Updated 6 months ago
- Implementation of Denoising Diffusion Probabilistic Model in MindSpore☆36Updated 2 years ago
- ☆79Updated last year
- 一款便捷的抢占显卡脚本☆336Updated 5 months ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆154Updated 8 months ago
- An introduction to basic concepts of Transformers and key techniques of their recent advances.☆49Updated last year
- An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star☆184Updated 2 years ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆71Updated 2 months ago
- DeepSpeed Tutorial☆97Updated 10 months ago
- ATC23 AE☆45Updated 2 years ago
- an implementation of parallel skills like amp, ddp, pp, tp for learning purposes☆13Updated last year
- analyse problems of AI with Math and Code☆17Updated 2 weeks ago
- ☆168Updated this week
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408☆195Updated 2 years ago
- Must-read papers on improving efficiency for pre-trained language models.☆104Updated 2 years ago
- ☆84Updated last year
- ☆11Updated last year
- pytorch distribute tutorials☆138Updated last week
- Inference code for LLaMA models☆121Updated last year
- A Tight-fisted Optimizer☆48Updated 2 years ago
- From Llama to Deepseek, grpo/mtp implemented. With pt/sft/lora/qlora included☆27Updated 2 months ago
- A light-weight script for maintaining a LOT of machine learning experiments.☆91Updated 2 years ago