lvyufeng / cybertron-aiLinks
mindspore implementation of transformers
☆68Updated 2 years ago
Alternatives and similar repositories for cybertron-ai
Users that are interested in cybertron-ai are comparing it to the libraries listed below
Sorting:
- Natural Language Processing Tutorial for MindSpore Users☆141Updated last year
- 一个用于学习的仿Pytorch纯Python实现的自动求导工具。☆51Updated last year
- ☆152Updated 6 months ago
- 《动手学深度学习》的MindSpore实现。供MindSpore学习者配合李沐老师课程使用。☆125Updated 2 years ago
- ATC23 AE☆46Updated 2 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆405Updated 5 months ago
- Model Compression for Big Models☆167Updated 2 years ago
- qwen-nsa☆87Updated 3 months ago
- MindSpore implementations of Generative Adversarial Networks.☆23Updated 3 years ago
- ☆18Updated 3 years ago
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**☆216Updated 11 months ago
- ☆79Updated 2 years ago
- Models and examples built with OneFlow☆101Updated last year
- [ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length☆147Updated last month
- An introduction to basic concepts of Transformers and key techniques of their recent advances.☆51Updated 2 years ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆159Updated last year
- Lion and Adam optimization comparison☆64Updated 2 years ago
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆71Updated 2 years ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆99Updated last month
- ☆51Updated 2 years ago
- [ACL 2024] Official PyTorch implementation of "IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact"☆48Updated last year
- DeepSeek Native Sparse Attention pytorch implementation☆111Updated last month
- An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star☆196Updated 3 years ago
- Efficient Mixture of Experts for LLM Paper List☆156Updated 3 months ago
- ☆66Updated last year
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆186Updated 2 years ago
- ☆81Updated last month
- analyse problems of AI with Math and Code☆27Updated 5 months ago
- Multi-Candidate Speculative Decoding☆39Updated last year
- ☆155Updated 10 months ago