lvyufeng / cybertron-aiLinks
mindspore implementation of transformers
☆69Updated 2 years ago
Alternatives and similar repositories for cybertron-ai
Users that are interested in cybertron-ai are comparing it to the libraries listed below
Sorting:
- Natural Language Processing Tutorial for MindSpore Users☆142Updated last year
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆156Updated 10 months ago
- 一个用于学习的仿Pytorch纯Python实现的 自动求导工具。☆51Updated last year
- 《动手学深度学习》的MindSpore实现。供MindSpore学习者配合李沐老师课程使用。☆119Updated last year
- ☆143Updated last month
- MindSpore implementations of Generative Adversarial Networks.☆22Updated 3 years ago
- ☆52Updated 2 years ago
- ☆79Updated last year
- Model Compression for Big Models☆164Updated 2 years ago
- An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star☆188Updated 3 years ago
- ATC23 AE☆46Updated 2 years ago
- Max的有趣数据集 / Max's awesome datasets☆37Updated 2 weeks ago
- qwen-nsa☆74Updated 4 months ago
- Inference code for LLaMA models☆122Updated 2 years ago
- 基于Gated Attention Unit的Transformer模型(尝鲜版)☆98Updated 2 years ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆80Updated 4 months ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆408Updated last month
- pytorch distribute tutorials☆150Updated 2 months ago
- Collaborative Training of Large Language Models in an Efficient Way☆416Updated last year
- The record of what I‘ve been through.☆101Updated 7 months ago
- Implementation of FlashAttention in PyTorch☆164Updated 7 months ago
- (Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from …☆179Updated last year
- The repo for Tsinghua summer course: Interdisciplinary Seminar on Big Models☆371Updated 3 years ago
- [ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length☆108Updated 4 months ago
- An introduction to basic concepts of Transformers and key techniques of their recent advances.☆51Updated last year
- Efficient Mixture of Experts for LLM Paper List☆118Updated this week
- ☆50Updated last year
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆67Updated 2 years ago
- ☆18Updated 2 years ago
- analyse problems of AI with Math and Code☆21Updated last month