DeepLink-org / DeepLinkExt
☆11Updated this week
Related projects ⓘ
Alternatives and complementary repositories for DeepLinkExt
- ☆59Updated 3 weeks ago
- ☆68Updated last week
- A benchmark suited especially for deep learning operators☆41Updated last year
- ☆20Updated this week
- Development repository for the Triton-Linalg conversion☆151Updated last month
- Easiest and laziest way for building multi-agent LLMs applications.☆1,021Updated this week
- ☆138Updated 2 weeks ago
- FlagPerf is an open-source software platform for benchmarking AI chips.☆314Updated this week
- ☆79Updated 8 months ago
- FlagGems is an operator library for large language models implemented in Triton Language.☆343Updated this week
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆206Updated this week
- ☆140Updated 7 months ago
- A model compilation solution for various hardware☆378Updated this week
- An acceleration library that supports arbitrary bit-width combinatorial quantization operations☆226Updated last month
- Large Language Model (LLM) Serving Paper and Resource List☆13Updated 2 months ago
- Yinghan's Code Sample☆289Updated 2 years ago
- Fast and easy distributed model training examples.☆10Updated this week
- TVM Documentation in Chinese Simplified / TVM 中文文档☆957Updated this week
- ☆290Updated last week
- how to learn PyTorch and OneFlow☆349Updated 8 months ago
- PaddlePaddle custom device implementaion. (『飞桨』自定义硬件接入实现)☆71Updated this week
- Summary of some awesome work for optimizing LLM inference☆37Updated this week
- ☆57Updated this week
- ☆124Updated 2 weeks ago
- PyTorch distributed training acceleration framework☆34Updated this week
- ☆166Updated this week
- learning how CUDA works☆169Updated 3 months ago
- FlagScale is a large model toolkit based on open-sourced projects.☆178Updated this week
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆547Updated last month
- TePDist (TEnsor Program DISTributed) is an HLO-level automatic distributed system for DL models.☆90Updated last year