DeepLink-org / DeepLinkExtLinks
☆13Updated 8 months ago
Alternatives and similar repositories for DeepLinkExt
Users that are interested in DeepLinkExt are comparing it to the libraries listed below
Sorting:
- ☆76Updated last year
- ☆73Updated last year
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆1,037Updated last week
- [DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning☆15Updated 2 years ago
- FlagScale is a large model toolkit based on open-sourced projects.☆471Updated last week
- ☆523Updated 2 weeks ago
- FlagGems is an operator library for large language models implemented in the Triton Language.☆893Updated this week
- 分享AI Infra知识&代码练习:PyTorch/vLLM/SGLang框架入门⚡️、性能加速🚀、大模型基础🧠、AI软硬件🔧等☆215Updated this week
- High Performance LLM Inference Operator Library☆695Updated this week
- Train speculative decoding models effortlessly and port them smoothly to SGLang serving.☆676Updated this week
- A benchmark suited especially for deep learning operators☆42Updated 2 years ago
- SGLang kernel library for NPU☆96Updated last week
- Disaggregated serving system for Large Language Models (LLMs).☆771Updated 10 months ago
- Summary of the Specs of Commonly Used GPUs for Training and Inference of LLM☆73Updated 5 months ago
- UltraScale Playbook 中文版☆125Updated 10 months ago
- Puzzles for learning Triton, play it with minimal environment configuration!☆613Updated last month
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆417Updated 5 months ago
- A light llama-like llm inference framework based on the triton kernel.☆171Updated last month
- LLM training technologies developed by kwai☆70Updated 2 weeks ago
- 注释的nano_vllm仓库,并且完成了MiniCPM4的适配以及注册新模型的功能☆155Updated 5 months ago
- learning how CUDA works☆373Updated 11 months ago
- A highly optimized LLM inference acceleration engine for Llama and its variants.☆906Updated this week
- Examples of CUDA implementations by Cutlass CuTe☆270Updated 7 months ago
- This repository organizes materials, recordings, and schedules related to AI-infra learning meetings.☆317Updated this week
- Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline mod…☆615Updated last year
- Ascend TileLang adapter☆206Updated this week
- how to learn PyTorch and OneFlow☆481Updated last year
- [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.☆672Updated 2 months ago
- Materials for learning SGLang☆738Updated last month
- The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)☆321Updated last year