DeepLink-org/dlinfer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/DeepLink-org/dlinfer)

DeepLink-org / dlinfer

☆74

Alternatives and similar repositories for dlinfer

Users that are interested in dlinfer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DeepLink-org / DLOP-Bench
View on GitHub
A benchmark suited especially for deep learning operators
☆42Feb 13, 2023Updated 3 years ago
DeepLink-org / DeepLinkExt
View on GitHub
☆13May 23, 2025Updated last year
DeepLink-org / DLSlime
View on GitHub
Composable and Embeddable Communication Runtime for Distributed AI Services
☆102Jun 5, 2026Updated last month
InternLM / Kernel-Smith
View on GitHub
☆27Mar 31, 2026Updated 3 months ago
bxttttt / getting-started-guide-and-introduction-to-MXMACA
View on GitHub
MXMACA入门materials
☆22Jun 9, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
DeepLink-org / DLBlas
View on GitHub
DLBlas: clean and efficient kernels
☆43Updated this week
guanlisheng / infobright-4.0.7
View on GitHub
☆15Feb 1, 2016Updated 10 years ago
DeepLink-org / DLRouter
View on GitHub
☆20Jun 11, 2026Updated last month
wangshuai09 / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆39Jun 24, 2026Updated last month
TransferQueue / TransferQueue
View on GitHub
[Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…
☆16Jan 16, 2026Updated 6 months ago
DeepLink-org / deeplink.framework
View on GitHub
☆76Oct 31, 2024Updated last year
ModelTC / EasyLLM
View on GitHub
Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing …
☆49Sep 18, 2024Updated last year
InternLM / lmdeploy
View on GitHub
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
☆7,972Updated this week
flagos-ai / libtriton_jit
View on GitHub
A Triton JIT runtime and ffi provider in C++
☆37Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
inisis / OnnxLLM
View on GitHub
Large Language Model Onnx Inference Framework
☆35Nov 25, 2025Updated 7 months ago
vllm-project / vllm-ascend
View on GitHub
Community maintained hardware plugin for vLLM on Ascend
☆2,461Updated this week
sgl-project / sgl-kernel-npu
View on GitHub
SGLang kernel library for NPU
☆170Updated this week
RightNow-AI / StreamIndex
View on GitHub
Memory-bounded compressed sparse attention via streaming top-k. Triton kernels for the DeepSeek-V4 lightning indexer. 32x regime extensio…
☆22May 5, 2026Updated 2 months ago
DeepLink-org / DeepTrace
View on GitHub
DeepTrace: A lightweight, scalable real-time diagnostic and analysis tool for distributed training tasks.
☆18Nov 4, 2025Updated 8 months ago
Ascend / torchair
View on GitHub
☆26Jun 8, 2026Updated last month
chow-q / WeixinBot
View on GitHub
基于鼠标键盘操作的微信自动聊天机器人
☆13Nov 26, 2024Updated last year
DDGRCF / GLCC_AndroidApplication
View on GitHub
An Android Application for GLCC
☆11Sep 30, 2022Updated 3 years ago
BugenZhao / Lime
View on GitHub
🍋 A Rust/Swift-like modern interpreted programming language. First-class functions, first-class expressions, and functional techniques i…
☆11Mar 2, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
verl-project / rl-insight
View on GitHub
Provide performance insight capabilities for RL frameworks.
☆46Updated this week
ant-research / M2-Miner
View on GitHub
[ICLR 2026] M2-Miner: Multi-Agent Enhanced MCTS for Mobile GUI Agent Data Mining
☆55Apr 22, 2026Updated 3 months ago
ganler / memcov
View on GitHub
Collect simple coverage information in memory.
☆11Oct 6, 2022Updated 3 years ago
OpenPPL / hpcc
View on GitHub
CMake configurations for PPL projects
☆12Aug 10, 2024Updated last year
Adlik / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆12Nov 14, 2025Updated 8 months ago
DeepLink-org / DIOPI
View on GitHub
☆76Nov 22, 2024Updated last year
Ascend / TransferQueue
View on GitHub
An asynchronous streaming data management module for efficient post-training.
☆118Jul 12, 2026Updated last week
amaabca / sensitive-param-filter
View on GitHub
A package for filtering sensitive data (parameters, keys) from a variety of JS objects
☆10Feb 17, 2026Updated 5 months ago
TylunasLi / fastllm
View on GitHub
纯c++的全平台llm加速库，支持python调用，支持chatglm-6B, llama, baichuan, moss基座，x86 / ARM
☆13Jun 10, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
hisrg / Onnx-python
View on GitHub
This repository is Onnx tutorial summary for python implements , which comes from other web resource.
☆29Oct 23, 2022Updated 3 years ago
hcd233 / Aris-AI-Model-Server
View on GitHub
An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API
☆18Aug 21, 2025Updated 11 months ago
AlpinDale / RPTQ-for-LLaMA
View on GitHub
Efficient 3bit/4bit quantization of LLaMA models
☆18May 18, 2023Updated 3 years ago
ZiyueHuang / MXSeq2Seq
View on GitHub
seq2seq with attention in mxnet
☆18Oct 13, 2017Updated 8 years ago
jquesnelle / transformers-openai-api
View on GitHub
An OpenAI Completions API compatible server for NLP transformers models
☆65Nov 20, 2023Updated 2 years ago
georgia-tech-db / eva-decord
View on GitHub
An efficient video loader for deep learning with smart shuffling that's super easy to digest
☆54Sep 29, 2023Updated 2 years ago
mlapistudy / ICSE2022_158
View on GitHub
This is the artifact for paper “Automated Testing of Software that Uses Machine Learning APIs (#158)” in ICSE2022
☆12Nov 15, 2022Updated 3 years ago