☆74 · Apr 2, 2026 · Updated last week
Alternatives and similar repositories for dlinfer
Users that are interested in dlinfer are comparing it to the libraries listed below.
- A benchmark suited especially for deep learning operators ☆42 · Feb 13, 2023 · Updated 3 years ago
- ☆13 · May 23, 2025 · Updated 10 months ago
- [DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning ☆15 · Jan 13, 2024 · Updated 2 years ago
- ☆11 · Nov 5, 2024 · Updated last year
- ☆15 · Feb 1, 2016 · Updated 10 years ago
- SGLang kernel library for NPU ☆115 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆39 · Aug 30, 2025 · Updated 7 months ago
- ☆24 · Updated this week
- Community maintained hardware plugin for vLLM on Ascend ☆1,900 · Updated this week
- Fast and efficient attention method exploration and implementation. ☆25 · Mar 25, 2025 · Updated last year
- DLBlas: clean and efficient kernels ☆36 · Updated this week
- Long Context Research ☆31 · Jan 26, 2026 · Updated 2 months ago
- ☆74 · Oct 31, 2024 · Updated last year
- FlagCX is a scalable and adaptive cross-chip communication library. ☆184 · Updated this week
- Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing … ☆49 · Sep 18, 2024 · Updated last year
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ☆7,755 · Apr 4, 2026 · Updated last week
- Large Language Model Onnx Inference Framework ☆35 · Nov 25, 2025 · Updated 4 months ago
- ☆22 · Dec 7, 2023 · Updated 2 years ago
- ☆13 · Oct 17, 2024 · Updated last year
- ☆13 · Oct 25, 2024 · Updated last year
- ☆54 · Mar 15, 2025 · Updated last year
- 🍋 A Rust/Swift-like modern interpreted programming language. First-class functions, first-class expressions, and functional techniques i… ☆11 · Mar 2, 2021 · Updated 5 years ago
- An Android Application for GLCC ☆11 · Sep 30, 2022 · Updated 3 years ago
- Build LLM from scratch ☆106 · Nov 19, 2025 · Updated 4 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆12 · Nov 14, 2025 · Updated 4 months ago
- ☆76 · Nov 22, 2024 · Updated last year
- Pipeline-Parallel Lecture: Simplest Dualpipe Implementation. ☆31 · Sep 17, 2025 · Updated 6 months ago
- ☆14 · Sep 7, 2022 · Updated 3 years ago
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation" ☆12 · Feb 8, 2023 · Updated 3 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch ☆18 · Dec 21, 2022 · Updated 3 years ago
- A pure C++ cross-platform LLM acceleration library with Python bindings; supports the chatglm-6B, llama, baichuan, and moss base models on x86 / ARM ☆12 · Jan 30, 2026 · Updated 2 months ago
- A framework to compare low-bit integer and float-point formats ☆71 · Feb 6, 2026 · Updated 2 months ago
- Simple intermediate representation language for learning and research. ☆20 · Mar 27, 2020 · Updated 6 years ago
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie… ☆419 · Aug 21, 2025 · Updated 7 months ago
- ☆57 · Feb 24, 2026 · Updated last month
- A Triton JIT runtime and ffi provider in C++ ☆32 · Apr 2, 2026 · Updated last week
- An OpenAI Compatible API which integrates LLM, Embedding and Reranker. ☆18 · Aug 21, 2025 · Updated 7 months ago
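- Analysis of the official transformers source code. In the era of large AI models, pytorch and transformer are the new operating system, and everything else is software running on top of them. ☆16 · Sep 25, 2023 · Updated 2 years ago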
- ☆16 · Mar 6, 2026 · Updated last month