☆74Jun 30, 2026Updated this week
Alternatives and similar repositories for dlinfer
Users that are interested in dlinfer are comparing it to the libraries listed below.
- A benchmark suited especially for deep learning operators☆42Feb 13, 2023Updated 3 years ago
- ☆13May 23, 2025Updated last year
- Composable and Embeddable Communication Runtime for Distributed AI Services☆102Jun 5, 2026Updated 3 weeks ago
- [DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning☆15Jan 13, 2024Updated 2 years ago
- ☆15Feb 1, 2016Updated 10 years ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆16Jan 16, 2026Updated 5 months ago
- A comprehensive knowledge base for Huawei Ascend NPU development, structured as distributed Agent Skills. https://ascend-ai-coding.github…☆125Updated this week
- ☆26Jun 8, 2026Updated 3 weeks ago
- Community maintained hardware plugin for vLLM on Ascend☆2,295Updated this week
- ☆76Oct 31, 2024Updated last year
- FlagCX is a scalable and adaptive cross-chip communication library.☆216Jun 24, 2026Updated last week
- Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing …☆49Sep 18, 2024Updated last year
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,928Updated this week
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 7 months ago
- ☆23Dec 7, 2023Updated 2 years ago
- ☆13Oct 17, 2024Updated last year
- 🍋 A Rust/Swift-like modern interpreted programming language. First-class functions, first-class expressions, and functional techniques i…☆11Mar 2, 2021Updated 5 years ago
- An Android Application for GLCC☆11Sep 30, 2022Updated 3 years ago
- Codebase for "Reducing Representation Drift in Online Continual Learning"☆14Jun 8, 2021Updated 5 years ago
- Build LLM from scratch☆121Jun 18, 2026Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 7 months ago
- ☆76Nov 22, 2024Updated last year
- CMake configurations for PPL projects☆12Aug 10, 2024Updated last year
- A pure C++ cross-platform LLM acceleration library, callable from Python; supports chatglm-6B, llama, baichuan, and moss base models on x86 / ARM☆13Jun 10, 2026Updated 3 weeks ago
- An OpenAI-compatible API that integrates LLM, Embedding, and Reranker.☆18Aug 21, 2025Updated 10 months ago
- InternEvo is an open-source lightweight training framework that aims to support model pre-training without the need for extensive dependencie…☆420Aug 21, 2025Updated 10 months ago
- Simple intermediate representation language for learning and research.☆22Mar 27, 2020Updated 6 years ago
- Query tuple info in relation file and clean hint info in tuple, change transaction status in commit log file.☆10Dec 6, 2015Updated 10 years ago
- Nex Venus Communication Library☆75Nov 17, 2025Updated 7 months ago
- ☆59Feb 24, 2026Updated 4 months ago
- seq2seq with attention in mxnet☆18Oct 13, 2017Updated 8 years ago
- ☆19May 20, 2026Updated last month
- An efficient video loader for deep learning with smart shuffling that's super easy to digest☆54Sep 29, 2023Updated 2 years ago
- Gensis is a lightweight deep learning framework written from scratch in Python, with Triton as its backend for high-performance computing…☆35Jan 15, 2026Updated 5 months ago
- ☆19Feb 13, 2024Updated 2 years ago
- A package for filtering sensitive data (parameters, keys) from a variety of JS objects☆10Feb 17, 2026Updated 4 months ago
- Uses OCR recognition results to constrain the answers of multimodal large models in visual information extraction tasks☆44Dec 31, 2024Updated last year
- Ask questions of your PDF☆10Jun 11, 2023Updated 3 years ago
- [ICML 2026] A framework to compare low-bit integer and floating-point formats☆79May 6, 2026Updated last month