☆74Feb 11, 2026Updated 2 weeks ago
Alternatives and similar repositories for dlinfer
Users that are interested in dlinfer are comparing it to the libraries listed below
Sorting:
- ☆13May 23, 2025Updated 9 months ago
- A benchmark suited especially for deep learning operators☆42Feb 13, 2023Updated 3 years ago
- MXMACA入门materials☆20Jun 9, 2024Updated last year
- SGLang kernel library for NPU☆101Updated this week
- ☆11Nov 5, 2024Updated last year
- [DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning☆15Jan 13, 2024Updated 2 years ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆13Jan 16, 2026Updated last month
- FlagCX is a scalable and adaptive cross-chip communication library.☆174Updated this week
- A Triton JIT runtime and ffi provider in C++☆31Updated this week
- DLBlas: clean and efficient kernels☆33Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆39Aug 30, 2025Updated 6 months ago
- Community maintained hardware plugin for vLLM on Ascend☆1,711Updated this week
- ☆24Updated this week
- ☆54Mar 15, 2025Updated 11 months ago
- Fast and efficient attention method exploration and implementation.☆25Mar 25, 2025Updated 11 months ago
- ☆22Dec 7, 2023Updated 2 years ago
- A framework to compare low-bit integer and float-point formats☆66Feb 6, 2026Updated 3 weeks ago
- Distributed IO-aware Attention algorithm☆24Sep 24, 2025Updated 5 months ago
- Built upon Megatron-Deepspeed and HuggingFace Trainer, EasyLLM has reorganized the code logic with a focus on usability. While enhancing …☆49Sep 18, 2024Updated last year
- ☆30May 22, 2024Updated last year
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆39Jun 4, 2025Updated 8 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,618Updated this week
- LMCache on Ascend☆49Updated this week
- LoRA-RSC☆11Nov 14, 2025Updated 3 months ago
- ☆76Nov 22, 2024Updated last year
- ☆37Jul 5, 2025Updated 7 months ago
- RespireNet is an innovative web-based application that harnesses the capabilities of deep learning and Mel-frequency cepstral coefficient…☆10Aug 2, 2023Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 3 months ago
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆20Apr 12, 2025Updated 10 months ago
- WeChat official account crawler 微信公众号爬虫☆12Apr 13, 2024Updated last year
- ☆79May 6, 2024Updated last year
- [ECCV 2024] AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation☆35Sep 12, 2024Updated last year
- A domain-specific language (DSL) based on Triton but providing higher-level abstractions.☆41Feb 4, 2026Updated 3 weeks ago
- The official repository of the Omni-MATH benchmark.☆93Dec 22, 2024Updated last year
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆13Dec 31, 2024Updated last year
- Code for ADMM-DAD network☆10Apr 22, 2023Updated 2 years ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- ☆11Oct 31, 2024Updated last year
- easy_clash_tool是一个clash的python库,可以很便捷的自动切换可用节点☆16Apr 17, 2024Updated last year