InfiniTensor/InfiniLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/InfiniTensor/InfiniLM)

InfiniTensor / InfiniLM

☆184

Alternatives and similar repositories for InfiniLM

Users that are interested in InfiniLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

InfiniTensor / InfiniCore
View on GitHub
☆67Updated this week
lofty14 / restaurant-digitalization-blueprint
View on GitHub
餐饮连锁数字化 + AI 全景蓝图:原串(平价烤串连锁)的架构决策、业务口径、踩坑实录与可直接喂给 AI 的复刻指令。纯自然语言方案,不含代码与任何真实经营数据。
☆120Jul 17, 2026Updated last week
InfiniTensor / InfiniTensor
View on GitHub
InfiniTensor is a high-performance inference engine tailored for GPUs and AI accelerators. Its design focuses on effective deployment and…
☆375Updated this week
InfiniTensor / InfiniTrain
View on GitHub
☆46Updated this week
InfiniTensor / TinyInfiniTrain
View on GitHub
训练营训练方向项目
☆29Jan 28, 2026Updated 5 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
InfiniTensor / Learning-CUDA
View on GitHub
InfiniTensor 大模型与人工智能系统训练营 CUDA 方向作业与项目系统
☆51Feb 24, 2026Updated 5 months ago
InfiniTensor / InfiniCCL
View on GitHub
InfiniCCL is a unified, cross-platform collective communication library designed for heterogeneous accelerator environments.
☆16Updated this week
YdrMaster / operators-rs
View on GitHub
算子库（Rust）
☆15Jul 24, 2025Updated last year
InfiniTensor / llaisys
View on GitHub
Let's Learn AI SYStem
☆49Jul 13, 2026Updated 2 weeks ago
wejoncy / sfllm
View on GitHub
Super fast serving stack for LLM on Windows/Linux/Macos
☆17Dec 17, 2025Updated 7 months ago
yuhanzhu612 / OSCPU
View on GitHub
我的一生一芯项目
☆16Dec 14, 2021Updated 4 years ago
gogongxt / nano-vllm
View on GitHub
Nano vLLM
☆25Aug 11, 2025Updated 11 months ago
serdes21 / flashtile
View on GitHub
FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs.
☆61Feb 6, 2026Updated 5 months ago
KJLdefeated / RL.cu
View on GitHub
RLVR training for LLM in CUDA/C++
☆42Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ScienceNLP-Lab / LLM-SSC
View on GitHub
Rhetorical sentence classification using LLMs
☆11Oct 26, 2025Updated 9 months ago
InfiniTensor / learning-tools
View on GitHub
☆15Jul 17, 2025Updated last year
wenjunsun / dlsys-needle-m1
View on GitHub
Final project for the class "Deep Learning Systems Algorithms and Implementation" from CMU, where we try to make needle work with Apple M…
☆10Jan 8, 2023Updated 3 years ago
xkuan / USTC-DL
View on GitHub
Deep Learning 2021 in School of Data Science, USTC
☆12May 17, 2023Updated 3 years ago
mlsysAE2022 / ae_mlsys_gnn
View on GitHub
☆11Mar 9, 2022Updated 4 years ago
HuXia7157 / garbage-classification-system
View on GitHub
用pytorch训练18层残差神经网络，用pyqt设计界面
☆12Jun 23, 2020Updated 6 years ago
HJCheng0602 / nanoPD
View on GitHub
A from-scratch Prefill/Decode disaggregation inference engine for LLMs
☆161May 10, 2026Updated 2 months ago
deGravity / hybridbrep
View on GitHub
Self-supervised representation learning for BReps.
☆19Jan 24, 2024Updated 2 years ago
CalvinXKY / BasicCUDA
View on GitHub
A tutorial for CUDA&PyTorch
☆479Mar 23, 2026Updated 4 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
xforcevesa / new-vrwkv
View on GitHub
☆24Mar 23, 2025Updated last year
DeepLink-org / DLBlas
View on GitHub
DLBlas: clean and efficient kernels
☆44Updated this week
guoqingbao / xinfer
View on GitHub
Blazing-fast LLM inference in pure Rust. No PyTorch and Python runtime.
☆294Updated this week
bigconvience / llfs
View on GitHub
learn llvm from scratch
☆14Apr 29, 2023Updated 3 years ago
luliyucoordinate / mynet
View on GitHub
☆20Feb 5, 2022Updated 4 years ago
OpenBMB / RAG-DDR
View on GitHub
This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".
☆23Oct 28, 2024Updated last year
Wenyueh / MinivLLM
View on GitHub
Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation
☆940Updated this week
ecomfe / webpack-auto-cdn-plugin
View on GitHub
Webpack plugin to automatically extract dependencies and reference them via CDN
☆10Jan 7, 2023Updated 3 years ago
Reconfigurable-Computing / Vitis_workflow
View on GitHub
Vitis 部署加速器工作流介绍
☆13Jan 10, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
CSerht / NSCSCC2021FinalCode
View on GitHub
龙芯杯2021个人赛决赛最终代码
☆11Sep 1, 2021Updated 4 years ago
MIDA-group / inspire
View on GitHub
INSPIRE: Intensity and Spatial Information-Based Deformable Image Registration
☆12Jun 30, 2021Updated 5 years ago
thu-pacman / Spindle
View on GitHub
☆34Jun 20, 2023Updated 3 years ago
InfiniTensor / learning-cxx
View on GitHub
☆37Jan 25, 2025Updated last year
deGravity / breploader
View on GitHub
A common C++ BRep interface for OpenCascade and Parasolid.
☆21Jan 24, 2024Updated 2 years ago
stogiannidis / srbench
View on GitHub
Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"
☆19Feb 1, 2026Updated 5 months ago
difey / nano-vllm-v1
View on GitHub
Nano vLLM v1 engine
☆16Aug 6, 2025Updated 11 months ago