gofreelee/SpaceServe


☆32

Alternatives and similar repositories for SpaceServe

Users who are interested in SpaceServe are comparing it to the libraries listed below.

atomicapple0 / libsmctrl
Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.
☆67 · Nov 24, 2025 · Updated 8 months ago
chenyu-jiang / dcp
Code repository for the SOSP'25 paper DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism.
☆21 · Nov 28, 2025 · Updated 7 months ago
AkideLiu / MiniCache
☆14 · Sep 7, 2024 · Updated last year
vbdi / epdserve
[ICML 2025] Efficiently Serving Large Multimodal Models Using EPD Disaggregation
☆24 · Jul 11, 2026 · Updated 2 weeks ago
pittisl / mPnP-LLM
Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"
☆13 · Jan 19, 2024 · Updated 2 years ago
oliverYoung2001 / UltraAttn
SC'25 UltraAttn: Efficiently Parallelizing Attention through Hierarchical Context-Tiling
☆16 · Aug 14, 2025 · Updated 11 months ago
SJTU-IPADS / reef-artifacts
A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.
☆43 · May 29, 2022 · Updated 4 years ago
jiashu-z / how-to-plot
How to plot for papers, slides, demos, etc.
☆10 · Apr 7, 2022 · Updated 4 years ago
hpdps-group / ElasticMM
ElasticMM: Elastic and Efficient MLLM Serving System
☆44 · May 10, 2026 · Updated 2 months ago
infinigence / HamiltonAttention
☆45 · Oct 15, 2025 · Updated 9 months ago
shengshu-ai / TurboServe
TurboServe: Serving Streaming Video Generation Efficiently and Economically
☆37 · Jul 12, 2026 · Updated 2 weeks ago
lsds / Tempo
Tempo is a system for declarative, efficient, end-to-end compiled dynamic deep learning
☆30 · Oct 21, 2025 · Updated 9 months ago
TKONIY / tutorial-any-repo
Claude Code skill: Generate file-by-file code tutorial websites for any repository with parallel agent teams
☆28 · Mar 13, 2026 · Updated 4 months ago
xinjin / course-net-seminar
Selected Topics in Computer Networks @ Johns Hopkins University
☆19 · Dec 17, 2020 · Updated 5 years ago
lzhangbv / acpsgd
[ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning
☆10 · Apr 28, 2023 · Updated 3 years ago
sjtu-epcc / DVABatch
☆21 · May 13, 2022 · Updated 4 years ago
AlayaDB-AI / ParaGraph
A cross-modal vector index with fast construction on heterogeneous CPU-GPU environment. Published on DaMoN@SIGMOD 2025.
☆16 · Jul 16, 2025 · Updated last year
Oneflow-Inc / serving
OneFlow Serving
☆20 · Apr 10, 2025 · Updated last year
llumnix-project / llumnix
☆36 · May 26, 2026 · Updated 2 months ago
cornserve-ai / cornserve
Easy, Fast, and Scalable Multimodal AI
☆129 · Jun 2, 2026 · Updated last month
Cytosine2020 / crust
A Rust style C++ library.
☆19 · Sep 3, 2022 · Updated 3 years ago
Odysseusq / VLCache
Official Repo for paper "VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference"
☆16 · Mar 28, 2026 · Updated 3 months ago
arnavdantuluri / StableTriton
The first open source triton inference engine for Stable Diffusion, specifically for sdxl
☆12 · Nov 27, 2023 · Updated 2 years ago
29DCH / AI_ML_DataAnalysis_DataVisualization_Classic-Examples
Several classic examples in AI, ML, DA, and DV, including traffic jam simulation (NagelSchreckenberg), the Monte Carlo queuing problem, face recognition (RecognitionFace), and image inference with a genetic algorithm (IconGenetic)
☆10 · Oct 14, 2018 · Updated 7 years ago
LinkAnonymous / BESA
☆12 · Oct 9, 2023 · Updated 2 years ago
cyhdmjzzy / DeepEP-Code-Analysis
☆26 · Feb 27, 2026 · Updated 4 months ago
maufadel / EnergyMeter
A Python tool to measure the energy consumption of software
☆16 · Feb 5, 2026 · Updated 5 months ago
DerrickYLJ / LessIsMore
[ICML 2026] Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning
☆34 · Sep 12, 2025 · Updated 10 months ago
SiriusInfTra / Sirius
☆18 · Sep 21, 2025 · Updated 10 months ago
DeepLink-org / DLSlime
Composable and Embeddable Communication Runtime for Distributed AI Services
☆102 · Jun 5, 2026 · Updated last month
DBGroup-SUSTech / GHive
GHive: Accelerating Analytical Query Processing in Apache Hive via CPU-GPU Heterogeneous Computing.
☆14 · Nov 8, 2023 · Updated 2 years ago
wangrunji0408 / rjrouter
[AFK] Hardware router in Chisel (THU Network Joint Lab 2020)
☆14 · Oct 8, 2020 · Updated 5 years ago
GATECH-EIC / ShiftAddNAS
[ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
☆15 · May 18, 2022 · Updated 4 years ago
sail-sg / VocabularyParallelism
Vocabulary Parallelism
☆26 · Mar 10, 2025 · Updated last year
mlc-ai / tirx-kernels
ML kernels and benchmarking infrastructure written in TIRx
☆70 · Updated this week
Dragonsson / Pseudo_efficientNet
PyTorch: training an EfficientNet model with pseudo-labels
☆11 · Dec 28, 2019 · Updated 6 years ago
shawnricecake / search-llm
[NeurIPS 2024] Search for Efficient LLMs
☆16 · Jan 16, 2025 · Updated last year
xxyux / SpInfer
SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
☆68 · Mar 25, 2025 · Updated last year
ATR-DBI / Map-EQA
☆12 · Oct 10, 2024 · Updated last year