VIA-Research / vTrain
☆66 · Updated last month
Alternatives and similar repositories for vTrain
Users who are interested in vTrain are comparing it to the libraries listed below:
- ☆45 · Updated last year
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale ☆112 · Updated 2 months ago
- Open-source release of LazyDP, published in ASPLOS 2024 ☆21 · Updated last year
- ☆20 · Updated 4 months ago
- [ACM EuroSys '23] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access ☆56 · Updated last year
- ☆24 · Updated last year
- ☆134 · Updated 3 months ago
- LLM serving cluster simulator ☆99 · Updated last year
- LLM inference analyzer for different hardware platforms ☆65 · Updated last week
- NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing ☆80 · Updated 10 months ago
- ☆23 · Updated 5 months ago
- ☆23 · Updated 2 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling ☆11 · Updated last year
- [HPCA '24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System ☆44 · Updated last year
- [ATC '24] Metis: Fast automatic distributed training on heterogeneous GPUs (https://www.usenix.org/conference/atc24/presentation/um) ☆25 · Updated 5 months ago
- ☆11 · Updated 4 months ago
- ☆25 · Updated 2 years ago
- ☆36 · Updated last year
- A cycle-level simulator for M2NDP ☆27 · Updated this week
- ☆34 · Updated last week
- Artifact for the paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025 ☆52 · Updated this week
- ☆138 · Updated 10 months ago
- ☆17 · Updated 6 months ago
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI '24) ☆128 · Updated 9 months ago
- ☆37 · Updated 2 years ago
- ☆48 · Updated 4 months ago
- UPMEM LLM Framework allows profiling PyTorch layers and functions and simulating those layers/functions with a given hardware profile. ☆27 · Updated last week
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization ☆29 · Updated last year
- A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores ☆51 · Updated last year
- ☆59 · Updated 10 months ago