ISCS-ZJU / Decentralized-inference-based-on-vLLM
☆30Feb 12, 2025Updated last year
Alternatives and similar repositories for Decentralized-inference-based-on-vLLM
Users who are interested in Decentralized-inference-based-on-vLLM are comparing it to the libraries listed below
- ☆41Jan 10, 2025Updated last year
- Source code for GMBE-SC'23☆35Jun 25, 2023Updated 2 years ago
- Source code for PHAST-TPDS'22☆29Dec 27, 2023Updated 2 years ago
- Source code for XPGraph-MICRO'22☆26Jul 30, 2022Updated 3 years ago
- Source code for AdaMBE-SC'24☆25Jun 20, 2024Updated last year
- Source code for CSWAP-CLUSTER'21 and CSWAP+-TPDS'22☆25Mar 2, 2022Updated 3 years ago
- Applying asynchronous tensor swapping to the PyTorch framework.☆31Jun 20, 2023Updated 2 years ago
- ☆31Apr 26, 2025Updated 9 months ago
- Source code for CCLBTree-EuroSys'24☆44Dec 27, 2023Updated 2 years ago
- ☆30Apr 28, 2025Updated 9 months ago
- Source code for ChunkGraph-ATC'24☆28Jul 13, 2024Updated last year
- Source code for AMBEA-TC'24☆29Jun 29, 2024Updated last year
- Source code for NVAlloc-ASPLOS'22☆59Mar 2, 2022Updated 3 years ago
- Zplot demos☆21Nov 22, 2021Updated 4 years ago
- Source code for iCache-HPCA'23☆50Apr 22, 2023Updated 2 years ago
- ☆10May 12, 2023Updated 2 years ago
- ☆15Jun 14, 2022Updated 3 years ago
- Implementation of Selective_Backpropagation from the paper "Accelerating Deep Learning by Focusing on the Biggest Losers"☆15Feb 2, 2020Updated 6 years ago
- ☆13Jan 23, 2021Updated 5 years ago
- ☆15Mar 30, 2022Updated 3 years ago
- Low level algorithms for persistent memory.☆16Feb 9, 2021Updated 5 years ago
- Use a local LLM to convert PDF to Markdown☆25Mar 10, 2025Updated 11 months ago
- Linux source code for ISCA 2020 paper "Enhancing and Exploiting Contiguity for Fast Memory Virtualization"☆20Oct 31, 2020Updated 5 years ago
- Adaptive Confidence Multi-View Hashing☆23Dec 13, 2023Updated 2 years ago
- MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions☆19May 26, 2021Updated 4 years ago
- Profiling and Improving the PyTorch Dataloader for high-latency Storage☆20Apr 18, 2023Updated 2 years ago
- Python script that can be embedded into a PyTorch model to print the model size.☆19Apr 19, 2021Updated 4 years ago
- inline_asm_lockfree_queue☆20Jan 11, 2020Updated 6 years ago
- Benchmarking tools for pmemkv☆23Mar 22, 2023Updated 2 years ago
- FGNN's artifact evaluation (EuroSys 2022)☆18Apr 25, 2022Updated 3 years ago
- Guided research work whose goal was to estimate full-body dense human pose from sparse IMU sensor data☆23Nov 22, 2019Updated 6 years ago
- FlashMob is a shared-memory random walk system.☆32Jul 7, 2023Updated 2 years ago
- A practical approach using inertial sensors (MPU-6050) applied to 3D motion tracking.☆26Aug 10, 2020Updated 5 years ago
- Generate draw.io UML Sequence Diagram from text file.☆37Oct 24, 2025Updated 3 months ago
- ddl-benchmarks: Benchmarks for Distributed Deep Learning☆36May 29, 2020Updated 5 years ago
- ☆36Jun 10, 2024Updated last year
- ☆38Jan 15, 2021Updated 5 years ago
- The Stanford Transactional Applications for Multi-Processing; a benchmark suite for transactional memory research☆44Oct 1, 2021Updated 4 years ago
- ☆48Jun 10, 2023Updated 2 years ago