☆54Sep 18, 2025Updated 6 months ago
Alternatives and similar repositories for CS854-F24
Users that are interested in CS854-F24 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Hybrid Framework to Build High-performance Adaptive Neural Networks for Kernel Datapath☆28May 15, 2023Updated 2 years ago
- ☆13May 30, 2024Updated last year
- ☆89Dec 11, 2019Updated 6 years ago
- ☆18Apr 21, 2024Updated last year
- Deduplication over dis-aggregated memory for Serverless Computing☆14Mar 21, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- APEX+ is an LLM Serving Simulator☆44Jun 16, 2025Updated 9 months ago
- NEO is a LLM inference engine built to save the GPU memory crisis by CPU offloading☆91Jun 16, 2025Updated 9 months ago
- A single-file educational implementation for understanding vLLM's core concepts and running LLM inference.☆44Updated this week
- ☆16Apr 22, 2025Updated 11 months ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆81Oct 15, 2025Updated 5 months ago
- Tempo is a system for declarative, efficient, end-to-end compiled dynamic deep learning☆29Oct 21, 2025Updated 5 months ago
- ☆21Apr 2, 2023Updated 3 years ago
- ☆23Apr 28, 2024Updated last year
- WHISPER is a comprehensive benchmark suite for emerging persistent memory technologies.☆10May 10, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of pap…☆282Mar 6, 2025Updated last year
- DeeperGEMM: crazy optimized version☆86May 5, 2025Updated 11 months ago
- This is the open-source site for XFDetector (ASPLOS'20)☆11Mar 5, 2021Updated 5 years ago
- A record of reading list on some MLsys popular topic☆23Mar 20, 2025Updated last year
- KFunca: A minimalist, high-performance GPU-based automatic differentiation framework☆29Aug 14, 2025Updated 7 months ago
- Arbitrary offloads for RDMA NICs☆99Apr 25, 2022Updated 3 years ago
- An infrastructure for inline acceleration of network applications☆30Oct 25, 2021Updated 4 years ago
- ☆149Apr 2, 2026Updated last week
- SJTU CS473 Project: Implementation of Deep Closest Point in TensorFlow, and its comparison with other registration methods.☆12Jun 14, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Artifacts of EuroSys'24 paper "Exploring Performance and Cost Optimization with ASIC-Based CXL Memory"☆31Feb 21, 2024Updated 2 years ago
- This is the implementation repository of our SOSP'24 paper: Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value …☆24Oct 20, 2024Updated last year
- ☆31Apr 4, 2026Updated last week
- ☆23Oct 31, 2023Updated 2 years ago
- Lenovo modifications to Linux memcached for enhanced persistent memory support☆18Nov 4, 2021Updated 4 years ago
- ☆64Jun 29, 2022Updated 3 years ago
- A tiny yet powerful LLM inference system tailored for researching purpose. vLLM-equivalent performance with only 2k lines of code (2% of …☆320Jun 10, 2025Updated 10 months ago
- ☆71Feb 13, 2022Updated 4 years ago
- GraphRag vs Embeddings☆16Jul 14, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Artifacts for ATC '22 paper "Faster Software Packet Processing on FPGA NICs with eBPF Program Warping"☆17May 20, 2022Updated 3 years ago
- ☆845Mar 18, 2026Updated 3 weeks ago
- An Automated Performance Optimization Framework for P4-Programmable SmartNICs☆28Nov 18, 2023Updated 2 years ago
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 6 months ago
- Simple example wire app☆14Oct 14, 2021Updated 4 years ago
- Justitia provides RDMA isolation between applications with diverse requirements.☆43May 25, 2022Updated 3 years ago
- ⚡ Bring some magic to i.sjtu.edu.cn☆22Jan 3, 2020Updated 6 years ago