OSU-Nowlab / FloverLinks
A novel temporal fusion framework for propelling autoregressive model inference
☆11Updated this week
Alternatives and similar repositories for Flover
Users that are interested in Flover are comparing it to the libraries listed below
Sorting:
- ☆25Updated 3 months ago
- ☆66Updated 3 weeks ago
- A GPU-driven system framework for scalable AI applications☆114Updated 4 months ago
- ☆29Updated 4 months ago
- ☆71Updated 2 months ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 7 months ago
- Sample examples of how to call collective operation functions on multi-GPU environments. A simple example of using broadcast, reduce, all…☆33Updated last year
- ☆37Updated 5 months ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated 3 months ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆86Updated last week
- Intel® SHMEM - Device initiated shared memory based communication library☆23Updated 2 months ago
- FlexFlow Serve: Low-Latency, High-Performance LLM Serving☆41Updated 3 weeks ago
- A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.☆81Updated 2 weeks ago