HabanaAI / gaudi-pytorch-bridgeLinks
☆15Updated last week
Alternatives and similar repositories for gaudi-pytorch-bridge
Users that are interested in gaudi-pytorch-bridge are comparing it to the libraries listed below
Sorting:
- A CUTLASS implementation using SYCL☆27Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆76Updated this week
- ☆90Updated 5 months ago
- ☆117Updated last month
- ☆77Updated last month
- ☆79Updated 2 years ago
- Github mirror of trition-lang/triton repo.☆39Updated this week
- Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITAS REPO☆29Updated last year
- Optimize GEMM with tensorcore step by step☆26Updated last year
- LLM Inference analyzer for different hardware platforms☆73Updated 3 weeks ago
- A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.☆47Updated 2 weeks ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆59Updated last year
- ☆62Updated 6 months ago
- ☆96Updated 9 months ago
- ☆98Updated last year
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆12Updated 2 months ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆89Updated 2 years ago
- ☆31Updated 2 years ago
- A lightweight design for computation-communication overlap.☆143Updated this week
- Artifacts of EVT ASPLOS'24☆26Updated last year