quic / cloud-ai-sdk
Qualcomm Cloud AI SDK (Platform and Apps) enables high-performance deep learning inference on Qualcomm Cloud AI platforms, delivering high throughput and low latency across Computer Vision, Object Detection, Natural Language Processing, and Generative AI models.
☆56 Updated 4 months ago
Alternatives and similar repositories for cloud-ai-sdk:
Users who are interested in cloud-ai-sdk are comparing it to the libraries listed below.
- This library empowers users to seamlessly port pretrained models and checkpoints on the HuggingFace (HF) hub (developed using HF transfor… ☆60 Updated this week
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev… ☆57 Updated this week
- Notes on quantization in neural networks ☆77 Updated last year
- ☆202 Updated 3 years ago
- ☆141 Updated 2 years ago
- CUDA Matrix Multiplication Optimization ☆173 Updated 8 months ago
- ☆151 Updated last year
- Cataloging released Triton kernels. ☆204 Updated 2 months ago
- Applied AI experiments and examples for PyTorch ☆249 Updated this week
- Fast low-bit matmul kernels in Triton ☆267 Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆314 Updated this week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU) ☆177 Updated this week
- QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference ☆116 Updated last year
- NVIDIA tools guide ☆117 Updated 2 months ago
- This repository contains the experimental PyTorch native float8 training UX ☆222 Updated 7 months ago
- Model compression for ONNX