Docker image for NVIDIA GH200 machines, optimized for vLLM serving and HF Trainer fine-tuning
☆56Feb 22, 2025Updated last year
Alternatives and similar repositories for gh200-llm
Users that are interested in gh200-llm are comparing it to the libraries listed below.
- ☆17May 19, 2023Updated 2 years ago
- CPU and GPU tutorial examples☆13Apr 4, 2025Updated last year
- ☆72Mar 26, 2025Updated last year
- ☆19Aug 10, 2024Updated last year
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆14Jul 12, 2025Updated 8 months ago
- ✍️ A browser add-on (Firefox, Chrome, Thunderbird) that allows you to autocorrect common text sequences and convert text characters to a …☆12Feb 9, 2026Updated 2 months ago
- A benchmark of real-world DL kernel problems☆160Apr 2, 2026Updated last week
- A backup repository of scanned review notes for CS foundation courses and some major courses from four years of university☆12Jun 29, 2019Updated 6 years ago
- ☆13Jan 7, 2025Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated last year
- ☆21Mar 29, 2026Updated last week
- ☆13May 23, 2021Updated 4 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- extensible collectives library in triton☆98Mar 31, 2025Updated last year
- train with kittens!☆64Oct 25, 2024Updated last year
- Source code and dataset for paper "End-to-End Transition-Based Online Dialogue Disentanglement"☆17May 17, 2021Updated 4 years ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆20Oct 19, 2025Updated 5 months ago
- Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text☆18Mar 31, 2025Updated last year
- Automatic OCR of clipboard contents.☆14Aug 12, 2022Updated 3 years ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆106Jun 28, 2025Updated 9 months ago
- Alice in Wonderland code base for experiments and raw experiments data☆131Feb 4, 2026Updated 2 months ago
- Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗☆13Sep 24, 2024Updated last year
- Image caption and manage tool for AI training☆11Jan 24, 2025Updated last year
- Fast low-bit matmul kernels in Triton☆443Updated this week
- Pytorch distributed backend extension with compression support☆17Mar 24, 2025Updated last year
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- ☆52Apr 1, 2026Updated last week
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- Easily run PyTorch on multiple GPUs & machines☆60Jan 8, 2026Updated 3 months ago
- ☆80Dec 27, 2024Updated last year
- vLLM Daily Summarization of Merged PRs☆49Updated this week
- [BMVC2020] Image Harmonization with Attention-based Deep Feature Modulation☆12Feb 3, 2021Updated 5 years ago
- Official Implementation of K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction.☆20Jul 8, 2025Updated 9 months ago
- ☆66Updated this week
- Quick start boilerplate for a Node API Deployable to Bluemix☆15Oct 24, 2021Updated 4 years ago
- TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference☆14Feb 22, 2022Updated 4 years ago
- All available LTX-2 models, encoders, workflows, LoRAs for ComfyUI☆272Apr 2, 2026Updated last week
- [NeurIPS 2023] Token-Scaled Logit Distillation for Ternary Weight Generative Language Models☆18Dec 6, 2023Updated 2 years ago
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆29Jan 28, 2024Updated 2 years ago