Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning
☆55Feb 22, 2025Updated last year
Alternatives and similar repositories for gh200-llm
Users that are interested in gh200-llm are comparing it to the libraries listed below
Sorting:
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Oct 3, 2023Updated 2 years ago
- ☆18May 19, 2023Updated 2 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- CPU and GPU tutorial examples☆13Apr 4, 2025Updated 11 months ago
- ☆72Mar 26, 2025Updated 11 months ago
- ✍️ A browser add-on (Firefox, Chrome, Thunderbird) that allows you to autocorrect common text sequences and convert text characters to a …☆12Feb 9, 2026Updated last month
- 🚀 LLM inference optimization simulator, modeling compute-bound prefill and memory-bound decode phases.☆14Jul 12, 2025Updated 8 months ago
- 这是一个大学四年的cs基础课部分专业课的复习笔记的扫描版备份仓库☆12Jun 29, 2019Updated 6 years ago
- ☆13Jan 7, 2025Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated 11 months ago
- Official implementation for Training Certifiably Robust Neural Networks with Efficient Local Lipschitz Bounds (NeurIPS, 2021).☆25Sep 4, 2022Updated 3 years ago
- A Triton-only attention backend for vLLM☆24Feb 11, 2026Updated last month
- ☆13May 23, 2021Updated 4 years ago
- Argumentative microtexts annotated with RST, SDRT and argumentation structure☆12Jun 19, 2016Updated 9 years ago
- extensible collectives library in triton☆97Mar 31, 2025Updated 11 months ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- train with kittens!☆64Oct 25, 2024Updated last year
- scalable data movement in Exascale Supercomputers☆17Updated this week
- ☆14Aug 25, 2024Updated last year
- ☆11Mar 27, 2024Updated last year
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Feb 14, 2023Updated 3 years ago
- Collect papers related to personalized text generation☆18Sep 6, 2021Updated 4 years ago
- β-CROWN: Efficient Bound Propagation with Per-neuron Split Constraints for Neural Network Verification☆31Nov 9, 2021Updated 4 years ago
- Code for "Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models (CoNLL 2018)"☆15Feb 6, 2019Updated 7 years ago
- Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text☆18Mar 31, 2025Updated 11 months ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆19Oct 19, 2025Updated 5 months ago
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- Automatic OCR of clipboard contents.☆14Aug 12, 2022Updated 3 years ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆106Jun 28, 2025Updated 8 months ago
- Boosting 4-bit inference kernels with 2:4 Sparsity☆94Sep 4, 2024Updated last year
- DEPRECATED - THIS WAS A PROTOTYPE. Check out CUDA-QX. -- A collection of application-level libraries for the CUDA-Q.☆15Sep 20, 2024Updated last year
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- Fast low-bit matmul kernels in Triton☆438Feb 1, 2026Updated last month
- Pytorch distributed backend extension with compression support☆17Mar 24, 2025Updated 11 months ago
- QCLAB++☆18Mar 1, 2023Updated 3 years ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- ☆80Dec 27, 2024Updated last year
- vLLM Daily Summarization of Merged PRs☆48Updated this week