Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning
☆56Jun 22, 2026Updated last week
Alternatives and similar repositories for gh200-llm
Users that are interested in gh200-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Utilities for efficient fine-tuning, inference and evaluation of code generation models☆21Oct 3, 2023Updated 2 years ago
- ☆17May 19, 2023Updated 3 years ago
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated 2 years ago
- ☆117May 10, 2026Updated last month
- ☆75Mar 26, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆19Aug 10, 2024Updated last year
- ☆13Jan 7, 2025Updated last year
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Apr 9, 2025Updated last year
- Legacy Code of ZJU Campus App for iOS☆11Jan 31, 2024Updated 2 years ago
- A Triton-only attention backend for vLLM☆27Mar 17, 2026Updated 3 months ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Dec 21, 2022Updated 3 years ago
- A benchmark of real-world DL kernel problems☆238May 28, 2026Updated last month
- [arXiv 2025] Pre-training script for Clinical ModernBERT☆35Apr 29, 2025Updated last year
- extensible collectives library in triton☆98Mar 31, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- scalable data movement in Exascale Supercomputers☆19Mar 30, 2026Updated 3 months ago
- train with kittens!☆66Oct 25, 2024Updated last year
- ☆11Mar 27, 2024Updated 2 years ago
- Collect papers related to personalized text generation☆18Sep 6, 2021Updated 4 years ago
- Source code and dataset for paper "End-to-End Transition-Based Online Dialogue Disentanglement"☆17May 17, 2021Updated 5 years ago
- Follow-Up Differential Descriptions: Language Models Resolve Ambiguities for Image Classification☆11Nov 15, 2023Updated 2 years ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Apr 18, 2023Updated 3 years ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆112Jun 28, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Boosting 4-bit inference kernels with 2:4 Sparsity☆95Sep 4, 2024Updated last year
- Alice in Wonderland code base for experiments and raw experiments data☆129Feb 4, 2026Updated 4 months ago
- Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗☆13Sep 24, 2024Updated last year
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- A collection of various custom nodes for ComfyUI (Work in progress)☆14Jun 9, 2025Updated last year
- Image caption and manage tool for AI training☆11Jan 24, 2025Updated last year
- ☆22Apr 17, 2025Updated last year
- Fast low-bit matmul kernels in Triton☆473May 15, 2026Updated last month
- Pytorch distributed backend extension with compression support☆17Mar 24, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- ☆11Oct 2, 2024Updated last year
- ☆82Dec 27, 2024Updated last year
- vLLM Daily Summarization of Merged PRs☆51Jun 24, 2026Updated last week
- ☆106Sep 9, 2024Updated last year
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated 2 years ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆25Jun 6, 2024Updated 2 years ago