☆25Jun 26, 2024Updated last year
Alternatives and similar repositories for Workshop-TRT-LLM
Users that are interested in Workshop-TRT-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Oct 22, 2023Updated 2 years ago
- Code for training & inference with FLAN family of models☆17May 23, 2023Updated 3 years ago
- ☆18Jan 8, 2023Updated 3 years ago
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- Exploring how optimizations for GEMMs work☆36Feb 28, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- generate synthetic data for LLM fine-tuning in arbitrary situations within systematic way☆22Mar 18, 2024Updated 2 years ago
- This repository contains expert evaluation interface and data evaluation script for the OpenScholar project.☆42Nov 19, 2024Updated last year
- ☆58Aug 24, 2024Updated last year
- ☆13Sep 28, 2021Updated 4 years ago
- ☆22May 5, 2025Updated last year
- Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits☆45Jan 8, 2026Updated 5 months ago
- AutoGluon Docker☆12Apr 17, 2020Updated 6 years ago
- a collection of skills for vllm-omni☆76Updated this week
- Solo Podcast Creation from Web Page content☆19Sep 23, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Jun 17, 2019Updated 7 years ago
- Development containers for triton and triton-cpu☆28Jun 3, 2026Updated 2 weeks ago
- Code showing how to use a model based on the ML model base class.☆10Sep 30, 2022Updated 3 years ago
- Learn Huff through annotated examples.☆31May 31, 2022Updated 4 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- Notes Intel Edge AI☆20Feb 15, 2020Updated 6 years ago
- Contextual knowledge bases☆25Jun 30, 2022Updated 3 years ago
- 训练营训练方向项目☆27Jan 28, 2026Updated 4 months ago
- ☆15Nov 18, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Iterated prisoner's dilemma tournaments implemented with Cairo☆25Jul 10, 2022Updated 3 years ago
- API for coordinating Maintenance in Kubernetes.☆26Jul 18, 2025Updated 11 months ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- ☆13Jun 18, 2024Updated 2 years ago
- Pipeline Parallelism Emulation and Visualization☆83Jan 8, 2026Updated 5 months ago
- llama.cpp to PyTorch Converter☆38Apr 8, 2024Updated 2 years ago
- RAGStack is an out of the box solution simplifying Retrieval Augmented Generation (RAG) in AI apps.☆189Mar 12, 2026Updated 3 months ago
- Kubeflow on OpenShift☆14Jan 24, 2019Updated 7 years ago
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- train entropix like a champ!☆20Oct 10, 2024Updated last year
- Mining Twitter for Disaster Response☆13Feb 20, 2017Updated 9 years ago
- A Python DSL to write Nvidia PTX for Hopper and Blackwell in JAX and PyTorch☆311May 8, 2026Updated last month
- vLLM Daily Summarization of Merged PRs☆51Updated this week
- A zero-config OpenAI client with support for 20+ providers, API key rotation, rate limits, optional LangChain integration and more.☆19Dec 11, 2025Updated 6 months ago
- Chef cookbooks for managing a Ceph cluster☆12Apr 2, 2023Updated 3 years ago
- EleutherAI ML Performance reading group repository (slides, meeting recordings, annotated papers)☆33Mar 20, 2026Updated 2 months ago