basetenlabs / Workshop-TRT-LLMLinks

☆19

Alternatives and similar repositories for Workshop-TRT-LLM

Users that are interested in Workshop-TRT-LLM are comparing it to the libraries listed below

Sorting:

anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆112Updated 2 months ago
charlesfrye / minimodal
A miniature version of Modal
☆20Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 7 months ago
tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆110Updated 10 months ago
huggingface / competitions
☆124Updated 9 months ago
aniketmaurya / Agents
Build Agentic workflows with function calling using open LLMs
☆28Updated this week
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆117Updated 6 months ago
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated last year
YuchenJin / llm.c
LLM training in simple, raw C/CUDA
☆15Updated 8 months ago
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆188Updated 2 months ago
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33Updated 2 months ago
Pleias / Various-Finetuning
Set of scripts to finetune LLMs
☆37Updated last year
stas00 / ml-ways
ML/DL Math and Method notes
☆62Updated last year
davanstrien / data-for-fine-tuning-llms
☆79Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆137Updated last year
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
huggingface / gpt-oss-recipes
Collection of scripts and notebooks for OpenAI's latest GPT OSS models
☆222Updated this week
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆81Updated last week
Alignment-Lab-AI / datagen
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆30Updated 10 months ago
pacman100 / peft-codegen-25
☆23Updated 2 years ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆268Updated last year
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated last year
aniketmaurya / discord-llm-bot
Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU
☆32Updated last year
apple / ml-hypercloning
☆48Updated 9 months ago
huggingface / kernel-builder
👷 Build compute kernels
☆87Updated last week
Arize-ai / LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆102Updated last year
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59Updated last year
lancedb / ragged
☆20Updated 9 months ago
cray-lm / cray-lm
Cray-LM unified training and inference stack.
☆22Updated 6 months ago