basetenlabs / Workshop-TRT-LLMLinks
☆19Updated last year
Alternatives and similar repositories for Workshop-TRT-LLM
Users that are interested in Workshop-TRT-LLM are comparing it to the libraries listed below
Sorting:
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 months ago
- ☆87Updated last year
- An introduction to LLM Sampling☆79Updated 9 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- Build Agentic workflows with function calling using open LLMs☆28Updated 3 weeks ago
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆50Updated last week
- A miniature version of Modal☆20Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 7 months ago
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆74Updated 6 months ago
- ☆80Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated 11 months ago
- ☆23Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Updated 2 years ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆31Updated last year
- 👷 Build compute kernels☆143Updated this week
- ☆145Updated last year
- ☆31Updated 10 months ago
- ☆124Updated 10 months ago
- LLM training in simple, raw C/CUDA☆15Updated 9 months ago
- Google TPU optimizations for transformers models☆120Updated 8 months ago
- ☆46Updated last year
- ☆135Updated last month
- ☆170Updated last year
- Fun project: LLM powered RAG Discord Bot that works seamlessly on CPU☆32Updated last year
- I learn about and explain quantization☆26Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- Cray-LM unified training and inference stack.☆22Updated 7 months ago
- Collection of autoregressive model implementation☆86Updated 4 months ago