basetenlabs / Workshop-TRT-LLM
☆17Updated 9 months ago
Alternatives and similar repositories for Workshop-TRT-LLM:
Users that are interested in Workshop-TRT-LLM are comparing it to the libraries listed below
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆107Updated this week
- ☆19Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- ☆77Updated 10 months ago
- ☆19Updated 8 months ago
- Cray-LM unified training and inference stack.☆22Updated 2 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆30Updated 7 months ago
- ☆24Updated last year
- ☆28Updated 5 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆106Updated 6 months ago
- A miniature version of Modal☆20Updated 10 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 6 months ago
- ☆48Updated last year
- Verbosity control for AI agents☆61Updated 10 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- An introduction to LLM Sampling☆77Updated 3 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 3 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆72Updated last month
- Build Agentic workflows with function calling using open LLMs☆26Updated last week
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆38Updated last month
- ☆78Updated 10 months ago
- ☆48Updated 5 months ago
- Train, tune, and infer Bamba model☆88Updated 2 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆48Updated last week
- ☆85Updated last year
- Collection of autoregressive model implementation☆85Updated 2 months ago