mistralai / TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆10 Updated 11 months ago
Alternatives and similar repositories for TensorRT-LLM:
Users who are interested in TensorRT-LLM are comparing it to the libraries listed below:
- Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the oper… ☆24 Updated 3 years ago
- Tools for merging pretrained large language models. ☆19 Updated 7 months ago
- YouTube Assistant ☆12 Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 Updated 2 months ago
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.… ☆11 Updated 8 months ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ☆40 Updated 4 months ago
- LoRA fine-tuned Stable Diffusion Deployment ☆31 Updated last year
- Simple CogVLM client script ☆14 Updated last year
- EdgeSAM model for use with Autodistill. ☆26 Updated 7 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 Updated this week
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data ☆21 Updated 5 months ago
- AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for PyTorch ☆2 Updated 11 months ago
- Computer Vision Helping Library ☆17 Updated 2 months ago
- Real-time, YOLO-like object detection using the Florence-2-base-ft model with a user-friendly GUI. ☆16 Updated 2 weeks ago
- A ⚡️ Lightning.ai ⚡️ app demo for voice-based web search using OpenAI's Whisper and DuckDuckGo ☆26 Updated 2 years ago
- OcSort-Pip: Packaged version of the OcSort repository ☆14 Updated 2 years ago
- This repository contains a fork of "language-models-trajectory-generators"; the goal is to test the same functionality with Mistral's LL… ☆20 Updated 3 months ago
- Build Agentic workflows with function calling ☆26 Updated this week
- SAM-CLIP module for use with Autodistill. ☆13 Updated last year
- This repository stores the source code for the Mistral Hackathon 2024 in Paris ☆16 Updated 4 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆14 Updated 3 months ago
- Code for blog posts from OpenCV.AI ☆15 Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images. ☆23 Updated last year
- Chat Complex PDF with Tables Using IBM WatsonX, Langchain and LlamaParser. ☆11 Updated 8 months ago