mistralai / TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆14 Updated last year
Alternatives and similar repositories for TensorRT-LLM
Users who are interested in TensorRT-LLM are comparing it to the libraries listed below.
- ☆14 Updated last year
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant ☆12 Updated 2 years ago
- Code for blog posts from OpenCV.AI ☆15 Updated 2 years ago
- RAG-QA is a free, containerised question-answer framework that allows you to ask questions to your documents in an intuitive way ☆17 Updated last year
- Deploy DL/ML inference pipelines with minimal extra code. ☆99 Updated 10 months ago
- Simple CogVLM client script ☆14 Updated last year
- ☆12 Updated last year
- ☆14 Updated 2 years ago
- YoloV3Tiny for Android ☆14 Updated 6 years ago
- It's AI: LLM Detection Subnet (SN32) ☆29 Updated 2 weeks ago
- Example Code to Supplement the Label Studio Blog ☆28 Updated 2 weeks ago
- Imageinary is a reproducible mechanism which is used to generate large image datasets at various resolutions. The tool supports multiple … ☆27 Updated 2 years ago
- Example showing how to do inference on a video file with Roboflow Infer ☆48 Updated last year
- Faceprecision is a comprehensive face analysis project leveraging advanced deep learning and computer vision techniques. This project inc… ☆14 Updated last year
- Train a production-grade GPT in less than 400 lines of code. Better than Karpathy's version and GIGAGPT ☆16 Updated this week
- Benchmarking library for image manipulation detection. ☆15 Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆36 Updated last year
- Code for "Learning an adaptation function to assess image visual similarities", ICIP'21 ☆10 Updated 2 years ago
- 👤 Human Face and 🎥 Object Detection using OpenCV ☆13 Updated 2 years ago
- Passively collect images for computer vision datasets on the edge. ☆35 Updated last year
- A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo ☆27 Updated 2 years ago
- Submodule for Grounded-SAM ☆12 Updated 2 years ago
- A collection of models for TensorFlow Go ☆12 Updated 3 years ago
- Computer Vision Helping Library ☆47 Updated 10 months ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA ☆14 Updated 3 years ago
- Finetune any model on HF in less than 30 seconds ☆57 Updated 2 weeks ago
- This repository utilizes the Triton Inference Server Client, which streamlines the complexity of model deployment. ☆21 Updated last year
- This repository is a comprehensive project that leverages the XLM-Roberta model for intent detection. This repository is a valuable resou… ☆14 Updated last year
- Web-based tool to convert model into MyriadX blob ☆17 Updated 3 months ago
- Interactive Textbook Demo ☆47 Updated last year