mistralai / TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆14 · Updated last year
Alternatives and similar repositories for TensorRT-LLM
Users who are interested in TensorRT-LLM compare it to the libraries listed below.
- ☆14 · Updated last year
- Simple CogVLM client script ☆13 · Updated last year
- Streamlit app to translate text to or between 50 languages with mBART-50 from Huggingface and Facebook ☆25 · Updated 4 years ago
- Code for blog posts from OpenCV.AI ☆15 · Updated 2 years ago
- PyTorch implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆24 · Updated this week
- ☆12 · Updated 4 months ago
- A ⚡️ Lightning.ai ⚡️ app demo for voice-based web search using OpenAI's Whisper and DuckDuckGo ☆27 · Updated 2 years ago
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images. ☆14 · Updated 3 years ago
- Faceprecision is a comprehensive face analysis project leveraging advanced deep learning and computer vision techniques. This project inc… ☆14 · Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode… ☆12 · Updated last year
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta ☆16 · Updated 11 months ago
- Scripts, data, and research related to cow weight and breed prediction ☆13 · Updated last month
- Multiprocessing in Python ☆10 · Updated 4 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆37 · Updated 2 years ago
- Chat with complex PDFs containing tables using IBM WatsonX, Langchain and LlamaParser. ☆14 · Updated last month
- NVIDIA Fleet Command is a hybrid-cloud platform for securely and remotely deploying, managing, and scaling AI across dozens or up to thou… ☆13 · Updated 3 years ago
- YouTube Assistant ☆12 · Updated 2 years ago
- The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PC… ☆176 · Updated 11 months ago
- 👤 Human Face and 🎥 Object Detection using OpenCV ☆13 · Updated 2 years ago
- Interactive Textbook Demo ☆48 · Updated last year
- 3rd party dependencies for DALI project ☆10 · Updated this week
- RAG-QA is a free, containerised question-answer framework that allows you to ask questions to your documents in an intuitive way ☆17 · Updated last year
- EdgeSAM model for use with Autodistill. ☆29 · Updated last year
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs) ☆25 · Updated 2 years ago
- Animal Video Analysis Tool (AVAT) ☆13 · Updated last year
- Imageinary is a reproducible mechanism which is used to generate large image datasets at various resolutions. The tool supports multiple … ☆26 · Updated 2 years ago
- Example Code to Supplement the Label Studio Blog ☆28 · Updated 2 weeks ago
- YOLOExplorer: Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds ☆134 · Updated last week
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce… ☆17 · Updated this week
- Rust bindings for CTranslate2 ☆14 · Updated 2 years ago