mistralai / TensorRT-LLMLinks
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆13Updated last year
Alternatives and similar repositories for TensorRT-LLM
Users that are interested in TensorRT-LLM are comparing it to the libraries listed below
Sorting:
- AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch☆2Updated last year
- Example showing how to do inference on a video file with Roboflow Infer☆48Updated last year
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆12Updated 2 years ago
- Streamlit app to Translate text to or between 50 languages with mBART-50 from Huggingface and Facebook☆25Updated 4 years ago
- Finetune any model on HF in less than 30 seconds☆57Updated 3 weeks ago
- ☆14Updated last year
- Code for blog posts from OpenCV.AI☆15Updated 2 years ago
- Simple CogVLM client script☆14Updated last year
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆36Updated last year
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆13Updated last year
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- A Python tool to solve logic games with AI, Deep Learning and Computer Vision☆17Updated 4 years ago
- MehmetOKUYAR / Vehicles-Counting--Tracking-and-Speed-Estimation-with-YOLOv7-DeepSORT-Object-Tracking-and-Zone-Count☆33Updated 11 months ago
- Web-based tool to convert model into MyriadX blob☆17Updated 2 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆70Updated this week
- RAG-QA is a free, containerised question-answer framework that allows you to ask questions to your documents in an intuitive way☆15Updated last year
- An autonomous LLM-based agent that generates code to extract structured information from web pages and extracts it.☆10Updated 9 months ago
- Python text-to-speech library with built-in voice effects and support for multiple TTS engines☆23Updated 4 months ago
- Video chat apps with computer vision filters built on top of Streamlit☆50Updated 2 years ago
- Machine Learning Image Annotation Tool☆38Updated 7 years ago
- ☆15Updated last year
- Rust bindings for CTranslate2☆14Updated 2 years ago
- ☆19Updated 2 years ago
- Multiprocessing in python☆10Updated 3 years ago
- H&M Fashion Image similarity search with Weaviate and DocArray☆43Updated last year
- ☆19Updated 2 weeks ago
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆133Updated last week
- Road segmentation with pytorch☆18Updated 3 years ago
- A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo☆27Updated 2 years ago