mistralai / TensorRT-LLMLinks
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆15Updated last year
Alternatives and similar repositories for TensorRT-LLM
Users that are interested in TensorRT-LLM are comparing it to the libraries listed below
Sorting:
- ☆14Updated 2 years ago
- Deploy DL/ ML inference pipelines with minimal extra code.☆102Updated last year
- 3rd party dependencies for DALI project☆10Updated last week
- Visual similarity search engine demo with use of PyTorch Metric Learning and Qdrant☆12Updated 3 years ago
- Simple CogVLM client script☆14Updated 2 years ago
- Code for blog posts from OpenCV.AI☆16Updated 2 years ago
- Finetune any model on HF in less than 30 seconds☆56Updated this week
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆37Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence☆70Updated last year
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.☆14Updated 3 years ago
- ☆14Updated 2 years ago
- ☆13Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated 2 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 7 years ago
- Passively collect images for computer vision datasets on the edge.☆35Updated 2 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated last year
- ☆13Updated 2 years ago
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆138Updated last week
- Benchmarks for Business Document Foundation Models☆10Updated last year
- Demo combining Whisper for speech recognition and Google TTS for speech synthesis to interact with Alpaca-LoRA.☆20Updated last year
- ☆12Updated 7 months ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated 2 years ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆17Updated 2 years ago
- Google's Gemini implemented with GPT-4 Vision, Whisper and Resemble AI☆26Updated 2 years ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆34Updated 2 years ago
- This repository utilizes the Triton Inference Server Client, which streamlines the complexity of model deployment.☆21Updated last year
- A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo☆27Updated 3 years ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated 2 years ago
- A Python tool to solve logic games with AI, Deep Learning and Computer Vision☆17Updated 4 years ago
- Streamlit app to Translate text to or between 50 languages with mBART-50 from Huggingface and Facebook☆25Updated 4 years ago