mistralai / TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆15Updated last year
Alternatives and similar repositories for TensorRT-LLM
Users who are interested in TensorRT-LLM are comparing it to the libraries listed below
- ☆14Updated 2 years ago
- Code for blog posts from OpenCV.AI☆15Updated 2 years ago
- Deploy DL/ ML inference pipelines with minimal extra code.☆101Updated last year
- RAG-QA is a free, containerised question-answer framework that allows you to ask questions to your documents in an intuitive way☆18Updated last year
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.☆14Updated 3 years ago
- 👤 Human Face and 🎥 Object Detection using OpenCV☆13Updated 2 years ago
- ☆23Updated 3 months ago
- Faceprecision is a comprehensive face analysis project leveraging advanced deep learning and computer vision techniques. This project inc…☆14Updated last year
- NVIDIA Fleet Command is a hybrid-cloud platform for securely and remotely deploying, managing, and scaling AI across dozens or up to thou…☆13Updated 3 years ago
- An LLM-powered self-studying app using retrieval-augmented generation prompting | Streamlit LLM Hackathon 2023☆18Updated 2 years ago
- Your AI companion. ChatGPT and translation for Monocle AR☆22Updated last year
- ☆12Updated last year
- This repository will guide you to create your Images via Stable Diffusion using a Smart Virtual Assistant like Google Assistant using Ope…☆36Updated 2 years ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆12Updated last year
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆52Updated 9 months ago
- Vector search with Pinecone and OpenAI to search through a contract law textbook. If downloaded, remember to install all dependencies. Refer…☆13Updated 2 years ago
- ☆39Updated 11 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆33Updated last year
- Simple CogVLM client script☆13Updated last year
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 7 years ago
- ☆12Updated 6 months ago
- 3rd party dependencies for DALI project☆10Updated 3 weeks ago
- A bot that scrapes your jobs in real time, sorts them according to your preferences, and runs an alert☆16Updated last year
- Train a production-grade GPT in less than 400 lines of code. Better than Karpathy's version and GIGAGPT☆15Updated this week
- Finetune any model on HF in less than 30 seconds☆55Updated last month
- YOLOExplorer : Iterate on your YOLO / CV datasets using SQL, Vector semantic search, and more within seconds☆135Updated 2 weeks ago
- Animal Video Analysis Tool (AVAT)☆13Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- Alternative version of st.camera_input which returns the webcam images live, without any button press needed☆37Updated 3 months ago
- It's AI: LLM Detection Subnet (SN32)☆31Updated 2 months ago