mistralai / TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆11Updated last year
Alternatives and similar repositories for TensorRT-LLM:
Users that are interested in TensorRT-LLM are comparing it to the libraries listed below
- AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch☆2Updated last year
- ☆14Updated last year
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…☆11Updated 11 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- EdgeSAM model for use with Autodistill.☆26Updated 10 months ago
- Chat Complex PDF with Tables Using IBM WatsonX, Langchain and LlamaParser.☆13Updated last year
- Rust bindings for CTranslate2☆14Updated last year
- Create topological graph for image segments.☆22Updated 6 months ago
- Object detection inference with Roboflow Train models on NVIDIA Jetson devices.☆13Updated last year
- ☆13Updated last year
- A server powering LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆13Updated 2 years ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated this week
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆35Updated last year
- LLMs sitting on a council together to decide, by consensus, who among them is the best.☆14Updated 2 weeks ago
- Code for blog posts from OpenCV.AI☆15Updated last year
- ☆11Updated 2 months ago
- Solving Computer Vision with AI agents☆29Updated this week
- ☆10Updated 10 months ago
- Notebooks to demonstrate TimmWrapper☆16Updated 3 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 6 months ago
- Integrate an LLM copilot within your Keras model development workflow☆28Updated last year
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆12Updated 5 months ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆14Updated this week
- Benchmarking vision language vision on face tasks☆12Updated 3 weeks ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15Updated 3 years ago
- Tools for merging pretrained large language models.☆19Updated 10 months ago
- BH hackathon☆14Updated last year
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆21Updated 6 months ago
- DocGenius AI - Generative AI Chatbot for your Documents☆11Updated last month
- A repository for creating, and sample code for consuming an ONNX embedding model☆31Updated last year