mistralai / TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆11Updated last year
Alternatives and similar repositories for TensorRT-LLM
Users that are interested in TensorRT-LLM are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 3 weeks ago
- AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch☆2Updated last year
- The purpose of this repository is to discuss on Audio transformers☆11Updated last week
- Rust bindings for CTranslate2☆14Updated last year
- ☆14Updated last year
- YouTube Assistant☆12Updated 2 years ago
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…☆11Updated last year
- Chat Complex PDF with Tables Using IBM WatsonX, Langchain and LlamaParser.☆13Updated last week
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆14Updated 3 weeks ago
- Simple CogVLM client script☆14Updated last year
- ☆19Updated this week
- Tools for merging pretrained large language models.☆19Updated 11 months ago
- ☆18Updated 2 years ago
- Object detection inference with Roboflow Train models on NVIDIA Jetson devices.☆13Updated last year
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆21Updated 7 months ago
- Scripts for pushing models to huggingface repos☆11Updated 4 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 7 months ago
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo☆26Updated 2 years ago
- An open source project built with Streamlit on Python, that focuses on curating awesome resources for learning awesome skills.☆15Updated 11 months ago
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆18Updated last year
- ☆20Updated last year
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Updated 2 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆19Updated 3 months ago
- ☆14Updated 11 months ago
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated last month
- ☆10Updated 11 months ago
- Unity WebGL template for Hugging Face Spaces☆14Updated 3 years ago
- Road segmentation with pytorch☆18Updated 3 years ago