mistralai / TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆11Updated last year
Alternatives and similar repositories for TensorRT-LLM:
Users who are interested in TensorRT-LLM are comparing it to the repositories listed below:
- AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for PyTorch☆2Updated last year
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆14Updated 2 weeks ago
- Train a production-grade GPT in less than 400 lines of code. Better than Karpathy's version and GIGAGPT☆16Updated 2 weeks ago
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…☆11Updated 10 months ago
- LLMs sitting on a council together to decide, by consensus, who among them is the best.☆12Updated last month
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- A ⚡️ Lightning.ai ⚡️ app demo for voice-based web search using OpenAI's Whisper and DuckDuckGo☆26Updated 2 years ago
- A Python tool to solve logic games with AI, Deep Learning and Computer Vision☆16Updated 4 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 2 weeks ago
- Simple script to re-rank images using OpenAI's CLIP https://github.com/openai/CLIP.☆15Updated 3 years ago
- Unstract's interface to LLMs, Embeddings and VectorDBs.☆18Updated 8 months ago
- Tracking Of Agent (actions and belief) and Spatio-TEmporal Reasoning☆14Updated 5 years ago
- Simple CogVLM client script☆14Updated last year
- A library for editing and rendering motion of 3D characters with deep learning.☆10Updated last year
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.☆13Updated 3 years ago
- YouTube Assistant☆12Updated last year
- Load any CLIP model with a standardized interface☆21Updated 11 months ago
- Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text"☆12Updated 5 years ago
- Experiments and tutorials with and for torchaudio☆13Updated 3 years ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆10Updated last year
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆11Updated 8 months ago