mistralai / TensorRT-LLMLinks

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

☆12

Alternatives and similar repositories for TensorRT-LLM

Users that are interested in TensorRT-LLM are comparing it to the libraries listed below

Sorting:

langchain-ai / multi-modal-code-agent
☆14Updated last year
oneapi-src / voice-data-generation
AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch
☆2Updated last year
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆23Updated last week
jquesnelle / ctranslate2-rs
Rust bindings for CTranslate2
☆14Updated last year
rajshah4 / huggingface-demos
☆18Updated 2 years ago
kyegomez / GPT4
The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal]
☆10Updated last year
kavishgambhir / xy-cut-tree
Segmenting a given document using recursive xy-cut algorithm.
☆12Updated 6 years ago
DOUDOU0314 / GPT-J-hf
GPT-jax based on the official huggingface library
☆13Updated 3 years ago
kyegomez / VisionLLaMA
Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
☆16Updated 6 months ago
AI4WA / OpenOmniFramework
Multimodal Open Source Framework for Conversational Agent Research and Development.
☆19Updated 3 months ago
kyegomez / CogNetX
CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…
☆14Updated last week
v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19Updated 11 months ago
haotian-liu / transformers_llava
☆13Updated 2 years ago
TheoCoombes / crawlingathome-server
A server powering LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.
☆13Updated 2 years ago
vast-ai / vast-pyworker
☆11Updated 2 weeks ago
sachink1729 / DSPy-Multi-Hop-Chain-of-Thought-RAG
Discover advanced AI techniques in my repository combining Multi-Hop Chain of Thought (CoT) and Retrieval-Augmented Generation (RAG) usin…
☆14Updated 10 months ago
fabridigua / LogicGamesSolver
A Python tool to solve logic games with AI, Deep Learning and Computer Vision
☆16Updated 4 years ago
joheras / Lecturas
☆19Updated this week
Jaykef / min-patchnizer
Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…
☆11Updated last year
johanndiep / language-models-trajectory-generators
This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…
☆21Updated 8 months ago
iantbutler01 / ditty
A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.
☆16Updated 7 months ago
mmhamdy / open-language-models
A list of language models with permissive licenses such as MIT or Apache 2.0
☆24Updated 3 months ago
echogarden-project / whisper-onnx-exporter
Exports OpenAI Whisper speech recognition models to ONNX. Mainly intended for use with Echogarden.
☆10Updated 7 months ago
nreHieW / loss
Visualising Losses in Deep Neural Networks
☆16Updated 10 months ago
data2ml / all-clip
Load any clip model with a standardized interface
☆21Updated last year
PINTO0309 / mtomo
Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the oper…
☆24Updated 4 years ago
llm-council / llm-council
LLMs sitting on a council together to decide, by consensus, who among them is the best.
☆15Updated last month
capjamesg / sam-clip
Use Grounding DINO, Segment Anything, and CLIP to label objects in images.
☆31Updated last year
Nachimak28 / LAI-voice-search-openai-whisper-demo
A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo
☆26Updated 2 years ago
stanfordnlp / huggingface-models
Scripts for pushing models to huggingface repos
☆12Updated 5 months ago