mistralai / TensorRT-LLMLinks
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
☆12Updated last year
Alternatives and similar repositories for TensorRT-LLM
Users that are interested in TensorRT-LLM are comparing it to the libraries listed below
Sorting:
- ☆14Updated last year
- AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch☆2Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last week
- Rust bindings for CTranslate2☆14Updated last year
- ☆18Updated 2 years ago
- The open source implementation of the base model behind GPT-4 from OPENAI [Language + Multi-Modal]☆10Updated last year
- Segmenting a given document using recursive xy-cut algorithm.☆12Updated 6 years ago
- GPT-jax based on the official huggingface library☆13Updated 3 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated 6 months ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆19Updated 3 months ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆14Updated last week
- Tools for merging pretrained large language models.☆19Updated 11 months ago
- ☆13Updated 2 years ago
- A server powering LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset.☆13Updated 2 years ago
- ☆11Updated 2 weeks ago
- Discover advanced AI techniques in my repository combining Multi-Hop Chain of Thought (CoT) and Retrieval-Augmented Generation (RAG) usin…☆14Updated 10 months ago
- A Python tool to solve logic games with AI, Deep Learning and Computer Vision☆16Updated 4 years ago
- ☆19Updated this week
- Minimal, clean code for video/image "patchnization" - a process commonly used in tokenizing visual data for use in a Transformer encoder.…☆11Updated last year
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆21Updated 8 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 7 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 3 months ago
- Exports OpenAI Whisper speech recognition models to ONNX. Mainly intended for use with Echogarden.☆10Updated 7 months ago
- Visualising Losses in Deep Neural Networks☆16Updated 10 months ago
- Load any clip model with a standardized interface☆21Updated last year
- Multiple types of NN model optimization environments. It is possible to directly access the host PC GUI and the camera to verify the oper…☆24Updated 4 years ago
- LLMs sitting on a council together to decide, by consensus, who among them is the best.☆15Updated last month
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆31Updated last year
- A ⚡️ Lightning.ai ⚡️ app demo for Voice based web search using OpenAI's Whisper and DuckDuckGo☆26Updated 2 years ago
- Scripts for pushing models to huggingface repos☆12Updated 5 months ago