Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.
☆42Sep 26, 2024Updated last year
Alternatives and similar repositories for cortex.tensorrt-llm
Users that are interested in cortex.tensorrt-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆44Jul 4, 2025Updated 10 months ago
- Ikigai is an AI-powered Open Assignment System☆35Oct 9, 2024Updated last year
- Jan.ai Website & Documentation☆37Oct 14, 2024Updated last year
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 7 months ago
- 🌐 OpenCrawl: An ethical, high-performance web crawler built for scale A powerful web crawling library that respects robots.txt and rate…☆22Apr 3, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A small utility library for parsing GGUF file info☆29Jan 27, 2025Updated last year
- Attempt at cog wrapper for nightmareai/real-esrgan for larger images☆16Sep 28, 2023Updated 2 years ago
- Local AI API Platform☆2,760Jul 4, 2025Updated 10 months ago
- Inference Llama/Llama2/Llama3 Modes in NumPy☆20Nov 22, 2023Updated 2 years ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- ☆12Nov 8, 2023Updated 2 years ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- NVIDIA TensorRT-RTX is an SDK for high-performance AI inference on NVIDIA RTX GPUs. This repository contains Open-Source Software compone…☆98Mar 18, 2026Updated last month
- A template for running Stable Diffusion 3 with Cog☆14Aug 20, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Cog wrapper for microsoft/OmniParser-v2☆12Feb 25, 2025Updated last year
- AgentOS is a lightweight, single-file implementation that provides a robust foundation for building autonomous AI agents. It implements t…☆23Jul 11, 2025Updated 9 months ago
- Moved to here: https://github.com/lyogavin/airllm☆29Aug 1, 2024Updated last year
- Attempt at cog wrapper for segmind/SSD-1B☆10Dec 11, 2023Updated 2 years ago
- Run AuraFlow on Replicate☆14Jul 12, 2024Updated last year
- Examples using MLX Swift☆13Apr 9, 2025Updated last year
- Attempt at cog wrapper for a SDXL CLIP Interrogator☆10May 16, 2024Updated last year
- Cog wrapper for FalconsAi / nsfw_image_detection☆18Aug 6, 2025Updated 9 months ago
- Generative AI powered Note taking mobile app converts your voice recording or other audio file into short notes and customized questions …☆18Dec 8, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Discord bot that answers questions about Replicate.☆16Jan 5, 2024Updated 2 years ago
- Pay attention to what you're paying attention to.☆29May 17, 2022Updated 3 years ago
- AI News Anchor Generator App built using Midjourney, D-ID, OpenAI, NewsAPI, and Streamlit.☆17Sep 18, 2023Updated 2 years ago
- Cog wrapper for playgroundai/playground-v2.5-1024px-aesthetic☆17Nov 25, 2024Updated last year
- ☆21Feb 20, 2023Updated 3 years ago
- Cog wrapper for canopylabs/orpheus-3b-0.1-ft☆22Mar 20, 2025Updated last year
- A cog implementation of Nvidia's Triton server☆18Oct 23, 2024Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.☆15Aug 25, 2024Updated last year
- ☆21Apr 29, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Taming Stable Diffusion for Lip Sync!☆16Mar 18, 2025Updated last year
- The official Python library for Formulaic☆18Apr 25, 2024Updated 2 years ago
- Memory-optimized training scripts for video models based on Diffusers☆16Jan 3, 2025Updated last year
- Cog wrapper for PASD Magnify☆17Jan 8, 2024Updated 2 years ago
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆21Sep 20, 2024Updated last year
- ☆59Nov 21, 2024Updated last year