Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.
☆42Sep 26, 2024Updated last year
Alternatives and similar repositories for cortex.tensorrt-llm
Users that are interested in cortex.tensorrt-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆43Jul 4, 2025Updated 9 months ago
- Jan.ai Website & Documentation☆35Oct 14, 2024Updated last year
- Efficient Finetuning for OpenAI GPT-OSS☆23Oct 2, 2025Updated 6 months ago
- 🌐 OpenCrawl: An ethical, high-performance web crawler built for scale A powerful web crawling library that respects robots.txt and rate…☆20Apr 3, 2025Updated last year
- A small utility library for parsing GGUF file info☆29Jan 27, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Attempt at cog wrapper for nightmareai/real-esrgan for larger images☆16Sep 28, 2023Updated 2 years ago
- Local AI API Platform☆2,762Jul 4, 2025Updated 9 months ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- ☆12Nov 8, 2023Updated 2 years ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- Source code for SuperAGI's Zapier Integration☆13Aug 20, 2023Updated 2 years ago
- A template for running Stable Diffusion 3 with Cog☆14Aug 20, 2024Updated last year
- ☆140Apr 23, 2024Updated last year
- Qualitative data analysis for text, images, audio, video. Cross platform. Python 3.8 or newer and PyQt6.☆11Apr 4, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Moved to here: https://github.com/lyogavin/airllm☆29Aug 1, 2024Updated last year
- Attempt at cog wrapper for segmind/SSD-1B☆10Dec 11, 2023Updated 2 years ago
- Run AuraFlow on Replicate☆14Jul 12, 2024Updated last year
- Attempt at cog wrapper for a SDXL CLIP Interrogator☆10May 16, 2024Updated last year
- Generative AI powered Note taking mobile app converts your voice recording or other audio file into short notes and customized questions …☆18Dec 8, 2023Updated 2 years ago
- Real-world Conversational AI personas.☆21Oct 23, 2023Updated 2 years ago
- A Discord bot that answers questions about Replicate.☆16Jan 5, 2024Updated 2 years ago
- Pay attention to what you're paying attention to.☆29May 17, 2022Updated 3 years ago
- Attempt at cog wrapper for SDXL Controlnet - Canny☆13Nov 25, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Safely push a Cog model version by making sure it works and is backwards-compatible with previous versions.☆16Dec 4, 2025Updated 4 months ago
- SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution☆14Jan 12, 2024Updated 2 years ago
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …☆15Apr 28, 2024Updated last year
- AI News Anchor Generator App built using Midjourney, D-ID, OpenAI, NewsAPI, and Streamlit.☆17Sep 18, 2023Updated 2 years ago
- Browser extensions for the Knowledge application☆33Jul 16, 2022Updated 3 years ago
- Cog wrapper for playgroundai/playground-v2.5-1024px-aesthetic☆17Nov 25, 2024Updated last year
- ☆28Dec 29, 2025Updated 3 months ago
- A cog implementation of Nvidia's Triton server☆18Oct 23, 2024Updated last year
- code☆13Jan 24, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.☆15Aug 25, 2024Updated last year
- ☆21Apr 29, 2024Updated last year
- ☆21Jan 15, 2026Updated 2 months ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- A sensible approach to writing common code for react and react-native☆16Jan 8, 2019Updated 7 years ago
- SYN flood implementation using Boost.Asio☆12Nov 20, 2014Updated 11 years ago
- Collection of useful and working Excel RTD server samples.☆12Sep 4, 2021Updated 4 years ago