Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.
☆42Sep 26, 2024Updated last year
Alternatives and similar repositories for cortex.tensorrt-llm
Users that are interested in cortex.tensorrt-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆44Jul 4, 2025Updated 10 months ago
- Jan.ai Website & Documentation☆37Oct 14, 2024Updated last year
- ☆20Mar 25, 2025Updated last year
- Efficient Finetuning for OpenAI GPT-OSS☆24Oct 2, 2025Updated 7 months ago
- 🌐 OpenCrawl: An ethical, high-performance web crawler built for scale A powerful web crawling library that respects robots.txt and rate…☆23Apr 3, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Local AI API Platform☆2,756Jul 4, 2025Updated 10 months ago
- Inference Llama/Llama2/Llama3 Modes in NumPy☆20Nov 22, 2023Updated 2 years ago
- Ichigo Whisper is a compact (22M parameters), open-source speech tokenizer for the Whisper-medium, designed to enhance performance on mul…☆17Jan 20, 2025Updated last year
- ☆12Nov 8, 2023Updated 2 years ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- Source code for SuperAGI's Zapier Integration☆13Aug 20, 2023Updated 2 years ago
- Cog wrapper for microsoft/OmniParser-v2☆12Feb 25, 2025Updated last year
- Moved to here: https://github.com/lyogavin/airllm☆31Aug 1, 2024Updated last year
- Qualitative data analysis for text, images, audio, video. Cross platform. Python 3.8 or newer and PyQt6.☆13Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Attempt at cog wrapper for segmind/SSD-1B☆10Dec 11, 2023Updated 2 years ago
- Run AuraFlow on Replicate☆14Jul 12, 2024Updated last year
- Examples using MLX Swift☆13Apr 9, 2025Updated last year
- Attempt at cog wrapper for a SDXL CLIP Interrogator☆10May 16, 2024Updated 2 years ago
- Artistic Data Visualization☆11May 29, 2018Updated 7 years ago
- Cog wrapper for FalconsAi / nsfw_image_detection☆18Aug 6, 2025Updated 9 months ago
- Pay attention to what you're paying attention to.☆29May 17, 2022Updated 4 years ago
- Attempt at cog wrapper for SDXL Controlnet - Canny☆13Nov 25, 2024Updated last year
- Safely push a Cog model version by making sure it works and is backwards-compatible with previous versions.☆16Dec 4, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution☆14Jan 12, 2024Updated 2 years ago
- Real-world Conversational AI personas.☆22Oct 23, 2023Updated 2 years ago
- ☆19Dec 4, 2025Updated 5 months ago
- The largest open source arabic words list☆16Oct 18, 2021Updated 4 years ago
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …☆15Apr 28, 2024Updated 2 years ago
- Browser extensions for the Knowledge application☆33Jul 16, 2022Updated 3 years ago
- C# DDE Client for MetaTrader 4 (via Ndde)☆10Jan 1, 2018Updated 8 years ago
- fast-embeddings-api☆16Nov 23, 2023Updated 2 years ago
- ☆29Apr 28, 2026Updated 3 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of F5-TTS in MLX☆14Dec 13, 2024Updated last year
- Excel RTD server sourcing data from Redis☆11Dec 11, 2024Updated last year
- ☆21Jan 15, 2026Updated 4 months ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- The official Python library for Formulaic☆18Apr 25, 2024Updated 2 years ago
- Explore how Flux Dev responds when you change the strengths of layers in the model.☆21Sep 20, 2024Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Mar 5, 2024Updated 2 years ago