Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU accelerated inference on NVIDIA's GPUs.
☆42Sep 26, 2024Updated last year
Alternatives and similar repositories for cortex.tensorrt-llm
Users that are interested in cortex.tensorrt-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆43Jul 4, 2025Updated 8 months ago
- Ikigai is an AI-powered Open Assignment System☆35Oct 9, 2024Updated last year
- 🌐 OpenCrawl: An ethical, high-performance web crawler built for scale A powerful web crawling library that respects robots.txt and rate…☆19Apr 3, 2025Updated 11 months ago
- A small utility library for parsing GGUF file info☆29Jan 27, 2025Updated last year
- Local AI API Platform☆2,762Jul 4, 2025Updated 8 months ago
- ☆12Nov 8, 2023Updated 2 years ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- Moved to here: https://github.com/lyogavin/airllm☆27Aug 1, 2024Updated last year
- Source code for SuperAGI's Zapier Integration☆13Aug 20, 2023Updated 2 years ago
- AgentOS is a lightweight, single-file implementation that provides a robust foundation for building autonomous AI agents. It implements t…☆23Jul 11, 2025Updated 8 months ago
- Qualitative data analysis for text, images, audio, video. Cross platform. Python 3.8 or newer and PyQt6.☆11Updated this week
- Pay attention to what you're paying attention to.☆28May 17, 2022Updated 3 years ago
- Real-world Conversational AI personas.☆21Oct 23, 2023Updated 2 years ago
- ☆19Dec 4, 2025Updated 3 months ago
- AI News Anchor Generator App built using Midjourney, D-ID, OpenAI, NewsAPI, and Streamlit.☆17Sep 18, 2023Updated 2 years ago
- Browser extensions for the Knowledge application☆33Jul 16, 2022Updated 3 years ago
- fast-embeddings-api☆16Nov 23, 2023Updated 2 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated last year
- The official Python library for Formulaic☆18Apr 25, 2024Updated last year
- In-browser semantic search demo using EmbeddingGemma and Transformers.js. No server required.☆31Sep 7, 2025Updated 6 months ago
- llama INT4 cuda inference with AWQ☆54Jan 20, 2025Updated last year
- I will be adding different kind of opensource data extraction tools code using python☆10Nov 15, 2024Updated last year
- Browse Lance tables from your local machine in a simple web UI. No database to set up. Mount a folder and go.☆23Mar 16, 2026Updated last week
- ☆15Jun 9, 2023Updated 2 years ago
- FlexAttention w/ FlashAttention3 Support☆27Oct 5, 2024Updated last year
- ☆10Aug 18, 2025Updated 7 months ago
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing☆22Updated this week
- A data extraction example showing how to get a pdf's content.☆12Feb 24, 2021Updated 5 years ago
- Tkinter Rapid Application Development (RAD) library - Tkinter XML widget building☆10Oct 1, 2020Updated 5 years ago
- awesome templates for textgenertor obsidian plugin☆10Nov 28, 2023Updated 2 years ago
- This is a complete website in which you can chat with pdf, extract meta data, text, links, image, and lot more . Check my blog for more d…☆31Jun 1, 2024Updated last year
- ☆12Jan 20, 2024Updated 2 years ago
- OAuth authentication plugin for personal coding assistance with ChatGPT Plus/Pro subscriptions - uses OpenAI's official authentication me…☆27Updated this week
- An Android app for real-time facial emotion recognition, designed to improve accuracy for Middle Eastern faces and women wearing hijabs. …☆21Sep 11, 2023Updated 2 years ago
- ultimate openpose editor with render☆36Jun 1, 2025Updated 9 months ago
- This is AutoGenDemo☆11Mar 12, 2024Updated 2 years ago
- Export Apple News saved articles to SQLite☆14Mar 16, 2023Updated 3 years ago
- GPTQ inference Triton kernel☆321May 18, 2023Updated 2 years ago