A higher-performance OpenAI LLM service than vLLM serve: a pure C++ OpenAI LLM service implemented with GRPS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function calling, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.
☆160 · Dec 8, 2025 · Updated 3 months ago
Alternatives and similar repositories for grps_trtllm
Users that are interested in grps_trtllm are comparing it to the libraries listed below.
- AI solution for Patent Classification ☆143 · Jun 29, 2020 · Updated 5 years ago
- kight is a static analysis tool for C/C++ programs. ☆214 · Dec 27, 2024 · Updated last year
- Build a simple yet effective CNN that works as a sketch recognizer, just like the Google Quick-Draw project. ☆143 · Mar 23, 2023 · Updated 3 years ago
- High performance rank executor for advertisement and recommendation system, implemented in C/C++ and support ensembled into Java/Scala ho… ☆74 · Feb 28, 2024 · Updated 2 years ago
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation ☆155 · Oct 18, 2024 · Updated last year
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098… ☆316 · Jul 31, 2025 · Updated 7 months ago
- ☆142 · Nov 13, 2024 · Updated last year
- ☆230 · Jun 9, 2025 · Updated 9 months ago
- ☆247 · Nov 24, 2024 · Updated last year
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games ☆143 · Mar 23, 2023 · Updated 3 years ago
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective [NeurIPS 2024]. ☆274 · Dec 3, 2024 · Updated last year
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx… ☆156 · May 25, 2024 · Updated last year
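- Network Drive is a tool for managing documents, files, images, audio, video, and other materials through storage, classification, search, sharing, collaboration, distribution, recycling, and display. It is well suited to controlling document permissions, storage allocation, security encryption, and link sharing in domestic (Chinese) private deployment environments, and it also supports lightweight sending and receiving of file tasks. It relies on an open-source digital foundation platform for managing personnel and positions. ☆353 · Mar 19, 2026 · Updated last week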
- A workspace for HMI tools ☆164 · Jul 11, 2024 · Updated last year
- ☆121 · Sep 30, 2024 · Updated last year
- [ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models ☆190 · Jul 15, 2024 · Updated last year
- ☆252 · Feb 11, 2025 · Updated last year
- ☆287 · Jul 6, 2024 · Updated last year
- C++ code for FDTD solution of Maxwell's equations. ☆161 · Jun 11, 2023 · Updated 2 years ago
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardware (e.g., GPU) to efficiently support the exec… ☆254 · Jan 7, 2026 · Updated 2 months ago
- Rights and benefits calculation framework ☆169 · Dec 31, 2023 · Updated 2 years ago
- ☆143 · May 25, 2024 · Updated last year
- A system demo based on Retrieval-Augmented Generation to answer questions about Buddhism ☆83 · Sep 27, 2024 · Updated last year
- An awesome list of self-sovereign identity resources. ☆138 · Jul 9, 2024 · Updated last year
- Book Recommendation System ☆235 · May 2, 2024 · Updated last year
- A ReAct-Based Highly Robust Autonomous Agent Framework. ☆209 · Mar 19, 2026 · Updated last week
- AI-powered document summarization engine that transforms lengthy texts into crystallized insights ☆146 · Nov 5, 2024 · Updated last year
- A code repository designed to show the best GitHub has to offer. ☆165 · Jun 30, 2024 · Updated last year
- Visualization, simulation, and manipulation of intrinsically disordered proteins with Gibbs sampling ☆288 · Oct 24, 2024 · Updated last year
- Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.… ☆206 · Jan 15, 2026 · Updated 2 months ago
- 🔗 Serverless blockchain analytics pipeline on AWS - Extract, process and visualize Ethereum data using Kinesis, Lambda, Redshift Serverl… ☆103 · Oct 5, 2023 · Updated 2 years ago
- A Python package that integrates algorithms and various machine learning approaches to extract features (genes) effective for classificati… ☆252 · Jan 15, 2026 · Updated 2 months ago
- Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition ☆306 · Aug 18, 2024 · Updated last year
- R package for autoregressive, reduced-rank, and factor models in time series. ☆122 · Mar 3, 2025 · Updated last year
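- easy-ngo is a Go-based development toolkit developed by NetEase Media. With the easy-ngo toolkit, developers can quickly build highly available, high-concurrency applications. ☆302 · Dec 29, 2023 · Updated 2 years ago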
- 🤙 Control Your Mouse with Hand Gestures in the Air 🤙 ☆250 · Jun 19, 2023 · Updated 2 years ago
- Next Generation Java Starter Project ☆107 · Apr 11, 2025 · Updated 11 months ago
- Accelerate your Stable Diffusion inference with the library's universal C/C++ framework design, powered by ONNXRuntime & across platforms… ☆445 · Aug 16, 2024 · Updated last year
- GlucoInsight: Framework for Glucose Management Application ☆84 · Aug 6, 2024 · Updated last year