☆48Sep 7, 2024Updated last year
Alternatives and similar repositories for LLMServingPerfEvaluator
Users that are interested in LLMServingPerfEvaluator are comparing it to the libraries listed below.
- FMO (Friendli Model Optimizer)☆13Jan 8, 2025Updated last year
- Welcome to PeriFlow CLI ☁︎☆12Aug 3, 2023Updated 2 years ago
- [⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI☆50Jun 25, 2025Updated 9 months ago
- FriendliAI Model Hub☆90Jun 9, 2022Updated 3 years ago
- MIST: High-performance IoT Stream Processing☆18Mar 19, 2019Updated 7 years ago
- ☆21Jan 7, 2018Updated 8 years ago
- Cruise: A Distributed Machine Learning Framework with Automatic System Configuration☆26Mar 19, 2019Updated 7 years ago
- ☆15Oct 4, 2022Updated 3 years ago
- Nemo: A flexible data processing system☆21Mar 12, 2018Updated 8 years ago
- Apache Nemo (Incubating) - Data Processing System for Flexible Employment With Different Deployment Characteristics☆113Jul 1, 2025Updated 8 months ago
- ☆15Jun 8, 2021Updated 4 years ago
- Mirror of Apache REEF☆99Jul 6, 2022Updated 3 years ago
- ☆18Dec 4, 2017Updated 8 years ago
- ☆28Oct 10, 2021Updated 4 years ago
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.☆131Feb 21, 2022Updated 4 years ago
- Lightweight and Parallel Deep Learning Framework☆264Nov 26, 2022Updated 3 years ago
- ☆22Sep 7, 2019Updated 6 years ago
- ☆24Nov 24, 2018Updated 7 years ago
- minimal C implementation of speculative decoding based on llama2.c☆29Jul 15, 2024Updated last year
- Special Lectures on New Computer Technologies☆13Dec 22, 2018Updated 7 years ago
- FOSSLight Scanner☆18Updated this week
- Zye's YouTube stuff☆61Feb 6, 2026Updated last month
- ☆14Mar 28, 2014Updated 12 years ago
- ☆20Jul 24, 2024Updated last year
- AI Agent who manages your Jira project☆21Jun 23, 2024Updated last year
- Alkali is a MLIR-based compiler infrastructure for SmartNICs. It allows developers to write target-independent programs, with the compile…☆28Sep 28, 2025Updated 6 months ago
- CPUID powered by Python.☆20Mar 9, 2021Updated 5 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 4 months ago
- Intro to Computer Architecture, Assembly language, MIPS, Hardware with Professor Glenn Reinman☆10Jun 20, 2014Updated 11 years ago
- 👀 Project What is my IP?☆18Nov 29, 2024Updated last year
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆123Jul 4, 2025Updated 8 months ago
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"☆19Jul 11, 2024Updated last year
- POC integration Airbyte+Dagster+Langchain☆13Jun 1, 2023Updated 2 years ago
- This project aims to show how the Whisper model can be fine-tuned on a language it was not trained on, provided it was trained on a similar language.☆11May 10, 2024Updated last year
- ☆12Jun 7, 2023Updated 2 years ago
- NVIDIA Riva SDK Demonstration for Feb 2022,2023 Developer Meetup☆10Jan 11, 2023Updated 3 years ago
- Official repository for "Embodied Agents Meet Personalization: Investigating Challenges and Solutions Through the Lens of Memory Utilizat…☆20Oct 24, 2025Updated 5 months ago
- ☆13Jan 23, 2021Updated 5 years ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Dec 13, 2024Updated last year