☆48Sep 7, 2024Updated last year
Alternatives and similar repositories for LLMServingPerfEvaluator
Users that are interested in LLMServingPerfEvaluator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FMO (Friendli Model Optimizer)☆14Jan 8, 2025Updated last year
- Welcome to PeriFlow CLI ☁︎☆12Aug 3, 2023Updated 2 years ago
- [⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI☆50Jun 25, 2025Updated 11 months ago
- FriendliAI Model Hub☆90Jun 9, 2022Updated 3 years ago
- MIST: High-performance IoT Stream Processing☆18Mar 19, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆21Jan 7, 2018Updated 8 years ago
- Cruise: A Distributed Machine Learning Framework with Automatic System Configuration☆26Mar 19, 2019Updated 7 years ago
- ☆15Oct 4, 2022Updated 3 years ago
- ☆15Jun 8, 2021Updated 4 years ago
- Mirror of Apache REEF☆99Jul 6, 2022Updated 3 years ago
- ☆28Oct 10, 2021Updated 4 years ago
- Ethereum VM fuzzer☆62Jul 14, 2021Updated 4 years ago
- Dotfile management with bare git☆22May 16, 2026Updated last week
- ☆22Sep 7, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆24Nov 24, 2018Updated 7 years ago
- minimal C implementation of speculative decoding based on llama2.c☆30Jul 15, 2024Updated last year
- 컴퓨터 신기술 특강☆13Dec 22, 2018Updated 7 years ago
- Example and helpers for building rust projects under cmake☆16Oct 5, 2018Updated 7 years ago
- FOSSLight Scanner☆18May 22, 2026Updated last week
- Zye's YouTube stuff☆66Feb 6, 2026Updated 3 months ago
- 2020 Rookies 세미나☆27Oct 26, 2022Updated 3 years ago
- ☆14Mar 28, 2014Updated 12 years ago
- ☆20Jul 24, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- TCP 혼잡 제어 시 윈도우 크기가 진짜 평행상태로 수렴 하는지 확인하기위한 데모☆11Nov 28, 2019Updated 6 years ago
- CPUID powered by Python.☆20Mar 9, 2021Updated 5 years ago
- 👀 Project What is my IP?☆17Nov 29, 2024Updated last year
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆126Jul 4, 2025Updated 10 months ago
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"☆19Jul 11, 2024Updated last year
- ☆12Jun 7, 2023Updated 2 years ago
- NSMC, KorSTS ... fine-tunings☆18Feb 23, 2022Updated 4 years ago
- Cross Platform Socket Library☆12Jul 9, 2017Updated 8 years ago
- 맛있는 코딩 Yummy Coding 코드 소스 모음☆14Jul 2, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Nexusflow function call, tool use, and agent benchmarks.☆30Dec 13, 2024Updated last year
- ☆13Jan 23, 2021Updated 5 years ago
- { 고퀄리티 개발 컨텐츠 모음 }☆10May 30, 2021Updated 5 years ago
- ☆18Dec 3, 2020Updated 5 years ago
- A class for synchronizing sensor readings to the system clock☆11Oct 25, 2018Updated 7 years ago
- Javelin is a dialect of Lisp. It is designed to be an embedded language (minimal Lisp for the Java Virtual Machine).☆30Oct 4, 2023Updated 2 years ago
- Transformer Encoder with Char information for text classification☆15Jan 17, 2020Updated 6 years ago