☆48 · Sep 7, 2024 · Updated last year
Alternatives and similar repositories for LLMServingPerfEvaluator
Users who are interested in LLMServingPerfEvaluator compare it to the libraries listed below.
- FMO (Friendli Model Optimizer) ☆13 · Jan 8, 2025 · Updated last year
- [⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI ☆49 · Jun 25, 2025 · Updated 8 months ago
- FriendliAI Model Hub ☆90 · Jun 9, 2022 · Updated 3 years ago
- ☆21 · Jan 7, 2018 · Updated 8 years ago
- ☆15 · Jun 8, 2021 · Updated 4 years ago
- ☆15 · Oct 4, 2022 · Updated 3 years ago
- Dotfile management with bare git ☆21 · Feb 19, 2026 · Updated 2 weeks ago
- ☆18 · Dec 4, 2017 · Updated 8 years ago
- Apache Nemo (Incubating) - Data Processing System for Flexible Employment With Different Deployment Characteristics ☆113 · Jul 1, 2025 · Updated 8 months ago
- ☆22 · Sep 7, 2019 · Updated 6 years ago
- 👀 Project What is my IP? ☆18 · Nov 29, 2024 · Updated last year
- AI Agent who manages your Jira project ☆21 · Jun 23, 2024 · Updated last year
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments. ☆132 · Feb 21, 2022 · Updated 4 years ago
- Nexusflow function call, tool use, and agent benchmarks. ☆30 · Dec 13, 2024 · Updated last year
- ☆20 · Jul 24, 2024 · Updated last year
- Everything you need to build state-of-the-art foundation multimodal desktop agent, end-to-end. ☆34 · Feb 27, 2026 · Updated last week
- ☆24 · Nov 24, 2018 · Updated 7 years ago
- ☆61 · Sep 17, 2024 · Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆12 · Nov 14, 2025 · Updated 3 months ago
- ☆16 · Feb 22, 2025 · Updated last year
- https://www.tekna.no/en/events/workshop-practical-use-of-rag-49875/ ☆11 · Apr 25, 2025 · Updated 10 months ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms ☆14 · Feb 2, 2026 · Updated last month
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs" ☆19 · Jul 11, 2024 · Updated last year
- Unofficial implementation of DreamTalk in ComfyUI ☆12 · Aug 15, 2024 · Updated last year
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM ☆11 · Jun 16, 2025 · Updated 8 months ago
- A class for synchronizing sensor readings to the system clock ☆11 · Oct 25, 2018 · Updated 7 years ago
- Run GEPA on your favorite non-python libraries. ☆33 · Jan 22, 2026 · Updated last month
- Prompt templates for language models ☆10 · Feb 28, 2026 · Updated last week
- Helm charts for Paralus ☆14 · Jan 16, 2026 · Updated last month
- ☆12 · Oct 21, 2020 · Updated 5 years ago
- ☆12 · Jun 7, 2023 · Updated 2 years ago
- record power consumption on thinkpads and create a gnuplot graph ☆10 · May 8, 2019 · Updated 6 years ago
- personal website, blog, proj showcase ☆15 · Feb 18, 2026 · Updated 2 weeks ago
- a collection of presentations using Github pages ☆13 · May 23, 2019 · Updated 6 years ago
- Yet another lightweight version for K8S, and even lighter than K3S. ☆11 · Mar 12, 2020 · Updated 5 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism" ☆14 · Apr 30, 2025 · Updated 10 months ago
- ☆15 · Apr 26, 2025 · Updated 10 months ago
- My dotfiles - I found most of the stuff on the Arch Linux forums. Those are heavily outdated. ☆20 · Oct 19, 2013 · Updated 12 years ago
- Langchain + Docker + Neo4j ☆10 · Oct 29, 2024 · Updated last year