Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)
☆28Jun 28, 2023Updated 3 years ago
Alternatives and similar repositories for inference-benchmark
Users who are interested in inference-benchmark are comparing it to the libraries listed below.
- Sentence Embedding as a Service☆15Jun 30, 2025Updated last year
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- Cloud Native Machine Learning Model Registry☆81Jan 12, 2023Updated 3 years ago
- Yet another system call tracer written in Go.☆45Mar 27, 2018Updated 8 years ago
- SJTU SE3357 Operating Systems Notes (OS Notes)☆17Jun 4, 2023Updated 3 years ago
- A tool for batch-downloading photos from Renren albums.☆10Nov 15, 2018Updated 7 years ago
- Model factory is a ML training platform to help engineers to build ML models at scale☆18Sep 27, 2021Updated 4 years ago
- A drawable MNIST demo using streamlit.☆11Nov 27, 2020Updated 5 years ago
- ☆125Mar 17, 2024Updated 2 years ago
- Neutron plugins for Ironic/Neutron integration. Mirror of code maintained at opendev.org.☆11Jun 9, 2026Updated 3 weeks ago
- ☆30Jun 15, 2021Updated 5 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- MUSDB25 - A Fully Multitrack Dataset for Music Source Separation☆13Mar 29, 2025Updated last year
- ☆10Dec 16, 2022Updated 3 years ago
- Some Demo Code for the MPA Exercise.☆10Dec 4, 2017Updated 8 years ago
- Awesome-GenAITech: a curated list of Generative AI Techniques☆11Jul 11, 2023Updated 2 years ago
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆12Aug 17, 2021Updated 4 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆32May 19, 2023Updated 3 years ago
- Application System Architecture☆23Dec 1, 2023Updated 2 years ago
- PEP-DNA: a Performance Enhancing Proxy for Deploying Network Architectures☆11Jun 19, 2024Updated 2 years ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆23Jun 10, 2024Updated 2 years ago
- Paper list of federated learning: About system design☆13Apr 13, 2022Updated 4 years ago
- ☆10May 5, 2021Updated 5 years ago
- Midas is a memory management system that efficiently and safely harvests idle memory for applications' soft state.☆11Oct 30, 2024Updated last year
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- ☆21Dec 8, 2020Updated 5 years ago
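- A summary of the procedures for university graduates without Shanghai household registration who want to stay in Shanghai☆17Jan 24, 2018Updated 8 years ago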
- DEPRECATED. see github.com/ccp-project/portus☆10Dec 31, 2018Updated 7 years ago
- Online BaseHangul Encoder And Decoder☆13Jan 30, 2023Updated 3 years ago
- ☆15Sep 26, 2022Updated 3 years ago
- OpenSFEDS, a near-eye gaze estimation dataset containing approximately 2M synthetic camera-photosensor image pairs sampled at 500 Hz unde…☆13Apr 18, 2024Updated 2 years ago
- Latency and Memory Analysis of Transformer Models for Training and Inference☆487Apr 19, 2025Updated last year
- A Python-based voice assistant integrating speech-to-text (STT), text-to-speech (TTS), and powerful AI capabilities using either a local …☆18Updated this week
- This repository consists of useful tools or guides for system software development or anything interesting.☆11Feb 27, 2026Updated 4 months ago
- A basic Docker-based installation of TVM☆11Jun 23, 2022Updated 4 years ago
- Slides from 2021-12-15 talk, "TVM Developer Bootcamp – Writing Hardware Backends"☆11Jan 20, 2022Updated 4 years ago
- Code for our ACL-2023 paper: "Combo of Thinking and Observing for Outside-Knowledge VQA"☆12Jun 30, 2023Updated 3 years ago
- Ultra-fast audio super resolution custom node for ComfyUI, powered by the NovaSR model.☆33Feb 12, 2026Updated 4 months ago
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 8 months ago