Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)
☆28Jun 28, 2023Updated 2 years ago
Alternatives and similar repositories for inference-benchmark
Users that are interested in inference-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sentence Embedding as a Service☆15Jun 30, 2025Updated 10 months ago
- Distributed DRL by Ray and TensorFlow Tutorial.☆10Dec 26, 2019Updated 6 years ago
- Repository for Semi-supervised Synthesizer Sound Matching with Differentiable DSP☆24Jun 10, 2022Updated 3 years ago
- Cloud Native Machine Learning Model Registry☆81Jan 12, 2023Updated 3 years ago
- A benchmark framework for LLM serving performance, based on API call☆14Apr 15, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Yet another system call tracer written in Go.☆45Mar 27, 2018Updated 8 years ago
- Source code for "Learning Similarity Metrics for Melody Retrieval"☆29Oct 29, 2019Updated 6 years ago
- SJTU SE3357 操作系统笔记 OS Notes☆18Jun 4, 2023Updated 2 years ago
- 一个批量下载人人网相册照片的工具。☆10Nov 15, 2018Updated 7 years ago
- Model factory is a ML training platform to help engineers to build ML models at scale☆18Sep 27, 2021Updated 4 years ago
- WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.☆17Aug 4, 2022Updated 3 years ago
- coded with and corrected by Google Anti-Gravity☆13Nov 23, 2025Updated 5 months ago
- End-to-end real-world polyphonic piano audio-to-score transcription with hierarchical decoding (IJCAI 2024)☆40Sep 17, 2024Updated last year
- Neutron plugins for Ironic/Neutron integration. Mirror of code maintained at opendev.org.☆10May 13, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆31Jun 15, 2021Updated 4 years ago
- Testing effects of the Go CPU profiler on microbenchmarks☆13Aug 31, 2022Updated 3 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Implicit neural differentiable FM synthesizer☆47Oct 27, 2022Updated 3 years ago
- Build a feature-less eBPF vm on eBPF, just for fun.☆16Mar 10, 2024Updated 2 years ago
- MUSDB25 - A Fully Multitrack Dataset for Music Source Separation☆13Mar 29, 2025Updated last year
- A general-purpose GPU monitor, witch can monitor GPU cards and the usage of each pods or containers.☆21Mar 8, 2022Updated 4 years ago
- Some Demo Code for the MPA Exercise.☆10Dec 4, 2017Updated 8 years ago
- Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano☆32May 19, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Depict GPU memory footprint during DNN training of PyTorch☆11Nov 17, 2022Updated 3 years ago
- Lightning In-Memory Object Store☆46Jan 22, 2022Updated 4 years ago
- Semi-supervised learning using teacher-student models for vocal melody extraction☆42Sep 14, 2021Updated 4 years ago
- A simple AI/ML tool for non-technical creatives☆11May 5, 2023Updated 3 years ago
- Music Demixing Challenge Submission Repo☆16Sep 8, 2023Updated 2 years ago
- PEP-DNA: a Performance Enhancing Proxy for Deploying Network Architectures☆11Jun 19, 2024Updated last year
- ☆48Nov 13, 2021Updated 4 years ago
- Paper list of federated learning: About system design☆13Apr 13, 2022Updated 4 years ago
- ☆10May 5, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Midas is a memory management system that efficiently and safely harvests idle memory for applications' soft state.☆11Oct 30, 2024Updated last year
- NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that del…☆26Jul 21, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- 非沪籍高校毕业生留沪各项流程汇总☆17Jan 24, 2018Updated 8 years ago
- SyMuRBench: Benchmark for symbolic music representations☆19Nov 6, 2025Updated 6 months ago
- Online BaseHangul Encoder And Decoder☆12Jan 30, 2023Updated 3 years ago
- ☆13May 16, 2021Updated 5 years ago