The backend behind the LLM-Perf Leaderboard
☆11May 5, 2024Updated last year
Alternatives and similar repositories for llm-perf-backend
Users that are interested in llm-perf-backend are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Benchmarking tool for assessing LLM models' performance across different hardwares☆17Dec 8, 2023Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 7 months ago
- Scrape and export data from the Open LLM Leaderboard.☆48Dec 17, 2024Updated last year
- A logrotate script for ROS log folders☆15Jul 31, 2019Updated 6 years ago
- ☆12Updated this week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- SQLGPT is an advanced SQL query generator powered by natural language processing. Seamlessly transforming plain English queries into comp…☆10Oct 24, 2023Updated 2 years ago
- BeautifulSoup+Requests based Web Scrapers☆13Jun 12, 2020Updated 5 years ago
- A template code for running modular and reproducible experiments in pytorch☆13Sep 3, 2025Updated 7 months ago
- An instance segmentation challenge on Basketball images, with a particular focus on occlusion resolution. An opportunity to publish at MM…☆16Aug 8, 2023Updated 2 years ago
- Runpod VLLM Worker that Works !☆10Nov 14, 2023Updated 2 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- A C++ generic programming library for machine learning☆12Nov 10, 2025Updated 5 months ago
- 📚 Learn ML with clean code, simplified math and illustrative visuals. As you learn, work on interesting projects and share them on https…☆12Apr 6, 2020Updated 6 years ago
- ☆12Dec 8, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [CoRL 2022] Context-Aware Attention-based Network for Informative Path Planning - Public code and model☆34Nov 9, 2022Updated 3 years ago
- Agentic Store Skills☆23Updated this week
- Monitor processes and parallel workloads for hangs☆16Dec 27, 2019Updated 6 years ago
- A benchmark framework for LLM serving performance, based on API call☆14Apr 15, 2024Updated 2 years ago
- Official code of our work, Summarize and Generate to Back-Translate: Unsupervised Translation of Programming Languages [arXiv].☆10Oct 6, 2022Updated 3 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- ☆14May 28, 2024Updated last year
- Code and plugin for paper "Automated Query Reformulation for Efficient Search based on Query Logs From Stack Overflow“☆16Nov 19, 2022Updated 3 years ago
- named entity recognition combined with rule from entity dict☆13Aug 25, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Lightweight Python implementation of SHAP☆36Apr 1, 2026Updated 2 weeks ago
- A Redis-compatible in-memory database server written in Rust with MLua-based Lua 5.1 scripting☆18Nov 28, 2025Updated 4 months ago
- ☆12Mar 25, 2024Updated 2 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 3 years ago
- python package of rocm-smi-lib☆24Dec 15, 2025Updated 4 months ago
- ☆33Jul 11, 2024Updated last year
- ☆19Jan 4, 2026Updated 3 months ago
- ☆13Nov 5, 2024Updated last year
- code and data for paper "ComFormer: Code Comment Generation via Transformer and Fusion Method-based Hybrid Code Representation" accepted …☆14May 10, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The API Gateway & Microservice Management Layer, built on NGINX☆11Jul 5, 2018Updated 7 years ago
- Apps that run on modal.com☆13Sep 14, 2025Updated 7 months ago
- Web browser automation through agentic workflows.☆20Sep 14, 2024Updated last year
- Lightweight face detectors with landmarks. Training code using pytorch and inference using pytorch/ncnn/tensorflow/tflite.☆10Jul 1, 2020Updated 5 years ago
- My 5 Machine Learning projects that I've built as part of my freeCodeCamp assignment.☆15Dec 14, 2022Updated 3 years ago
- Scaling structural learning with NO-BEARS☆14Dec 30, 2019Updated 6 years ago
- WayFAST: a minimal data waypoints free autonomous navigation algorithm for field robots☆54Mar 13, 2024Updated 2 years ago