πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
β138Jul 25, 2024Updated last year
Alternatives and similar repositories for benchmarks
Users that are interested in benchmarks are comparing it to the libraries listed below
Sorting:
- an auto-sleeping and -waking framework around llama.cppβ12Feb 8, 2025Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β59Jan 5, 2026Updated last month
- Exploring limitations of LLM-as-a-judgeβ20Aug 17, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMsβ267Dec 4, 2025Updated 2 months ago
- This project showcases engaging interactions between two AI chatbots.β10Jan 10, 2024Updated 2 years ago
- Proxy server for triton gRPC server that inferences embedding model in Rustβ21Aug 10, 2024Updated last year
- β12Jan 19, 2024Updated 2 years ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbolsβ13Aug 13, 2024Updated last year
- AI_Powered_Dev_Search_Engineβ12Mar 10, 2024Updated last year
- Identify and automatically fix issues in shell scriptsβ15Nov 24, 2023Updated 2 years ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.β26Jun 3, 2024Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β84Oct 29, 2024Updated last year
- Attend - to what matters.β17Feb 22, 2025Updated last year
- Benchmark scripts for comparing different tokenizers and sentence segmenters of Germanβ12Feb 27, 2023Updated 3 years ago
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote β¦β15Apr 28, 2024Updated last year
- Docker images for Stable Diffusion WebUI (AUTOMATIC1111) for AMD Radeon RX5500XT and similar boardsβ13Oct 15, 2024Updated last year
- Learning and rediscovering ML from total scratchβ12Aug 30, 2021Updated 4 years ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.β36Jul 2, 2025Updated 8 months ago
- Small tools to enhance your AI app with little effort.β12Jan 9, 2024Updated 2 years ago
- When real time Yoga Position classification meets GNNβ11Sep 17, 2023Updated 2 years ago
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applicationsβ28Jun 7, 2024Updated last year
- This repository contains the metadata and data of different databases that we use for testingβ14Jan 29, 2025Updated last year
- Quantized inference code for LLaMA modelsβ13Mar 12, 2023Updated 2 years ago
- A guidance compatibility layer for llama-cpp-pythonβ36Sep 11, 2023Updated 2 years ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPUβ13May 5, 2024Updated last year
- Python library for automatic training, optimization and comparison of Transformer models on most NLP tasks.β20May 6, 2023Updated 2 years ago
- Implementation of various Machine learning and MLOps applications/tutorials used within my Medium blog.β11Jan 28, 2023Updated 3 years ago