ivanfioravanti / llm_context_benchmarksLinks
📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimized for Apple Silicon), visual performance charts.
☆18Updated 2 months ago
Alternatives and similar repositories for llm_context_benchmarks
Users that are interested in llm_context_benchmarks are comparing it to the libraries listed below
Sorting:
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11Updated last year
- Distributed Inference for mlx LLm☆99Updated last year
- ☆107Updated last month
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆99Updated 5 months ago
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆51Updated 7 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆43Updated 5 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- Embed anything.☆27Updated last year
- 🔥 LitLytics - an affordable, simple analytics platform that leverages LLMs to automate data analysis☆103Updated last year
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the …☆58Updated 9 months ago
- Train Large Language Models on MLX.☆232Updated last week
- ☆28Updated last year
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆219Updated last month
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆31Updated 6 months ago
- Run AI generated code in isolated sandboxes☆128Updated 10 months ago
- Easily view and modify JSON datasets for large language models☆84Updated 7 months ago
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆148Updated 5 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆53Updated last year
- Deep research agents using MiniMax-M2 interleaved thinking☆143Updated 3 weeks ago
- ☆108Updated 3 months ago
- tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.☆38Updated last year
- ☆16Updated 4 months ago
- MLX-GUI MLX Inference Server for Apple Silicone☆157Updated this week
- ☆134Updated last week
- ☆30Updated last year
- ☆23Updated last week
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆85Updated last week
- ☆101Updated 6 months ago
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆82Updated last year