ivanfioravanti / llm_context_benchmarksLinks
📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimized for Apple Silicon), visual performance charts.
☆18Updated 3 months ago
Alternatives and similar repositories for llm_context_benchmarks
Users that are interested in llm_context_benchmarks are comparing it to the libraries listed below
Sorting:
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆11Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆99Updated 6 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆43Updated 6 months ago
- For inferring and serving local LLMs using the MLX framework☆109Updated last year
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the …☆58Updated 10 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50Updated 7 months ago
- Embed anything.☆27Updated last year
- ☆107Updated 2 months ago
- Use Codestral Mamba with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.☆29Updated last year
- Distributed Inference for mlx LLm☆100Updated last year
- ☆20Updated 2 months ago
- ☆134Updated last month
- ☆108Updated 4 months ago
- Train Large Language Models on MLX.☆239Updated last month
- ☆37Updated 5 months ago
- Running Microsoft's BitNet via Electron, React & Astro☆49Updated 3 months ago
- powerful and fast tool calling agents☆79Updated 9 months ago
- ☆15Updated last year
- Open source implementation for computer use, using light OCR models and LLMs. Get Android app in link below.☆30Updated last week
- Use smol agents to do research and then update csv coumns with its findings.☆41Updated 11 months ago
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)☆121Updated this week
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls.☆36Updated last month
- A MCP server allowing LLM agents to easily connect and retrieve data from any database☆99Updated 5 months ago
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆31Updated 6 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- A real-time shared memory layer for multi-agent LLM systems.☆50Updated 6 months ago
- Mixture-of-Ollamas☆30Updated last year
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.☆73Updated 2 months ago
- ☆101Updated 7 months ago
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆82Updated last year