LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimized for Apple Silicon), and visual performance charts.
★50 · Apr 4, 2026 · Updated last week
Alternatives and similar repositories for llm_context_benchmarks
Users that are interested in llm_context_benchmarks are comparing it to the libraries listed below.
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX ★53 · Mar 16, 2026 · Updated 3 weeks ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework… ★11 · May 4, 2024 · Updated last year
- Fine-tuning llama2 using the chain-of-thought approach ★10 · Nov 18, 2023 · Updated 2 years ago
- MCP for SemaphoreUI ★52 · Feb 3, 2026 · Updated 2 months ago
- Recursive Self-Aggregation evals on ARC-AGI ★29 · Jan 26, 2026 · Updated 2 months ago
- Implementation of StrongDM's Attractor spec (https://github.com/strongdm/attractor) in Rust ★29 · Mar 9, 2026 · Updated last month
- Java library to enable AfterBurn with OpenFaaS ★11 · Mar 31, 2018 · Updated 8 years ago
- GDT (Ghidra Data Type) generated from IDA tils ★22 · Mar 10, 2023 · Updated 3 years ago
- Implementation of Visual Intelligence Using SmolVLM 2 by Hugging Face ★39 · Jan 15, 2026 · Updated 3 months ago
- ★33 · Jan 10, 2026 · Updated 3 months ago
- High performance async Mssql library for Python. ★19 · Updated this week
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio. ★84 · Nov 11, 2025 · Updated 5 months ago
- An AI agent to create short stories, using Gemini and Imagen for illustrations. The project is developed in Java 21 with LangChain4j, and… ★12 · Sep 4, 2025 · Updated 7 months ago
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlx ★32 · Mar 12, 2026 · Updated last month
- Dotfiles optimized for performance and high productivity in the terminal ★20 · Updated this week
- A lightweight, high-performance string manipulation library optimized for speed-sensitive applications. ★16 · Mar 28, 2026 · Updated 2 weeks ago
- Ghidra Struct Importer ★20 · Oct 18, 2023 · Updated 2 years ago
- My personal dotfiles with automated macOS setup. Features smart installation scripts, Bats testing (bash), performance monitoring, and 2… ★11 · Feb 6, 2026 · Updated 2 months ago
- Advanced Ocean Simulation for Unreal Engine 5 using the Niagara system and C++. Designed to enhance FPS with high-performance mesh and wi… ★10 · Aug 19, 2024 · Updated last year
- ★15 · Apr 7, 2024 · Updated 2 years ago
- an AI rock tumbler (orchestrator) ★35 · Feb 24, 2026 · Updated last month
- A particle system implemented on Android, handling collisions, optimized for performance ★10 · Dec 18, 2023 · Updated 2 years ago
- A lightweight React hook that automatically manages fade overlays for scrollable containers. Provides smooth gradient transitions at the … ★12 · Aug 11, 2025 · Updated 8 months ago
- ★25 · Aug 6, 2025 · Updated 8 months ago
- Discover Netflix's Open Connect Appliance (OCA) assigned to your connection. This tool fetches and displays detailed connectivity and hos… ★19 · Jul 22, 2025 · Updated 8 months ago
- ★14 · Apr 1, 2019 · Updated 7 years ago
- The old repository used to store ToroDB related products and libraries ★12 · Apr 10, 2017 · Updated 9 years ago
- Parse SVG files and render them as PNG, PDF, SVG, or raw memory buffer images. ★16 · May 7, 2019 · Updated 6 years ago
- An experimental python library to compile and analyze the cost of any desired composite simulation in real or imaginary time, and with or… ★10 · Feb 9, 2024 · Updated 2 years ago
- 10K+ req/s batch API client for LLM endpoints - Rust, async, load-balanced ★19 · Feb 21, 2026 · Updated last month
- Pi coding agent extension that gives the agent the ability to switch models on its own ★61 · Updated this week
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ★31 · May 22, 2024 · Updated last year
- Code indexing MCP server to provide context to coding agents. ★48 · Apr 7, 2026 · Updated last week
- r3conwhale aims to develop a multifunctional recon chain for web applications, intelligently interpreting collected data, and optimizing … ★14 · Jul 3, 2024 · Updated last year
- A web-based helper for the Spark Core. ★23 · Dec 13, 2013 · Updated 12 years ago
- Multi-arch templates for OpenFaaS ★12 · Sep 11, 2020 · Updated 5 years ago
- PowerShell script to optimize Windows performance and reduce latency (24H2 compatible) for a better Data Science experience. ★23 · Jul 30, 2025 · Updated 8 months ago
- A Kubernetes Controller that will ensure that the EC2 Source Destination Check (source-dest-check attribute) is disabled on nodes within … ★18 · Jul 28, 2020 · Updated 5 years ago
- ★10 · Apr 1, 2026 · Updated 2 weeks ago