π LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual benchmark modes (API/CLI), automatic hardware detection (optimized for Apple Silicon), visual performance charts.
β60Apr 30, 2026Updated this week
Alternatives and similar repositories for llm_context_benchmarks
Users that are interested in llm_context_benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX frameworkβ¦β11May 4, 2024Updated 2 years ago
- Recursive Self-Aggregation evals on ARC-AGIβ33Jan 26, 2026Updated 3 months ago
- β41Updated this week
- MLX-Video is the best package for inference and finetuning of Image-Video-Audio generation models on your Mac using MLX.β211Mar 18, 2026Updated last month
- Java library to enable AfterBurn with OpenFaaSβ11Mar 31, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A high-performance, in-memory virtual file system for Markdown files. Unix-like commands, Git-style versioning, content-addressable storaβ¦β189Updated this week
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.β85Nov 11, 2025Updated 5 months ago
- β113Mar 31, 2026Updated last month
- This repo maintains a 'cheat sheet' for LLMs that are undertrained on mlxβ33Mar 12, 2026Updated last month
- β38Jan 10, 2026Updated 3 months ago
- πΎ Optimize Laravel caching with Cachetastic! Cache method results, force refresh, handle errors, and boost app performance effortlessly.β13Jan 26, 2026Updated 3 months ago
- β114Apr 10, 2026Updated 3 weeks ago
- High performance async Mssql library for Python.β21Updated this week
- My personal dotfiles with automated macOS setup. Features smart installation scripts, Bats testing (bash), performance monitoring, and 2β¦β11Apr 24, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Backpack Attachments is a FiveM resource for attaching weapons and items to players' backs. It supports customizable attachment points, hβ¦β10Nov 14, 2024Updated last year
- Stable Magisk modules for performance and efficient battery usage on rooted Android devices.β25Updated this week
- A lightweight React hook that automatically manages fade overlays for scrollable containers. Provides smooth gradient transitions at the β¦β12Aug 11, 2025Updated 8 months ago
- Open source time-tracking application, dockerized.β15Mar 23, 2024Updated 2 years ago
- β29Aug 24, 2025Updated 8 months ago
- Sekai Viewer but built with Next, optimized for performanceβ11Jan 20, 2023Updated 3 years ago
- Value Vault is the core feature of Value in order to achieve long-term profitability of the token.β26Dec 14, 2020Updated 5 years ago
- A batched implementation for efficient Qwen2.5-VL inference.β25Jul 16, 2025Updated 9 months ago
- an AI rock tumbler (orchestrator)β41Feb 24, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The Lyrebird CLI launcher application, written in Dartβ13Feb 7, 2023Updated 3 years ago
- Optimize the performance of important tasks by delaying background-tasksβ22Mar 13, 2026Updated last month
- Easily optimize generic performance metrics in differentiable learning.β18Jun 6, 2020Updated 5 years ago
- β10Apr 1, 2026Updated last month
- β58Mar 9, 2026Updated last month
- β10Nov 22, 2022Updated 3 years ago
- machine learning model performance metrics & charts with confidence intervals, optimized with numba to be fastβ16Dec 15, 2021Updated 4 years ago
- This project is a real-time Wav2Lip implementation that I am actively optimizing to enhance the precision and performance of audio-to-lipβ¦β11Dec 6, 2023Updated 2 years ago
- Tools for merging pretrained large language models.β19Jun 12, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Pi coding agent extension that gives the agent the ability to switch models on its ownβ82Apr 14, 2026Updated 3 weeks ago
- Codebase for VidHal: Benchmarking Hallucinations in Vision LLMsβ14Apr 23, 2026Updated last week
- Wen? Now! A library to simplify your Web3 data fetching.β20Jun 30, 2022Updated 3 years ago
- Process to gather streaming data from Airline API using NiFi & batch data using AWS redshift using Sqoop and build a data pipeline to anβ¦β11Jul 20, 2022Updated 3 years ago
- This Python script uses YOLOv8 from Ultralytics for real-time object detection using OpenCV. The script initializes a camera, loads the Yβ¦β11Sep 6, 2024Updated last year
- β79Updated this week
- Magento 2 performance optimizations aimed for developers actively developing for Magento 2β13Aug 19, 2019Updated 6 years ago