Since OpenAI and friends refuse to give us a max_ctx param in /models, here are the current context window, input token, and output token limits for OpenAI (API), Anthropic, Qwen, DeepSeek, Llama, Phi, Gemini, and Mistral.
☆67 · Dec 20, 2025 · Updated 5 months ago
Alternatives and similar repositories for llm-context-limits
Users that are interested in llm-context-limits are comparing it to the libraries listed below.
- Add/remove knowledge from local files/folder. ☆25 · May 15, 2025 · Updated last year
- ☆15 · Feb 23, 2026 · Updated 3 months ago
- ☆11 · Feb 20, 2025 · Updated last year
- Personal voice assistant, with voice interruption and Twilio support ☆18 · Feb 24, 2025 · Updated last year
- A fully autonomous agent that accesses the browser and performs tasks. ☆18 · Apr 25, 2025 · Updated last year
- ☆24 · Jan 22, 2025 · Updated last year
- A meta-framework for self-improving LLMs with transparent reasoning ☆41 · Dec 10, 2025 · Updated 6 months ago
- Tools for Open-WebUI ☆26 · May 14, 2025 · Updated last year
- Quick start for Open WebUI ☆183 · Nov 11, 2025 · Updated 7 months ago
- Create text chunks which end at natural stopping points without using a tokenizer ☆26 · Nov 26, 2025 · Updated 6 months ago
- Moondream MCP Server in Python ☆50 · Jul 2, 2025 · Updated 11 months ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models ☆20 · Apr 15, 2025 · Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers, including T5 and Whisper. ☆29 · Mar 15, 2025 · Updated last year
- Open WebUI scripts ☆11 · Jan 27, 2026 · Updated 4 months ago
- ☆21 · Jan 25, 2025 · Updated last year
- The Fastest Way to Fine-Tune LLMs Locally ☆339 · Dec 18, 2025 · Updated 6 months ago
- Compose, manage, and run MCP servers as Docker containers. With a Unified API gateway built in. ☆56 · Oct 9, 2025 · Updated 8 months ago
- Revolutionizing collaborative thinking and problem-solving through intelligent ranking systems and gamified AI research contribution. A h… ☆28 · Nov 27, 2025 · Updated 6 months ago
- ☆19 · Nov 5, 2024 · Updated last year
- ☆17 · Mar 11, 2025 · Updated last year
- Tools for Bitwarden ☆11 · Sep 28, 2024 · Updated last year
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have" ☆48 · Oct 9, 2024 · Updated last year
- Open WebUI function designed to manage and calculate the costs associated with user interactions and model usage in Open WebUI. ☆70 · Jun 25, 2025 · Updated 11 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI … ☆57 · Feb 10, 2025 · Updated last year
- MCP server enabling AI agents to perform natural knowledge discovery and analysis across an Obsidian vault ☆21 · May 31, 2025 · Updated last year
- Local LLM Powered Recursive Search & Smart Knowledge Explorer ☆263 · May 13, 2026 · Updated last month
- A unified search engine for all your online knowledge → The Invisible Companion for Work + Life ☆141 · Sep 5, 2025 · Updated 9 months ago
- Setting up a Mac as automatically as possible! ☆11 · Apr 19, 2026 · Updated last month
- ☆20 · Nov 26, 2025 · Updated 6 months ago
- A lightweight LLaMA.cpp HTTP server Docker image based on Alpine Linux. ☆37 · Jun 8, 2026 · Updated last week
- Yet another frontend for LLM, written using .NET and WinUI 3 ☆11 · Sep 14, 2025 · Updated 9 months ago
- InstantLingua – LLM-Driven PopClip Extension for Translation & Writing. Supports AI from OpenAI, Claude, Grok, and Gemini. ☆35 · Jan 15, 2026 · Updated 5 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 · Nov 3, 2023 · Updated 2 years ago
- An MCP stdio toolpack for local LLMs ☆33 · Apr 6, 2026 · Updated 2 months ago
- Custom tools for agent-based crewAI langchain solutions ☆10 · May 27, 2024 · Updated 2 years ago
- ☆10 · May 2, 2025 · Updated last year
- Protocol for Augmented Memory of Project Artifacts (MCP compatible) - extended ☆25 · Jan 24, 2026 · Updated 4 months ago
- One stop shop - Local-first RAG stack with intelligent polyglot-code/docs, remote code execution, local llama enrichment, progressive dis… ☆34 · Feb 17, 2026 · Updated 4 months ago
- ☆11 · Oct 11, 2023 · Updated 2 years ago