alexziskind1 / llm-inference-calculatorLinks
☆142Updated 3 months ago
Alternatives and similar repositories for llm-inference-calculator
Users that are interested in llm-inference-calculator are comparing it to the libraries listed below
Sorting:
- Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models fo…☆243Updated 8 months ago
- This project was generated 100% by AI, with one prompt. NOTE: This neuroca project was generated in 3 hours on 3/3/2025. There are depend…☆47Updated 9 months ago
- beep boop 🤖 (experimental)☆118Updated last year
- A cross platform App that gives you the best UX to run models locally or remotely on your own hardware☆70Updated 2 weeks ago
- Your gateway to both Ollama & Apple MlX models☆150Updated 10 months ago
- Link you Ollama models to LM-Studio☆150Updated last year
- CoexistAI is a modular, developer-friendly research assistant framework . It enables you to build, search, summarize, and automate resear…☆393Updated 2 months ago
- MLX-GUI MLX Inference Server for Apple Silicone☆162Updated 3 weeks ago
- You don’t need to read the code to understand how to build!☆249Updated 2 weeks ago
- Local coding agent with neat UI☆337Updated 7 months ago
- This is a cross-platform desktop application that allows you to chat with locally hosted LLMs and enjoy features like MCP support☆226Updated 5 months ago
- Parse files (e.g. code repos) and websites to clipboard or a file for ingestions by AI / LLMs☆353Updated last month
- An open-source VSCode extension, the AI coding assistant, integrates with Ollama, HuggingFace, OpenAI, and Anthropic.☆264Updated 6 months ago
- Official python implementation of UTCP. UTCP is an open standard that lets AI agents call any API directly, without extra middleware.☆633Updated last month
- Ollama desktop client for everyday use☆89Updated 7 months ago
- Notate is a desktop chat application that takes AI conversations to the next level. It combines the simplicity of chat with advanced feat…☆263Updated 10 months ago
- ☆203Updated 4 months ago
- Fast local speech-to-text for any app using faster-whisper☆145Updated 3 months ago
- This project benchmarks 41 open-source large language models across 19 evaluation tasks using the lm-evaluation-harness library.☆85Updated 4 months ago
- Agent MCP for ffmpeg☆211Updated 7 months ago
- A lightweight UI for chatting with Ollama models. Streaming responses, conversation history, and multi-model support.☆147Updated 9 months ago
- Welcome!☆141Updated last year
- Optimized Ollama LLM server configuration for Mac Studio and other Apple Silicon Macs. Headless setup with automatic startup, resource op…☆273Updated 10 months ago
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 11 months ago
- A multi-agent AI architecture that connects 25+ specialized agents through n8n and MCP servers. Project NOVA routes requests to domain-sp…☆253Updated 7 months ago
- Explore the unknown, build the future, own your data.☆224Updated this week
- Blueprint by Mozilla.ai for generating podcasts from documents using local AI☆127Updated 2 weeks ago
- Nginx proxy server in a Docker container to Authenticate & Proxy requests to Ollama from Public Internet via Cloudflare Tunnel☆155Updated 4 months ago
- ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python)☆473Updated 4 months ago
- AI Studio is an independent app for utilizing LLMs.☆363Updated last week