RayFernando1337 / LLM-Calc
Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models for inference.
☆212Updated 3 months ago
Alternatives and similar repositories for LLM-Calc:
Users that are interested in LLM-Calc are comparing it to the libraries listed below
- FastMLX is a high performance production ready API to host MLX models.☆283Updated 2 weeks ago
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆298Updated last week
- ☆184Updated 4 months ago
- ☆75Updated 3 months ago
- ☆132Updated 2 months ago
- A Multi-Agent AI Tool that creates beautiful presentations with voice-overs 🎦🔥☆160Updated last month
- the simplest self-building general autonomous agent☆298Updated 5 months ago
- API Server for Transformer Lab☆45Updated this week
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,…☆203Updated 2 months ago
- ☆182Updated 2 months ago
- Scrapybara Python SDK☆53Updated last week
- Make any LLM to think like OpenAI o1 and deepseek R1☆479Updated last month
- ☆219Updated 5 months ago
- A multi-agent AI research system designed to know what it knows (and doesn't know) when conducting research and creating content.☆146Updated last month
- ☆146Updated last month
- Optimized Ollama LLM server configuration for Mac Studio and other Apple Silicon Macs. Headless setup with automatic startup, resource op…☆152Updated 3 weeks ago
- ☆86Updated last month
- ☆95Updated last week
- A list of useful Open Source tools and scrapers to gather data for LLMs☆224Updated last month
- A simple Python program to implement the search-extract-summarize flow.☆258Updated 2 months ago
- ☆85Updated 2 months ago
- ☆138Updated this week
- A Chrome extension for asking questions over websites☆331Updated last month
- A Python-based web-assisted large language model (LLM) search assistant using Llama.cpp☆346Updated 5 months ago
- ☆43Updated last week
- MCP server for enabling LLM applications to perform deep research via the MCP protocol☆69Updated this week
- II-Researcher: a new open-source framework designed to aid building search / research agents☆107Updated this week
- Letting Claude Code develop his own MCP tools :)☆91Updated 3 weeks ago
- Interactive timeline of AI history☆45Updated last week
- ☆214Updated this week