RayFernando1337 / LLM-Calc
Instantly calculate the maximum size of quantized language models that can fit in your available RAM, helping you optimize your models for inference.
☆151Updated last month
Alternatives and similar repositories for LLM-Calc:
Users that are interested in LLM-Calc are comparing it to the libraries listed below
- ☆73Updated last month
- ☆178Updated 2 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba…☆150Updated last month
- A powerful Python tool for performing technical searches using the Perplexity API, optimized for retrieving precise facts, code examples,…☆189Updated 2 weeks ago
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆241Updated 2 weeks ago
- An automated machine learning system that leverages O1 and Claude to iteratively develop, improve, and optimize ML solutions.☆78Updated 2 weeks ago
- ☆29Updated last month
- FastMLX is a high performance production ready API to host MLX models.☆256Updated 2 months ago
- ☆71Updated 2 weeks ago
- A growing collection of guides and tools based on Anthropic's Model Context Protocol standard for interfacing with LLMs☆46Updated this week
- 🤖 Headless IDE for AI agents☆156Updated 2 months ago
- ☆109Updated last month
- You don’t need to read the code to understand how to build!☆176Updated 2 weeks ago
- Hallucination Detector is a free and open-source tool that helps you verify the accuracy of your LLM generated content instantly.☆136Updated last week
- the simplest self-building general autonomous agent☆281Updated 3 months ago
- ☆220Updated last month
- ☆60Updated 3 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆144Updated last week
- Routing on Random Forest (RoRF)☆100Updated 4 months ago
- ☆75Updated 4 months ago
- Multi-person podcast audio to videocast☆10Updated 4 months ago
- Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs.☆84Updated 11 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆114Updated 8 months ago
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…☆36Updated 5 months ago
- Turn any developer documentation into a GPT☆83Updated 4 months ago