intel / llm-scaler
☆38 · Updated this week
Alternatives and similar repositories for llm-scaler
Users who are interested in llm-scaler are comparing it to the libraries listed below.
- Lightweight Inference server for OpenVINO ☆206 · Updated this week
- Run multiple resource-heavy Large Models (LM) on the same machine with a limited amount of VRAM/other resources by exposing them on differe… ☆73 · Updated 2 weeks ago
- LLM Ripper is a framework for component extraction (embeddings, attention heads, FFNs), activation capture, functional analysis, and adap… ☆42 · Updated last week
- Intel® AI Assistant Builder ☆101 · Updated this week
- Benchmark for local LLMs with German "Who Wants to Be a Millionaire" questions. ☆34 · Updated this week
- ☆66 · Updated this week
- No-code CLI designed for accelerating ONNX workflows ☆210 · Updated 2 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to … ☆39 · Updated last year
- Running Microsoft's BitNet via Electron, React & Astro ☆43 · Updated 3 months ago
- GPU Power and Performance Manager ☆61 · Updated 10 months ago
- ☆53 · Updated last year
- Enhancing LLMs with LoRA ☆128 · Updated 3 weeks ago
- Simple system tray application to monitor the status of your LLM models running on Ollama ☆21 · Updated 2 months ago
- llama.cpp fork used by GPT4All ☆56 · Updated 6 months ago
- ☆83 · Updated this week
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining. ☆42 · Updated last month
- Kick is an AI-powered assistant that provides voice and keyboard control over your Windows device, enabling seamless automation of your d… ☆16 · Updated last month
- Bookmarklet to pull and run Hugging Face GGUF models in Ollama ☆15 · Updated 10 months ago
- Ampere optimized llama.cpp ☆23 · Updated this week
- Build an AI Agent from Libraries of Functions -- My most advanced agent framework ☆122 · Updated 2 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search … ☆43 · Updated last week
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng… ☆80 · Updated last month
- An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model. ☆17 · Updated last year
- The easiest & fastest way to run LLMs in your home lab ☆65 · Updated last week
- Autonomous, agentic, creative story writing system that incorporates stored embeddings and Knowledge Graphs. ☆76 · Updated this week
- The IDE for research, built from the ground up with AI integrations ☆76 · Updated this week
- Allows two LLMs to communicate and run code in the terminal ☆26 · Updated 8 months ago
- A proxy that hosts multiple single-model runners such as llama.cpp and vLLM ☆11 · Updated 3 months ago
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take… ☆77 · Updated 3 weeks ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others. ☆33 · Updated 2 weeks ago