☆380Jun 26, 2026Updated this week
Alternatives and similar repositories for llm-scaler
Users that are interested in llm-scaler are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository Flash Local Linear Attention☆37May 28, 2026Updated last month
- Samples running deep learning models on Intel GPU Arc A770☆15Jul 4, 2024Updated last year
- Llama.cpp launcher with integrated huggingface☆57Jun 4, 2026Updated 3 weeks ago
- ☆178Updated this week
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆471Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- HTTP proxy that fixes malformed tool calls from Qwen3-Coder LLM models for seamless integration with OpenCode☆37Aug 15, 2025Updated 10 months ago
- Batch processor to enable large content be digested by Ollama, focused around book processing and translations by default, fully, configu…☆36Oct 27, 2025Updated 8 months ago
- A lightweight chat interface for interacting with local models, featuring persistent memory using a seamless SQLite database to store you…☆34Sep 15, 2025Updated 9 months ago
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆57Jan 16, 2026Updated 5 months ago
- ☆60Mar 6, 2026Updated 3 months ago
- A Streamlit app for generating high-quality Q&A training datasets from text and PDFs, leveraging Gemini, Claude, and OpenAI for LLM fine-…☆40Jul 5, 2025Updated 11 months ago
- Local runner for Microsoft VibeVoice Realtime TTS Fully compatible with Open-Webui Plug and Play. OpenAI api endpoint .Run the Colab note…☆40May 9, 2026Updated last month
- Deploy an elm HTTP API to AWS Lambda using serverless☆12Jan 25, 2021Updated 5 years ago
- Local AI runtime for training & running small LLMs directly on Apple Neural Engine (ANE). No CoreML. No Metal. Offline, on-device fine-tu…☆102Mar 6, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14May 25, 2023Updated 3 years ago
- Simple and Ideal Circuit Simulation☆13Dec 4, 2017Updated 8 years ago
- AirLLM 70B inference with single 4GB GPU☆21Jun 27, 2025Updated last year
- Explainable AI Tooling (XAI). XAI is used to discover and explain a model's prediction in a way that is interpretable to the user. Releva…☆39Sep 22, 2025Updated 9 months ago
- ONNX Runtime: cross-platform, high performance scoring engine for ML models☆88Updated this week
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Simple model memory requirements calculator for GGUF☆85Jan 20, 2026Updated 5 months ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆250Updated this week
- Software kit for Qualcomm Cloud AI 100☆19Dec 15, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Production-ready Python library for multi-provider LLM orchestration☆41Jun 19, 2026Updated last week
- Velocity And Luminance Adaptive Rasterization☆16Mar 31, 2023Updated 3 years ago
- A real-time face landmark detection application built with React, TypeScript, and MediaPipe.☆49May 11, 2025Updated last year
- Advanced drum machine for ComfyUI featuring a 64-step sequencer, custom sample support, and retro hardware aesthetics.☆20Jan 19, 2026Updated 5 months ago
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj…☆40Feb 20, 2026Updated 4 months ago
- Doom for Gear VR☆19Jun 4, 2019Updated 7 years ago
- An open-source, self-hosted crypto-payment service. Your cryptos, your data, your control — no tracking, no ads, no subscription fees.☆37Jan 16, 2026Updated 5 months ago
- android_device_moto_wingray☆11May 11, 2016Updated 10 years ago
- 💻 SETA: Scaling Environments for Terminal Agents - Environments☆139Feb 16, 2026Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the ICML 2025 Paper "Product of Experts with LLMs: Boosting Performance on ARC is a Matter of Perspective"☆55Nov 9, 2025Updated 7 months ago
- A Prot paper related materials☆11Sep 5, 2022Updated 3 years ago
- Unofficial implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers☆53Apr 28, 2026Updated 2 months ago
- HanaVerse is a interactive web UI for chatting with ollama with a lively 2D anime character Hana. Star it on GitHub!☆61May 17, 2025Updated last year
- Research that compiles.☆85Apr 19, 2026Updated 2 months ago
- ☆23Jul 23, 2025Updated 11 months ago
- Self-hosted personal AI agent and employee for workflow automation in your DMs. It writes code, runs tools, schedules jobs, saves workflo…☆38Jun 17, 2026Updated last week