☆231Mar 23, 2026Updated 2 weeks ago
Alternatives and similar repositories for llm-scaler
Users that are interested in llm-scaler are comparing it to the libraries listed below.
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆376Updated this week
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Dec 2, 2024Updated last year
- LlamaNet: Decentralized Inference Swarm for llama.cpp☆23Jan 18, 2026Updated 2 months ago
- Batch processor that enables large content to be digested by Ollama, focused on book processing and translations by default, fully, configu…☆36Oct 27, 2025Updated 5 months ago
- A lightweight chat interface for interacting with local models, featuring persistent memory using a seamless SQLite database to store you…☆32Sep 15, 2025Updated 6 months ago
- Local runner for Microsoft VibeVoice Realtime TTS. Fully compatible with Open-Webui, plug and play. OpenAI API endpoint. Run the Colab note…☆32Mar 13, 2026Updated 3 weeks ago
- ☆59Mar 6, 2026Updated last month
- A Streamlit app for generating high-quality Q&A training datasets from text and PDFs, leveraging Gemini, Claude, and OpenAI for LLM fine-…☆39Jul 5, 2025Updated 9 months ago
- Long-term Research Assistants with Self-Scheduling☆53Mar 22, 2026Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Apr 3, 2026Updated last week
- VEDA (VE Driver API)☆19Mar 17, 2026Updated 3 weeks ago
- Model souping for LLMs☆73Nov 18, 2025Updated 4 months ago
- Repository with examples and exercises for OLCF and AMD's HIP training series☆17Oct 16, 2023Updated 2 years ago
- Standalone repo for our Atropos integration with Thinking Machines Tinker API (https://thinkingmachines.ai/tinker/)☆31Mar 22, 2026Updated 2 weeks ago
- Layered Omni-architecture Openfluke Machine☆131Updated this week
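- An all-in-one llama.cpp package, built for my own use on AI MAX+ 395 machines but in practice usable on other devices. If you run into problems, open an issue (replies may not be prompt); you can also join QQ group 829631748.☆65Updated this week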
- AirLLM 70B inference with single 4GB GPU☆20Jun 27, 2025Updated 9 months ago
- ☆83Updated this week
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆179Apr 1, 2026Updated last week
- Simple model memory requirements calculator for GGUF☆82Jan 20, 2026Updated 2 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- Production-ready Python library for multi-provider LLM orchestration☆41Oct 10, 2025Updated 6 months ago
- Velocity And Luminance Adaptive Rasterization☆16Mar 31, 2023Updated 3 years ago
- A real-time face landmark detection application built with React, TypeScript, and MediaPipe.☆49May 11, 2025Updated 10 months ago
- Python barebones for uProbe-1 ultrasound probe acquisitions☆15Nov 11, 2017Updated 8 years ago
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj…☆36Feb 20, 2026Updated last month
- Doom for Gear VR☆19Jun 4, 2019Updated 6 years ago
- ☆13Jan 14, 2026Updated 2 months ago
- Code for the ICML 2025 Paper "Product of Experts with LLMs: Boosting Performance on ARC is a Matter of Perspective"☆51Nov 9, 2025Updated 5 months ago
- Implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers☆49Feb 5, 2026Updated 2 months ago
- A Prot paper related materials☆11Sep 5, 2022Updated 3 years ago
- ComfyUI nodes for Wan 2.2 SVI 2 Pro with Keyframe control via First/Last Frame and seamless video stitching.☆56Mar 31, 2026Updated last week
- RosenPy is a complex-valued neural network library, written in Python; Incorporates CVNNs such as CV-FFNN (complex-valued feedforward neu…☆14Sep 17, 2024Updated last year
- ☆20Jul 23, 2025Updated 8 months ago
- Feature-rich web interface designed to interact with a local ComfyUI☆76Dec 10, 2025Updated 4 months ago
- An iOS app for Jamf Pro Cloud Server☆11May 27, 2021Updated 4 years ago
- ☆76Mar 31, 2026Updated last week
- A Windows executable to generate MilkVR ".mvrl" files for a collection of videos on your local PC, allowing easy access to those videos f…☆12Aug 24, 2017Updated 8 years ago
- Train and run transformers directly on Apple's Neural Engine in Swift☆92Updated this week