☆187Mar 12, 2026Updated last week
Alternatives and similar repositories for llm-scaler
Users that are interested in llm-scaler are comparing it to the libraries listed below
Sorting:
- Cache-DiT Node for Comfyui☆251Feb 11, 2026Updated last month
- BitDance custom nodes for ComfyUI with unified loader, text encode, sampler, and VAE nodes.☆33Feb 26, 2026Updated 3 weeks ago
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆346Updated this week
- ☆19Aug 19, 2025Updated 7 months ago
- Batch processor to enable large content be digested by Ollama, focused around book processing and translations by default, fully, configu…☆36Oct 27, 2025Updated 4 months ago
- ☆58Mar 6, 2026Updated 2 weeks ago
- Long-term Research Assistants with Self-Scheduling☆53Mar 10, 2026Updated last week
- A Streamlit app for generating high-quality Q&A training datasets from text and PDFs, leveraging Gemini, Claude, and OpenAI for LLM fine-…☆39Jul 5, 2025Updated 8 months ago
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆18Feb 22, 2026Updated 3 weeks ago
- Mini-Engine Demonstration of Combining XeSS with VRS Tier 2.☆14Jan 26, 2026Updated last month
- Deploy an elm HTTP API to AWS Lambda using serverless☆11Jan 25, 2021Updated 5 years ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆79Oct 11, 2025Updated 5 months ago
- llamacpp的整合包,自用于AI MAX+ 395机器,但是Linux+Windows实际通用。如有问题可以私信B站:防爆键盘☆55Updated this week
- Layered Omni-architecture Openfluke Machine☆130Updated this week
- AirLLM 70B inference with single 4GB GPU☆19Jun 27, 2025Updated 8 months ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆170Mar 13, 2026Updated last week
- Simple model memory requirements calculator for GGUF☆82Jan 20, 2026Updated 2 months ago
- LEMMA: Logical Engine for Multi-domain Mathematical Analysis☆28Feb 14, 2026Updated last month
- Production-ready Python library for multi-provider LLM orchestration☆40Oct 10, 2025Updated 5 months ago
- A benchmarking tool for Wisp protocol implementations.☆10Dec 12, 2025Updated 3 months ago
- A highly-configurable RISC-V Core☆33Dec 27, 2025Updated 2 months ago
- Advanced drum machine for ComfyUI featuring a 64-step sequencer, custom sample support, and retro hardware aesthetics.☆20Jan 19, 2026Updated 2 months ago
- Doom for Gear VR☆19Jun 4, 2019Updated 6 years ago
- android_device_moto_wingray☆11May 11, 2016Updated 9 years ago
- Code for the ICML 2025 Paper "Product of Experts with LLMs: Boosting Performance on ARC is a Matter of Perspective"☆50Nov 9, 2025Updated 4 months ago
- Official implementation of Categorical Flow Maps on text.☆47Feb 16, 2026Updated last month
- My Tampermonkey scripts☆18Feb 26, 2026Updated 3 weeks ago
- Powdered Metal — High performance LLM fine-tuning framework for Apple Silicon☆133Updated this week
- [CVPR 2025] Official Implementation of LOCORE: Image Re-ranking with Long-Context☆15Apr 15, 2025Updated 11 months ago
- An iOS app for Jamf Pro Cloud Server☆11May 27, 2021Updated 4 years ago
- ☆10Updated this week
- A lightweight graphics library for the Elm programming language☆15Jul 15, 2017Updated 8 years ago
- Spark-Cassandra Bulk Reader CASSANDRA-16222☆21Jul 24, 2023Updated 2 years ago
- ☆40Feb 14, 2026Updated last month
- Fast linear algebra for Elm☆26Sep 3, 2018Updated 7 years ago
- 4 bits quantization of LLaMa using GPTQ☆12Jun 2, 2023Updated 2 years ago
- A repository of Dockerfiles, scripts, yaml files, Helm Charts, etc. used to build and scale the sample AI workflows with python, kubernet…☆12Feb 22, 2024Updated 2 years ago
- Why would you do this?☆11Feb 10, 2025Updated last year
- Docker image serving element, a matrix client.☆11Mar 12, 2026Updated last week