Lsglang is an extension of sglang that uses both CPU and GPU compute through a combined GPU-parallel and NUMA-parallel architecture, suited to hybrid inference of MoE models.
☆83Jun 15, 2026Updated this week
Alternatives and similar repositories for Lsglang
Users that are interested in Lsglang are comparing it to the libraries listed below.
- ExHIBIT RLD String Editor☆15Oct 14, 2024Updated last year
- TNT☆13May 29, 2026Updated 2 weeks ago
- 🎭 character card editor online.☆30Apr 13, 2026Updated 2 months ago
- An attempt at UI development. Clone of https://www.perplexity.ai/☆11Sep 30, 2023Updated 2 years ago
- For personal use only☆11Updated this week
- A reading tool for interactive text novels, based on MisakaTranslator.☆12Feb 22, 2026Updated 3 months ago
- Valkyria's Engine Tools. | .sdt .dat .mg2☆15Oct 7, 2024Updated last year
- CPU/GPU Implicit & Explicit Finite Element Solver for Large Strains☆26Feb 20, 2026Updated 3 months ago
- Mixed-precision quantization for LLMs. Every layer refracts into a different format based on its sensitivity. Native compressed-tensors e…☆80Updated this week
- Support finetuning GLM4v with zero2☆16Jun 29, 2024Updated last year
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in…☆18Nov 11, 2024Updated last year
- repo for AMD ROCDXG project☆106Jun 10, 2026Updated last week
- Minecraft mod in which you unlock the world chunk by chunk☆22Dec 31, 2024Updated last year
- Claude2 to OpenAI API☆17Aug 30, 2023Updated 2 years ago
- A Langchain-based RAG knowledge base system for academic papers☆17Sep 25, 2024Updated last year
- ☆71Feb 27, 2026Updated 3 months ago
- ☆73May 2, 2026Updated last month
- ☆32Jun 10, 2026Updated last week
- Forked from yanhua0518/GALgameScriptTools/tree/master/SiglusEngine☆30Nov 9, 2020Updated 5 years ago
- AnyTrans: Translate AnyText in the Image with Large Scale Models (EMNLP2024 Findings)☆25Dec 11, 2024Updated last year
- triton for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆49Dec 8, 2025Updated 6 months ago
- This is an AI software toolkit in the UNIX tradition.☆49Updated this week
- Simple Hierarchical RAG Framework☆84May 8, 2026Updated last month
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆30Jan 19, 2025Updated last year
- Triton for AMD MI25/50/60. Development repository for the Triton language and compiler☆34Dec 15, 2025Updated 6 months ago
- Retrieval Augmented Generation (RAG) implementation through libraries like Tavily, LangChain, ChatGLM3☆21Jul 23, 2025Updated 10 months ago
- Commonly used Docker services☆31Jul 22, 2023Updated 2 years ago
- ☆174Feb 14, 2026Updated 4 months ago
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆99Oct 23, 2025Updated 7 months ago
- Place to hack on UI for InstructLab☆38Feb 11, 2026Updated 4 months ago
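- Sets up LightRAG for Tavern users and connects it to Tavern, for a more advanced and easier-to-use RAG + Tavern setup (currently developed in sync with another repository)☆52Jan 6, 2025Updated last year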
- OpenFOAM Foundation repository for OpenFOAM version 13☆94May 26, 2026Updated 3 weeks ago
- Efficient computer use agent powered by Meta Llama 4 Maverick☆46May 11, 2026Updated last month
- sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems☆343Updated this week
- Reverse-engineers the claude web client into the form of an openai and claude api☆28Jul 13, 2023Updated 2 years ago
- Tutorial for Chinese localization of Livemaker☆51Feb 11, 2025Updated last year
- ☆79Updated this week
- ☆52Mar 17, 2025Updated last year
- Notebooks to run SakuraLLM on colab☆55Oct 1, 2024Updated last year