Lsglang is a specialized extension of sglang that makes full use of CPU and GPU compute through an efficient GPU-parallel plus NUMA-parallel architecture, and is suited to hybrid inference of MoE models.
☆81 · Apr 22, 2026 · Updated last month
Alternatives and similar repositories for Lsglang
Users who are interested in Lsglang are comparing it to the libraries listed below.
- Free DeepSeek API ☆72 · Apr 25, 2026 · Updated last month
- LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features … ☆366 · Apr 28, 2026 · Updated last month
- AI Demo project: a collection of hands-on examples for developers who want to learn and explore artificial intelligence (AI) technology. ☆31 · May 17, 2026 · Updated last week
- Engine tools used by companies under AMUSE CRAFT ☆16 · Jan 11, 2025 · Updated last year
- NovelFlow is an AI platform that automatically converts novels into videos. Its text-to-image, image-to-image, and image-to-video steps are based on open-source models and workflows shared by Bilibili creators. … ☆78 · May 10, 2026 · Updated 2 weeks ago
- TNT ☆13 · May 20, 2026 · Updated last week
- ☆14 · Sep 4, 2024 · Updated last year
- Imports Beihang University class schedules into the system calendar on various platforms ☆10 · Mar 5, 2020 · Updated 6 years ago
- One-click deployment script for KTransformers ☆59 · Apr 18, 2025 · Updated last year
- An attempt at UI development. Clone of https://www.perplexity.ai/ ☆11 · Sep 30, 2023 · Updated 2 years ago
- An interactive text novel reading tool based on MisakaTranslator. ☆12 · Feb 22, 2026 · Updated 3 months ago
- Valkyria's Engine Tools. | .sdt .dat .mg2 ☆15 · Oct 7, 2024 · Updated last year
- SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization ☆28 · Jul 13, 2022 · Updated 3 years ago
- CPU/GPU Implicit & Explicit Finite Element Solver for Large Strains ☆24 · Feb 20, 2026 · Updated 3 months ago
- Support finetuning GLM4v with zero2 ☆16 · Jun 29, 2024 · Updated last year
- LLM inference in C/C++ ☆21 · May 22, 2026 · Updated last week
- Minecraft mod in which you unlock the world chunk by chunk ☆22 · Dec 31, 2024 · Updated last year
- Claude2 to OpenAI API ☆17 · Aug 30, 2023 · Updated 2 years ago
- Trains a text classification model with PyTorch on top of a pretrained BERT model ☆19 · Dec 26, 2023 · Updated 2 years ago
- Forked from yanhua0518/GALgameScriptTools/tree/master/SiglusEngine ☆30 · Nov 9, 2020 · Updated 5 years ago
- An extension that handles TeX math rendering for your Flarum forum. ☆14 · Oct 7, 2022 · Updated 3 years ago
- triton for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60 ☆47 · Dec 8, 2025 · Updated 5 months ago
- Squrve is a lightweight yet powerful framework for translating natural language into SQL over complex databases. ☆50 · Feb 23, 2026 · Updated 3 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆30 · Jan 19, 2025 · Updated last year
- Retrieval Augmented Generation (RAG) implementation through libraries like Tavily, LangChain, ChatGLM3 ☆22 · Jul 23, 2025 · Updated 10 months ago
- Chat with New Bing via API ☆22 · Jan 24, 2024 · Updated 2 years ago
- OpenFOAM Foundation repository for OpenFOAM version 13 ☆85 · Updated this week
- Efficient computer use agent powered by Meta Llama 4 Maverick ☆46 · May 11, 2026 · Updated 2 weeks ago
- Chinese localization tutorial for Livemaker ☆51 · Feb 11, 2025 · Updated last year
- Reverse-engineers the Claude web client into the form of the OpenAI and Claude APIs ☆28 · Jul 13, 2023 · Updated 2 years ago
- ☆77 · May 15, 2026 · Updated 2 weeks ago
- OpenClaw skills for deep search: multi-source search, content extraction, and structured research reports. ☆434 · Mar 18, 2026 · Updated 2 months ago
- ☆126 · May 19, 2026 · Updated last week
- A local web UI project for RWKV, a writing LLM without content moderation, with GPT-SoVITS integration ☆57 · Apr 4, 2024 · Updated 2 years ago
- Random ipv6 egress proxy server (support http/socks5) ☆42 · Jan 15, 2024 · Updated 2 years ago
- Use my DLL tools as plugins ☆52 · Nov 23, 2025 · Updated 6 months ago
- Notebooks to run SakuraLLM on colab ☆55 · Oct 1, 2024 · Updated last year
- MCP server for Slidev to make web slide decks quickly and elegantly ☆92 · Nov 17, 2025 · Updated 6 months ago
- Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossi… ☆140 · Mar 24, 2026 · Updated 2 months ago