Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parallel architecture, suitable for MOE model hybrid inference.
☆46Mar 12, 2026Updated 2 weeks ago
Alternatives and similar repositories for Lsglang
Users that are interested in Lsglang are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features …☆283Mar 20, 2026Updated last week
- AI Demo 项目,一个专门为希望学习和探索人工智能(AI)技术的开发者准备的实战案例集合。☆25Jan 3, 2026Updated 2 months ago
- ExHIBIT RLD String Editor☆15Oct 14, 2024Updated last year
- AMUSE CRAFT旗下会社所用引擎工具☆15Jan 11, 2025Updated last year
- 🎭 character card editor online.☆25Jan 8, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- TNT☆11Updated this week
- Multi-tenant RAG API powered by LightRAG/RAG-Anything. Auto-selects best parser (DeepSeek-OCR/MinerU/Docling) via complexity scoring☆49Dec 15, 2025Updated 3 months ago
- ☆14Sep 4, 2024Updated last year
- Cross-platform image decoder(png/jpeg/gif) and encoder(png/jpeg) for Nodejs☆26Jun 28, 2017Updated 8 years ago
- 将北航课表导入到各个平台的系统日历中☆10Mar 5, 2020Updated 6 years ago
- A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier☆23Feb 12, 2026Updated last month
- KTransformers 一键部署脚本☆60Apr 18, 2025Updated 11 months ago
- Tries to UI development. Clone of https://www.perplexity.ai/☆11Sep 30, 2023Updated 2 years ago
- 仅供自用☆11Updated this week
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- 基于 MisakaTranslator 的互动文字小说阅读工具。☆12Feb 22, 2026Updated last month
- Valkyria's Engine Tools. | .sdt .dat .mg2☆15Oct 7, 2024Updated last year
- Project examples for spm.☆20Jun 19, 2015Updated 10 years ago
- LLM as Agent☆21Sep 23, 2023Updated 2 years ago
- ☆77Mar 23, 2026Updated last week
- The code for haze removal using dark channel prior, which was a part of the self-driving car project☆18Sep 26, 2021Updated 4 years ago
- Simple downloader for filehosting service BuzzHeavier, written in Python.☆16Jul 9, 2025Updated 8 months ago
- SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization☆28Jul 13, 2022Updated 3 years ago
- CPU/GPU Implicit & Explicit Finite Element Solver for Large Strains☆22Feb 20, 2026Updated last month
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- socket阻塞式长链接和非阻塞式长链接的讲解示例☆11Nov 18, 2016Updated 9 years ago
- Support finetuning GLM4v with zero2☆16Jun 29, 2024Updated last year
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in…☆18Nov 11, 2024Updated last year
- Dynamo Multi AI Agent POC: Unlock the Power of Spring AI and LangGraph4J☆25Dec 12, 2024Updated last year
- NACOS漏洞利用脚本,检测默认弱口令,未授权,以及任意用户添加☆16May 28, 2023Updated 2 years ago
- A python nacos sdk client based on the official openapi(一个基于Nacos官方API的python客户端实现,支持同步和异步)☆13Mar 14, 2026Updated 2 weeks ago
- ☆14Feb 24, 2020Updated 6 years ago
- Full-text searching for NodeBB using Meilisearch☆15Updated this week
- LLM inference in C/C++☆21Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Minecraft mod in which you unlock the world chunk by chunk☆22Dec 31, 2024Updated last year
- ☆12Feb 5, 2025Updated last year
- 检测透视图像中的矩形文档并对其进行矫正☆31Sep 16, 2022Updated 3 years ago
- OMGEval😮: An Open Multilingual Generative Evaluation Benchmark for Foundation Models☆36Jul 19, 2024Updated last year
- 基于ThinkPHP3.2框架完成的企业网站CMS系统,快速搭建可商用的企业网站,接私活利器☆16Jul 29, 2018Updated 7 years ago
- ☆29Updated this week
- Claude2 to OpenAI API☆17Aug 30, 2023Updated 2 years ago