Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parallel architecture, suitable for MOE model hybrid inference.
☆38Mar 7, 2026Updated this week
Alternatives and similar repositories for Lsglang
Users that are interested in Lsglang are comparing it to the libraries listed below
Sorting:
- Simple downloader for filehosting service BuzzHeavier, written in Python.☆16Jul 9, 2025Updated 8 months ago
- ExHIBIT RLD String Editor☆15Oct 14, 2024Updated last year
- LvLLM is a special NUMA extension of vllm that makes full use of CPU and memory resources, reduces GPU memory requirements, and features …☆250Updated this week
- AMUSE CRAFT旗下会社所用引擎工具☆14Jan 11, 2025Updated last year
- TNT☆11Feb 15, 2025Updated last year
- ☆12Feb 5, 2025Updated last year
- 🎭 character card editor online.☆20Jan 8, 2026Updated 2 months ago
- KTransformers 一键部署脚本☆58Apr 18, 2025Updated 10 months ago
- ☆14Sep 4, 2024Updated last year
- Valkyria's Engine Tools. | .sdt .dat .mg2☆15Oct 7, 2024Updated last year
- Tries to UI development. Clone of https://www.perplexity.ai/☆11Sep 30, 2023Updated 2 years ago
- AI Demo 项目,一个专门为希望学习和探索人工智能(AI)技术的开发者准备的实战案例集合。☆25Jan 3, 2026Updated 2 months ago
- 基于 MisakaTranslator 的互动文字小说阅读工具。☆12Feb 22, 2026Updated 2 weeks ago
- CPU/GPU Implicit & Explicit Finite Element Solver for Large Strains☆21Feb 20, 2026Updated 2 weeks ago
- LLM as Agent☆21Sep 23, 2023Updated 2 years ago
- 仅供自用☆11Updated this week
- ☆74Updated this week
- 将北航课表导入到各个平台的系统日历中,可以方便地查看课表,并支持上课提醒☆10Mar 5, 2020Updated 6 years ago
- Full-text searching for NodeBB using Meilisearch☆15Updated this week
- The code for haze removal using dark channel prior, which was a part of the self-driving car project☆18Sep 26, 2021Updated 4 years ago
- An extension that handles TeX math rendering for your Flarum forum.☆13Oct 7, 2022Updated 3 years ago
- Multi-tenant RAG API powered by LightRAG/RAG-Anything. Auto-selects best parser (DeepSeek-OCR/MinerU/Docling) via complexity scoring☆46Dec 15, 2025Updated 2 months ago
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in…☆18Nov 11, 2024Updated last year
- A dedicated Colab notebooks to experiment (Nanonets OCR, Monkey OCR, OCRFlux 3B, Typhoo OCR 3B & more..) On T4 GPU - free tier☆23Feb 12, 2026Updated 3 weeks ago
- Claude2 to OpenAI API☆17Aug 30, 2023Updated 2 years ago
- ☆14Feb 24, 2020Updated 6 years ago
- Cross-platform image decoder(png/jpeg/gif) and encoder(png/jpeg) for Nodejs☆26Jun 28, 2017Updated 8 years ago
- LLM inference in C/C++☆21Mar 22, 2025Updated 11 months ago
- triton for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆42Dec 8, 2025Updated 3 months ago
- Interactive class notebooks for ECE4076 Computer Vision.☆30Mar 2, 2026Updated last week
- Minecraft mod in which you unlock the world chunk by chunk☆21Dec 31, 2024Updated last year
- ☆28Updated this week
- Project examples for spm.☆20Jun 19, 2015Updated 10 years ago
- ☆56Nov 12, 2025Updated 3 months ago
- Chat with New Bing via API☆23Jan 24, 2024Updated 2 years ago
- 专业的金融数据获取工具库 - A Professional Financial Data Fetching Toolkit for Python☆85Updated this week
- SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization☆28Jul 13, 2022Updated 3 years ago
- Intercept Google Antigravity IDE API calls and use your own Gemini API token☆55Dec 15, 2025Updated 2 months ago
- Automatically exported from code.google.com/p/xp3dumpergui☆31Jan 17, 2020Updated 6 years ago