meituan-longcat / SGLang-FluentLLMView external linksLinks
☆58Updated this week
Alternatives and similar repositories for SGLang-FluentLLM
Users that are interested in SGLang-FluentLLM are comparing it to the libraries listed below
Sorting:
- MongoDB Wrapper with Go Generics☆12Jun 16, 2024Updated last year
- Pure Java Llama2 inference with optional multi-GPU CUDA implementation☆13Sep 2, 2023Updated 2 years ago
- ☆32Dec 10, 2025Updated 2 months ago
- ☆10Mar 2, 2024Updated last year
- An all-platform VRChat heart rate broadcasting tool developed with Flutter.☆25Feb 3, 2026Updated last week
- The repository maintains the source code for the article titled "Optimizing Attention by Exploiting Data Reuse on ARM Multi-core CPUs."☆15Dec 1, 2024Updated last year
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Updated this week
- [ICLR25] STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs☆18Jun 3, 2025Updated 8 months ago
- Example of applying CUDA graphs to LLaMA-v2☆12Aug 25, 2023Updated 2 years ago
- Style, manipulate and create links in your editor using regular expressions.☆20Feb 11, 2024Updated 2 years ago
- A squarified treemap☆19Nov 18, 2025Updated 2 months ago
- ☆18Apr 8, 2022Updated 3 years ago
- ☆21Apr 13, 2022Updated 3 years ago
- TernGEMM: General Matrix Multiply Library with Ternary Weights for Fast DNN Inference☆14Feb 22, 2022Updated 3 years ago
- ☆15Jul 2, 2024Updated last year
- A direct convolution library targeting ARM multi-core CPUs.☆12Nov 27, 2024Updated last year
- Kether动作补全:原生动作+TabooLib提供公有动作+questengine提供的私人动作+Chemdah公私动作+KetherScript代码高亮已完成!☆15Dec 10, 2023Updated 2 years ago
- ☆17Jun 23, 2024Updated last year
- some data mods, generated programmatically, so they can be recreated automagically for every new patch☆21Aug 17, 2025Updated 5 months ago
- pdf书☆19Aug 16, 2018Updated 7 years ago
- CAKE Library for constant-bandwidth matrix multiplication on CPUs☆14Apr 6, 2024Updated last year
- Explore Inter-layer Expert Affinity in MoE Model Inference☆16May 6, 2024Updated last year
- 明日方舟集成战略开局生成器 Arknights Integrated Strategies Opening Generator (以及其他玩具)☆20Updated this week
- [ECAI 2023 Oral] Official Implementation of High Dynamic Range Image Reconstruction via Deep Explicit Polynomial Curve Estimation☆21Nov 3, 2024Updated last year
- Vue3 Components inspired by Microsoft's Fluent Design System.☆19Jan 23, 2026Updated 3 weeks ago
- 💳 A highly customizable distributed unique ID generator in TypeScript☆17Mar 22, 2025Updated 10 months ago
- ☆50Jan 28, 2026Updated 2 weeks ago
- A light-weight data management system for large-scale pretraining☆21May 17, 2025Updated 8 months ago
- Official code and data repository of MathChat: MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Inte…☆22Jun 3, 2024Updated last year
- Awesome Continual Multi-view Clustering is a collection of SOTA, novel continual multi-view clustering methods (papers, codes).☆27Oct 31, 2025Updated 3 months ago
- Nanos6 is a runtime that implements the OmpSs-2 parallel programming model, developed by the System Tools and Advanced Runtimes (STAR) gr…☆22Jun 6, 2025Updated 8 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Nov 11, 2024Updated last year
- ☆23Jan 23, 2026Updated 3 weeks ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆25Jul 31, 2024Updated last year
- ☆26Sep 1, 2020Updated 5 years ago
- DCPO: Dynamic Adaptive Clipping for RL☆45Dec 20, 2025Updated last month
- Using Go to create a program that logins BUAA GW from the command line. "gw.buaa.edu.cn" login commandline 北航校园网登录程序☆22May 2, 2023Updated 2 years ago
- 微信群消息屏蔽器☆22Sep 1, 2021Updated 4 years ago