yassa9/qwen600

Static suckless single batch CUDA-only qwen3-0.6B mini inference engine

☆545

Alternatives and similar repositories for qwen600

Users who are interested in qwen600 are comparing it to the repositories listed below.

hellangleZ / Qwen3_autothink_adapter
Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…
☆22 · May 9, 2025 · Updated 10 months ago
karminski / teach-fish-to-swim
Wanna breeze through some papers?
☆95 · Feb 27, 2026 · Updated last week
montanaflynn / asxiv
An AI-powered interface for exploring and understanding arXiv research papers
☆241 · Jan 4, 2026 · Updated 2 months ago
j178 / github-contrib-stats
☆74 · Feb 25, 2026 · Updated last week
thomaschlt / mla.c
Implementation from scratch in C of the Multi-head latent attention used in the Deepseek-v3 technical paper.
☆18 · Jan 15, 2025 · Updated last year
httaotao / filesystem-study-book
PDF of the book "Understanding File System Principles and Practice in Depth" (《深入理解文件系统原理和实践》), ISBN: 978-7-89381-214-9
☆28 · May 6, 2024 · Updated last year
lcy-seso / DLFrameworkTest
My tests and experiments with some popular dl frameworks.
☆17 · Sep 11, 2025 · Updated 5 months ago
AaronFeng753 / Better-Qwen3
Auto Thinking Mode switch for Qwen3 in Open webui
☆70 · May 8, 2025 · Updated 10 months ago
andrewkchan / yalm
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
☆554 · Sep 13, 2025 · Updated 5 months ago
xlite-dev / LeetCUDA
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
☆9,815 · Feb 25, 2026 · Updated last week
horus-ai-labs / DistillFlow
Library for model distillation
☆165 · Sep 6, 2025 · Updated 6 months ago
joeseesun / drawnix-seedream
Drawnix whiteboard application - open source collaborative drawing tool with mind maps, flowcharts, and free drawing capabilities
☆260 · Sep 11, 2025 · Updated 5 months ago
farshed / sage
Self-hosted voice chat with LLMs
☆463 · Feb 28, 2025 · Updated last year
mscheong01 / speculative_decoding.c
Minimal C implementation of speculative decoding based on llama2.c
☆28 · Jul 15, 2024 · Updated last year
muriloboratto / NVSHEMEM
Sample Codes using NVSHMEM on Multi-GPU
☆30 · Jan 22, 2023 · Updated 3 years ago
itorr / pixel-kumiko
🐙 "Kumiko's Big Adventure" (久美子大冒险), a small 2D pixel-art game
☆20 · Oct 10, 2024 · Updated last year
yan5xu / ququ
Free, open-source alternative to Wispr Flow | Next-generation Chinese desktop voice workflow integrating local FunASR models and configurable large language models
☆2,012 · Oct 8, 2025 · Updated 5 months ago
CIS1900 / 2022-fall
☆13 · Sep 5, 2024 · Updated last year
alterxyz / YTelegraph
Python Telegraph API.
☆15 · Mar 22, 2025 · Updated 11 months ago
dcaoyuan / vibetrader
VibeTrader - Toward an open source AI-friendly trading platform.
☆29 · Updated this week
OpenXiangShan / CPU2006LiteWrapper
☆13 · Jan 16, 2026 · Updated last month
MoonshotAI / kosong
The LLM abstraction layer for modern AI agent applications.
☆509 · Feb 24, 2026 · Updated last week
karminski / one-small-step
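A simple popular-science tutorial project that focuses on explaining interesting, cutting-edge technical concepts and principles. Each article aims to be readable in under 5 minutes.
☆6,730 · Nov 10, 2025 · Updated 3 months ago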
ConvolutedDog / gpgpu-sim-comments
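GPGPU-Sim code with Chinese annotations: contains the latest GPGPU-Sim simulator code, commented in Chinese to help Chinese-speaking users better understand and use the simulator.
☆28 · Dec 18, 2024 · Updated last year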
WALLE-AI / uReasoningLLMs
An accessible introduction to reproducing Deepseek-r1, with a collection of resources
☆22 · Mar 5, 2025 · Updated last year
opencamp-cn / Rustlings
☆20 · Jan 16, 2025 · Updated last year
foreveryh / langgraph-deep-research
☆249 · Jun 6, 2025 · Updated 9 months ago
cangtianhuang / BIT-compiler
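Course project for the Beijing Institute of Technology "Compiler Principles and Design" course: a simple C compiler (x86 architecture) written in Java that supports most C syntax.
☆119 · Mar 4, 2025 · Updated last year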
ademeure / DeeperGEMM
DeeperGEMM: crazy optimized version
☆74 · May 5, 2025 · Updated 10 months ago
hydropix / TranslateBooksWithLLMs
Translate full-length books and documents with Ollama, OpenAI (compatible), Gemini, Mistral, Poe or OpenRouter. Preserves formatting. Re…
☆516 · Feb 26, 2026 · Updated last week
hetbhalani / Neural_Network_from_Scratch
This is a repository about building a neural network from scratch using PURE MATHS!
☆37 · Jun 22, 2025 · Updated 8 months ago
fagenorn / handcrafted-persona-engine
An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applica…
☆1,011 · Oct 28, 2025 · Updated 4 months ago
deri-protocol / deriprotocol-v2
deriprotocol-v2
☆10 · Nov 1, 2021 · Updated 4 years ago
shakfu / cyllama
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
☆16 · Feb 10, 2026 · Updated 3 weeks ago
guoguo1314 / llama3_learn.c
Inference deployment of llama3
☆11 · Apr 21, 2024 · Updated last year
LuJH12 / Weaviate-use
The official tutorial is rather complicated, so the author wrote a simple example
☆10 · Jul 18, 2023 · Updated 2 years ago
yhinai / TensorGPGPU
RISC-V vector and tensor compute extensions for Vortex GPGPU acceleration for ML workloads. Optimized for transformer models, CNNs, and g…
☆21 · Apr 25, 2025 · Updated 10 months ago
yu-yake2002 / ysyx-docker
A docker image for One Student One Chip's debug exam
☆10 · Sep 22, 2023 · Updated 2 years ago
yxzwang / FamilyTool
FamilyTool benchmark
☆12 · Sep 10, 2025 · Updated 6 months ago