hellangleZ/Qwen3_autothink_adapter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hellangleZ/Qwen3_autothink_adapter)

hellangleZ / Qwen3_autothink_adapter

Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The inference framework can be sglang, or it can be adapted/modified to use vLLM

☆22

Alternatives and similar repositories for Qwen3_autothink_adapter

Users that are interested in Qwen3_autothink_adapter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AaronFeng753 / Better-Qwen3
View on GitHub
Auto Thinking Mode switch for Qwen3 in Open webui
☆72May 8, 2025Updated last year
CogComp / Salient-Event-Detection
View on GitHub
The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"
☆10Jul 5, 2022Updated 4 years ago
MoonshotAI / walle
View on GitHub
☆25Jun 29, 2026Updated 3 weeks ago
LuckyyySTA / Fine-grained-Attribution
View on GitHub
[ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models
☆21Oct 24, 2024Updated last year
lutongyv / Textin_Tester
View on GitHub
如需体验textin文档解析，请点击https://cc.co/16YSIy
☆21Jul 9, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hellangleZ / BY_RAG_V2
View on GitHub
support BM25+vecetor
☆27May 26, 2025Updated last year
hellangleZ / Agent-MemoryForge
View on GitHub
Production-grade memory layer for AI agents with durable multi-tenant memory, semantic recall, async distillation, and SDK/Gateway integr…
☆177Jun 14, 2026Updated last month
FeiSun / LaTeX-Drawing
View on GitHub
LaTeX Drawing
☆18Dec 22, 2025Updated 7 months ago
xpan413 / FSMoE
View on GitHub
☆16Jan 14, 2025Updated last year
DWCTOD / DeepLearning-Weekly
View on GitHub
Collect the most interesting deep learning（CV） applications, papers, and code for everyone！
☆10Jan 8, 2021Updated 5 years ago
asaddi / f5-tts-serve
View on GitHub
A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…
☆14Feb 7, 2025Updated last year
typoverflow / pytorch-crf
View on GitHub
条件随机场（CRF）的pytorch实现
☆10Mar 7, 2021Updated 5 years ago
Adlik / smoothquantplus
View on GitHub
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
☆23Mar 15, 2024Updated 2 years ago
garipovroma / autojudge
View on GitHub
[NeurIPS 2025] Official PyTorch implementation for the paper AutoJudge: Judge Decoding Without Manual Annotation
☆21Dec 22, 2025Updated 6 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
h9-tec / Qwen3_chat_local
View on GitHub
☆10Apr 30, 2025Updated last year
ustc-ai4science / PaperArena
View on GitHub
Official implement of the paper: 🤺 PaperArena: An Evaluation Benchmark for Tool-Augmented Agentic Reasoning on Scientific Literature
☆17Nov 3, 2025Updated 8 months ago
Eddie-Wang1120 / Eddie-Wang-Hackathon2023
View on GitHub
Whisper inference with TensorRT-LLM
☆25Sep 22, 2023Updated 2 years ago
riddle911 / SuperInsights
View on GitHub
☆67Sep 18, 2024Updated last year
SuDIS-ZJU / llm-inference-all-in-one
View on GitHub
☆19Feb 18, 2025Updated last year
Complicateddd / PaddlePL
View on GitHub
山东省第二届数据应用创新创业大赛-主赛场-检验报告单识别-Baseline
☆13Jan 15, 2021Updated 5 years ago
maxiee / RaySystem
View on GitHub
RaySystem 是 Maeiee 为自己量身打造的个人系统项目。
☆60May 26, 2025Updated last year
aiha-lab / MX-QLLM
View on GitHub
LLM Inference with Microscaling Format
☆35Nov 12, 2024Updated last year
moinnadeem / CDSSM
View on GitHub
Convolutional Deep Semantic Similarity Model
☆20Feb 15, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
XiaMooo / yolov8-visualizations
View on GitHub
Yolov8-visualizations
☆11Mar 10, 2023Updated 3 years ago
CodeDuoGun / deepseek_lora
View on GitHub
基于deepseek、qwen3大模型，lora sft 医疗行业数据
☆15Apr 10, 2026Updated 3 months ago
Paul33333 / Agentic_RAG
View on GitHub
Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API
☆17Jun 21, 2025Updated last year
deletexiumu / screen-analyzer
View on GitHub
🖥️ 基于 Tauri + Rust 的 AI 屏幕活动分析器。自动截屏记录、LLM 智能分析、生成时间线视频。灵感来自 Dayflow。100% AI 编程实现（Claude Code + Codex），零人工代码。隐私优先，数据完全本地化。
☆42Nov 18, 2025Updated 8 months ago
KRLabsOrg / LettuceDetect
View on GitHub
Span-level grounding verification for RAG, code, and tool-grounded AI outputs.
☆586Jul 15, 2026Updated last week
czy1999 / MultiTQ
View on GitHub
MULTITQ is a large-scale dataset featuring ample relevant facts and multiple temporal granularities.
☆27Apr 28, 2026Updated 2 months ago
jeasonstudio / USTB_FPGA_ExperimentReport_Pros
View on GitHub
北京科技大学数字逻辑 FPGA 实验报告及项目文件
☆27Jun 4, 2017Updated 9 years ago
gitkaz / mlx_gguf_server
View on GitHub
This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.
☆18Apr 8, 2026Updated 3 months ago
vac-architector / VAC-Memory-System
View on GitHub
From cell-tower climber & handyman to AI Architect in 4.5 months via Claude CLI. Built VAC Memory System: SOTA RAG (80.1% LoCoMo) on gpt-…
☆33Dec 9, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MobiSense / SpecOffload-public
View on GitHub
☆29Feb 3, 2026Updated 5 months ago
nambo / menu-rag
View on GitHub
Beyond Basic RAG, Empowering Real-Time Deep Research
☆20Sep 12, 2025Updated 10 months ago
jackfsuia / LLM-Data-Cleaner
View on GitHub
用大模型批量处理数据，现支持各种大模型做OCR，支持通义千问, 月之暗面, 百度飞桨OCR, OpenAI 和LLAVA。Use LLM to generate or clean data for academic use. Support OCR with qwen, m…
☆17Sep 15, 2024Updated last year
providence-replay / providence
View on GitHub
An open-source session replay tool for single-page applications that uses AI analysis, aggregated trends, and a RAG chatbot to help devel…
☆11Jan 23, 2026Updated 5 months ago
zhisbug / ray-scalable-ml-design
View on GitHub
Some microbenchmarks and design docs before commencement
☆11Feb 1, 2021Updated 5 years ago
vivekvar-dl / GSPO-DeepSeek-R1-Distill-Qwen-1.5B
View on GitHub
☆18Mar 15, 2026Updated 4 months ago
KunihiroS / google-patents-mcp
View on GitHub
☆37Aug 25, 2025Updated 10 months ago