Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The inference framework can be sglang, or it can be adapted/modified to use vLLM
☆22May 9, 2025Updated last year
Alternatives and similar repositories for Qwen3_autothink_adapter
Users that are interested in Qwen3_autothink_adapter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models☆19Oct 24, 2024Updated last year
- support BM25+vecetor☆27May 26, 2025Updated last year
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Jul 9, 2024Updated last year
- Regex Engine using SIMD and Roaring-Bitmaps☆11Dec 26, 2022Updated 3 years ago
- ☆15Jun 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Mermaid AI Diagram Generator☆41May 14, 2025Updated last year
- ☆19Feb 18, 2025Updated last year
- Demo app with Loguru logging, async middleware to generate X-request-Id. Works with Gunicorn or Uvicorn, and is safe to use with async/th…☆10Feb 2, 2022Updated 4 years ago
- 中文公开聊天语料库☆11Nov 5, 2018Updated 7 years ago
- ☆25Aug 29, 2025Updated 9 months ago
- 条件随机场(CRF)的pytorch实现☆10Mar 7, 2021Updated 5 years ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Feb 7, 2025Updated last year
- This is a FastAPI based LLM server. Load multiple LLM models (MLX or llama.cpp) simultaneously using multiprocessing.☆17Apr 8, 2026Updated 2 months ago
- 100行解决中文模糊实体识别with字典树和编辑距离 Chinese fuzzy entity matching with prefix tree and distance editing☆11Sep 25, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆67Sep 18, 2024Updated last year
- interprocess communication between Python and kdb+☆14May 25, 2026Updated 2 weeks ago
- ☆30Jul 22, 2024Updated last year
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆32Nov 16, 2024Updated last year
- Convolutional Deep Semantic Similarity Model☆20Feb 15, 2023Updated 3 years ago
- Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API☆17Jun 21, 2025Updated 11 months ago
- Lightweight hallucination detection framework for RAG applications☆577Jun 2, 2026Updated last week
- ☆18May 1, 2023Updated 3 years ago
- Demo of knowledge graph creation and Graph RAG with Dspy and Kuzu☆22Jun 30, 2025Updated 11 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Yolov8-visualizations☆11Mar 10, 2023Updated 3 years ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆43Feb 27, 2025Updated last year
- Official repository for the review article "Modeling protein–ligand interactions for drug discovery in the era of deep learning."☆42Mar 25, 2026Updated 2 months ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated 2 years ago
- 用大模型批量处理数据,现支持各种大模型做OCR,支持通义千问, 月之暗面, 百度飞桨OCR, OpenAI 和LLAVA。Use LLM to generate or clean data for academic use. Support OCR with qwen, m…☆17Sep 15, 2024Updated last year
- VenomPred 2.0 API☆11Feb 4, 2026Updated 4 months ago
- Claude Code 官网已泄露代码备份☆125Mar 31, 2026Updated 2 months ago
- Some microbenchmarks and design docs before commencement☆11Feb 1, 2021Updated 5 years ago
- Pipeline-Parallel Lecture: Simplest Dualpipe Implementation.☆31Sep 17, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 一个用于训练句子embedding的工具,支持Cosent以及Simcse、infonce☆23Jun 17, 2025Updated 11 months ago
- BERT 代码中文注释☆18Mar 8, 2019Updated 7 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- Do some checks every day, so that you can read the "news" in the morning while drinking coffee.☆17Feb 27, 2024Updated 2 years ago
- LLaMA: Open and Efficient Foundation Language Models☆19Apr 21, 2023Updated 3 years ago
- Longitudinal Evaluation of LLMs via Data Compression☆33May 29, 2024Updated 2 years ago
- This tool can easily make / build an emr cluster edge node / client node / gateway node☆10Jun 1, 2022Updated 4 years ago