hellangleZ / Qwen3_autothink_adapter
A script that automatically switches Qwen3 between its reasoning (thinking) and non-reasoning modes through an OpenAI-compatible API. The inference framework can be sglang, or the script can be adapted to use vLLM.
☆22Updated 3 months ago
Alternatives and similar repositories for Qwen3_autothink_adapter
Users who are interested in Qwen3_autothink_adapter are comparing it to the libraries listed below
- Supports BM25 + vector☆29Updated 2 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆68Updated 3 months ago
- Evaluation for AI apps and agents☆43Updated last year
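- To try textin document parsing, visit https://cc.co/16YSIy☆22Updated last year
- The simplest reproduction of R1 results on a small model, illustrating the most important essence of O1-like models and DeepSeek R1. "Think is all your need". Uses experiments to support the claim that, for strong reasoning ability, the process content of thinking is the core of AGI/ASI.☆43Updated 6 months ago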
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆24Updated 9 months ago
- MCP DeepResearch Server: a deep-research server based on LangGraph + Ollama + Tavily, with support for asynchronous execution, timeout control, and progress push notifications☆29Updated 2 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 6 months ago
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆156Updated this week
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆85Updated 7 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆53Updated last month
- A lightweight script for converting HTML pages to Markdown format, with support for code blocks☆79Updated last year
- ☆94Updated 8 months ago
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆112Updated 2 weeks ago
- ☆87Updated last month
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆46Updated 2 months ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆53Updated 2 months ago
- ☆16Updated last month
- diagnosis_zero, an R1-Zero reproduction on disease diagnosis☆30Updated last month
- 🔥 AgentScale: A Scalable Microservices-based Agent Orchestration Framework☆27Updated last year
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆140Updated 2 months ago
- ASTChunk is a Python toolkit for code chunking using Abstract Syntax Trees (ASTs), designed to create structurally sound and meaningful c…☆65Updated last month
- Query Expansion for Better Query Embedding using LLMs☆56Updated 6 months ago
- Fast LLM training codebase with dynamic strategy selection [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler]☆41Updated last year
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆44Updated 7 months ago
- Countdown Game Distill&RL☆46Updated 4 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆111Updated 6 months ago
- ☆54Updated 5 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆98Updated 6 months ago
- Submodular optimization for context engineering: query fan-out, text selection, passage reranking☆68Updated last month