hellangleZ / Qwen3_autothink_adapter
A script that automatically switches Qwen3 between its reasoning and non-reasoning modes, based on an OpenAI-like API. The inference framework can be SGLang, or the script can be adapted to use vLLM.
☆22Updated 9 months ago
Alternatives and similar repositories for Qwen3_autothink_adapter
Users that are interested in Qwen3_autothink_adapter are comparing it to the libraries listed below
- To try TextIn document parsing, visit https://cc.co/16YSIy☆22Updated last year
- support BM25+vector☆29Updated 8 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆33Updated last year
- Auto Thinking Mode switch for Qwen3 in Open WebUI☆70Updated 9 months ago
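- The simplest reproduction of R1 results on a small model, explaining the most important essence of O1-like models and DeepSeek R1. Think is all your need. Experiments support the claim that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI.☆45Updated last year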
- ☆19Updated 6 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆148Updated 8 months ago
- Evaluation for AI apps and agent☆44Updated 2 years ago
- ☆96Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆62Updated 7 months ago
- MCP DeepResearch Server: a deep research server built on LangGraph + Ollama + Tavily, with support for asynchronous execution, timeout control, and progress push notifications☆31Updated 7 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆41Updated 10 months ago
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆23Updated last year
- A lightweight script for processing HTML page to markdown format with support for code blocks☆82Updated last year
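- HearSight is an intelligent audio and video content analysis tool. It supports importing video from multiple sources (by URL or file upload) and extracts contextual information from the input video to provide more accurate AI question answering. The platform intelligently segments video by semantic unit, and users can flexibly adjust the segmentation granularity through Q&A to quickly locate the content they need. HearSight also supports automatic…☆32Updated last month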
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated last year
- diagnosis_zero, R1 Zero reproduce on disease diagnosis☆34Updated 6 months ago
- ☆21Updated 7 months ago
- Submodular optimization for context engineering: query fan-out, text selection, passage reranking☆78Updated 6 months ago
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆46Updated 8 months ago
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆133Updated 6 months ago
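- agentcp is an Agent SDK based on the ACP protocol that handles identity authentication and communication between agents. It is used to create AIDs, connect to the network, build sessions, and send and receive messages. It supports multi-agent collaboration, asynchronous message processing, NAT traversal, and load balancing for agent access.☆19Updated 7 months ago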
- REFRAG-style RAG (compress → sense/select → expand) — Single-file reference implementation☆208Updated last month
- ☆31Updated last year
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22Updated 8 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆260Updated 9 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆40Updated 2 years ago
- Countdown Game Distill&RL☆47Updated 5 months ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆58Updated 8 months ago
- Qwen GRPO Graph Extraction RL Finetune☆60Updated 10 months ago