hellangleZ / Qwen3_autothink_adapterLinks
Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The inference framework can be sglang, or it can be adapted/modified to use vLLM
☆22Updated 6 months ago
Alternatives and similar repositories for Qwen3_autothink_adapter
Users that are interested in Qwen3_autothink_adapter are comparing it to the libraries listed below
Sorting:
- support BM25+vecetor☆29Updated 6 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆70Updated 6 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 9 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 9 months ago
- Evaluation for AI apps and agent☆43Updated last year
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆23Updated last year
- ☆95Updated 11 months ago
- diagnosis_zero, R1 Zero reproduce on disease diagnosis☆34Updated 4 months ago
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆126Updated 3 months ago
- REFRAG-style RAG (compress → sense/select → expand) — Single-file reference implementation☆170Updated 2 months ago
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆146Updated 6 months ago
- ☆17Updated 4 months ago
- HearSight智能视频内容分析工具,支持多源视频(包括 URL和上传文件方式)导入能够从输入的视频源中提取上下文信息,从而提供更精准的 AI问答交互。平台基于视频语义单元进行智能切片,用户可通过问答方式灵活调整切片维度,快速定位所需内容同时,HearSight支持自动生…☆26Updated last week
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆212Updated 3 weeks ago
- agentcp是一个基于ACP协议的Agent sdk,用于解决Agent间的身份认证及通信问题;用于创建AID、连接入网、构建会话,收发消息等;支持多Agent协作,异步消息处理,支持内网穿透,支持Agent访问的负载均衡☆18Updated 4 months ago
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆32Updated 5 months ago
- 大模型智能体Agent中文教程,博客代码仓库☆50Updated last month
- Countdown Game Distill&RL☆47Updated 3 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆81Updated last year
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆41Updated 8 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 5 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Updated last year
- Qwen GRPO Graph Extraction RL Finetune☆57Updated 8 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆88Updated 10 months ago
- ☆54Updated 9 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 10 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Updated last year
- ☆19Updated last year
- ☆19Updated 5 months ago