hellangleZ / Qwen3_autothink_adapter
A script that automatically switches Qwen3 between its reasoning (thinking) and non-reasoning modes through an OpenAI-like API. The inference framework can be sglang, or the script can be adapted to use vLLM.
☆22Updated 4 months ago
Alternatives and similar repositories for Qwen3_autothink_adapter
Users that are interested in Qwen3_autothink_adapter are comparing it to the libraries listed below
- Supports BM25 + vector☆29Updated 3 months ago
- To try textin document parsing, visit https://cc.co/16YSIy☆22Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 7 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui☆67Updated 4 months ago
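- The simplest reproduction of R1 results on a small model, illustrating the essential idea behind O1-like models and DeepSeek R1. Think is all you need. Experiments support the claim that, for strong reasoning ability, the thinking-process content is the core of AGI/ASI.☆44Updated 7 months ago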
- Evaluation for AI apps and agents☆43Updated last year
- DeepSearch Code-Actions Agent (DSCA). Build 🙌 with 🤗 smolagents☆113Updated last month
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆171Updated last week
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆56Updated 2 months ago
- A lightweight script for converting HTML pages to Markdown, with support for code blocks☆80Updated last year
- ☆93Updated last month
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆23Updated 10 months ago
- Chinese-language tutorial on LLM agents; code repository for the accompanying blog☆35Updated last week
- Countdown Game Distill&RL☆47Updated last week
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆145Updated 3 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆85Updated 7 months ago
- Using APPL to reimplement popular algorithms for Large Language Models (LLMs) and prompts☆45Updated 8 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆38Updated 5 months ago
- ☆94Updated 9 months ago
- Submodular optimization for context engineering: query fan-out, text selection, passage reranking☆70Updated 2 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated 11 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Updated last year
- Deep Reasoning Translation (DRT) Project☆230Updated last week
- Qwen GRPO Graph Extraction RL Finetune☆55Updated 5 months ago
- diagnosis_zero, an R1 Zero reproduction on disease diagnosis☆31Updated last month
- ☆16Updated last month
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆53Updated 3 months ago
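- agentcp is an Agent SDK based on the ACP protocol that handles identity authentication and communication between agents. It is used to create AIDs, connect to the network, set up sessions, and send and receive messages. It supports multi-agent collaboration, asynchronous message processing, NAT traversal, and load balancing for agent access.☆13Updated 2 months ago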
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆112Updated 7 months ago
- Open replication of DeepSeek R1 for text-to-graph extraction.☆98Updated 7 months ago