fuyuantan / qwen3-finetuneLinks
Finetune and Inference Qwen3-0.6B.
☆15Updated last month
Alternatives and similar repositories for qwen3-finetune
Users that are interested in qwen3-finetune are comparing it to the libraries listed below
Sorting:
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆20Updated last month
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆18Updated last week
- Code for Robust Fine-tuning (RbFT)☆12Updated 4 months ago
- A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using …☆18Updated 2 months ago
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated last year
- ☆16Updated 11 months ago
- ☆10Updated last month
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆11Updated 10 months ago
- LLM RAG 应用,支持 API 调用,语音交互。☆11Updated last year
- 基于大模型生成内容的智能语音对讲☆10Updated 7 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆15Updated 4 months ago
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆23Updated 7 months ago
- An open source implementation of R1☆26Updated this week
- 通义千问的DPO训练☆49Updated 9 months ago
- Self Supervised Prompt Optimization.☆12Updated last month
- 🚀 轻量视频🎥 大模型🤖☆16Updated 2 months ago
- deepseek思维树模式实现☆15Updated 4 months ago
- An open-source toolkit helping developers build natural language database query solutions☆16Updated last month
- ☆12Updated 3 months ago
- [ACL 2025] An official pytorch implement of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆30Updated 3 weeks ago
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆31Updated 11 months ago
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation☆22Updated 3 weeks ago
- 🤖 基于AutoGen的AI辩论系统 | 🗣️ 支持中文交互 | 🔄 多智能体协作 | 📝 自动记录辩论过程 🤖 AI Debate System based on AutoGen | 🗣️ Chinese Interaction | 🔄 Multi-Age…☆14Updated 7 months ago
- the newest version of llama3,source code explained line by line using Chinese☆22Updated last year
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Updated 11 months ago
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆35Updated 2 months ago
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆21Updated 2 months ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆38Updated 5 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated last month
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆34Updated 3 months ago