fuyuantan / qwen3-finetune
Fine-tuning and inference for Qwen3-0.6B.
☆14Updated last month
Alternatives and similar repositories for qwen3-finetune
Users that are interested in qwen3-finetune are comparing it to the libraries listed below
- 🤖 AI debate system based on AutoGen | 🗣️ Chinese-language interaction | 🔄 Multi-agent collaboration | 📝 Automatic recording of the debate process☆13Updated 7 months ago
- An LLM RAG application with support for API calls and voice interaction.☆11Updated 11 months ago
- The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…☆33Updated last month
- An open-source toolkit helping developers build natural language database query solutions☆14Updated last month
- Intelligent voice intercom based on LLM-generated content☆10Updated 6 months ago
- 🚀 Lightweight video 🎥 large model 🤖☆14Updated last month
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆59Updated last week
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
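- A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO☆17Updated last year
- A very simple reproduction of Deepseek-R1-Zero and Deepseek-R1, using the "24 game" as an example. Uses zero-RL, SFT, and SFT+RL to elicit the LLM's ability to verify and reflect on its own. Clean, minimal, accessible reproduction of Dee…☆19Updated 2 months ago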
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆20Updated 3 weeks ago
- DPO training for Tongyi Qianwen (Qwen)☆48Updated 8 months ago
- Centralized management of all prompts.☆14Updated 6 months ago
- Self Supervised Prompt Optimization.☆12Updated 3 weeks ago
- Code for Robust Fine-tuning (RbFT)☆12Updated 4 months ago
- PodAgent: A Comprehensive Framework for Podcast Generation☆87Updated 3 weeks ago
- ☆10Updated 2 weeks ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Updated 3 weeks ago
- ☆17Updated 3 months ago
- Local knowledge-base querying and an Agent, built on GLM4-Chat☆7Updated 11 months ago
- ☆12Updated 2 months ago
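- Copies the MLP of llama3 8 times as 8 experts, creates a router with random initialization, and adds a load balancing loss to construct an 8x8b Mo…☆27Updated 11 months ago
- The simplest reproduction of R1 results on small models, explaining the most important essence of O1-like models and DeepSeek R1. Think is all you need. Experiments support the claim that, for strong reasoning ability, the process content of thinking is the core of AGI/ASI.☆45Updated 3 months ago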
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆32Updated 2 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆65Updated 2 months ago
- A travel agent based on Qwen2.5, fine-tuned with SFT + DPO/PPO/GRPO on a travel question-answer dataset; a mind map can be output using …☆15Updated last month
- ☆41Updated 2 months ago
- (ICLR 2025) AgentRefine: Enhancing Agent Generalization through Refinement Tuning☆13Updated 3 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆95Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆76Updated 2 weeks ago