yongzhuo / Qwen-SFT
Alibaba Tongyi Qianwen (Qwen-7B-Chat/Qwen-7B): fine-tuning, LoRA, and inference
☆138 · May 17, 2024 · Updated last year
Alternatives and similar repositories for Qwen-SFT
Users who are interested in Qwen-SFT are comparing it to the repositories listed below
- ☆20 · Dec 27, 2025 · Updated last month
- The official implementation of the paper "Drop-Activation: Implicit Parameter Reduction and Harmonious Regularization". ☆10 · May 30, 2019 · Updated 6 years ago
- A pure C++ cross-platform LLM acceleration library, callable from Python; supports chatglm-6B, llama, baichuan, and moss base models on x86 / ARM. ☆12 · Jan 30, 2026 · Updated 2 weeks ago
- This plugin provides tools to extract text from a document using the Azure AI Document Intelligence service. ☆12 · Jan 17, 2025 · Updated last year
- WWW'24, Mirror Gradient (MG) makes multimodal recommendation models approach flat local minima easier compared to models with normal trai… ☆17 · Nov 1, 2024 · Updated last year
- A collection of fine-tuning tools for large models. ☆26 · Mar 15, 2024 · Updated last year
- A Python implementation of the Sequential Thinking MCP server using the official Model Context Protocol (MCP) Python SDK. This server fac… ☆24 · Jun 1, 2025 · Updated 8 months ago
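- Integrates advanced large language models such as Qwen and DeepSeek; supports a pure LLM + classification layer mode and an LLM + LoRA + classification layer mode. Uses transformers for modular design and training, so components can be adjusted or replaced as needed. ☆19 · Sep 1, 2025 · Updated 5 months ago
- An e-commerce large model from the Qwen2.5 series, fine-tuned (SFT) on e-commerce data. It is an upgraded version of https://github.com/leeguandong/EcommerceLLM. Qwen2.5 performs well. ☆13 · Oct 4, 2024 · Updated last year
- Fine-tunes Alibaba's open-source text detection model, using OCR results returned by 合合 (Hehe) recognition as the initial training data, so that the model better fits a specific scenario of 10,000 images and recognizes text more accurately. ☆10 · Dec 9, 2024 · Updated last year
- Data preprocessing for qwen fine-tuned models. ☆13 · Jan 8, 2024 · Updated 2 years ago
- Mojuan: Write your own AI application. ☆15 · Jul 12, 2024 · Updated last year
- Fine-tuning of an insurance-knowledge large model based on internlm-chat-7b. ☆20 · Apr 26, 2024 · Updated last year
- Qwen1.5-SFT (Alibaba), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat: fine-tuning (transformers), LoRA (peft), and inference. ☆73 · May 17, 2024 · Updated last year
- A Qwen-VL model that can be successfully fine-tuned with LoRA. ☆16 · Oct 27, 2023 · Updated 2 years ago
- Learning large models, from model deployment to model fine-tuning. This project combines training-camp projects with the author's own understanding and summary after the camp, plus innovative applications. Feel free to star/fork. ☆21 · Mar 26, 2024 · Updated last year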
- ☆11 · Updated this week
- Simple decoder-only GPT model in PyTorch. ☆43 · May 19, 2024 · Updated last year
- ☆19 · Oct 9, 2024 · Updated last year
- Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and … ☆18 · Apr 12, 2024 · Updated last year
- ☆20 · Oct 7, 2025 · Updated 4 months ago
- Instruction fine-tuning of Qwen-1.5-7b with ModelScope + Transformers + SwanLab. ☆23 · Jun 9, 2024 · Updated last year
- Regularization Matters in Policy Optimization. ☆21 · Nov 1, 2021 · Updated 4 years ago
- 3D simulation in Jupyter. ☆22 · Sep 17, 2025 · Updated 5 months ago
- Synthetic data generation for TODs. ☆23 · Jul 17, 2024 · Updated last year
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell. ☆19 · Oct 16, 2015 · Updated 10 years ago
- Fine-Tuning LLM and embedding models. ☆27 · Sep 12, 2023 · Updated 2 years ago
- Probably one of the lightest native RAG + Agent apps out there, experience the power of Agent-powered models and Agent-driven knowledge ba… ☆32 · May 30, 2025 · Updated 8 months ago
- vanna.ai demo. ☆31 · May 1, 2024 · Updated last year
- When can you tell whether an image has been cropped or not? ☆29 · Sep 19, 2021 · Updated 4 years ago
- A simple WeChat Official Account layout tool based on Dify. ☆16 · Jun 27, 2025 · Updated 7 months ago
- Free ChatGPT API Key: a free ChatGPT API with GPT4 API support; a free forwarding API for ChatGPT usable in mainland China, with direct connection and no proxy required. ☆13 · Aug 28, 2024 · Updated last year
- The official implementation of two AI-enhanced numerical solvers: NeurVec (Sci. Rep.) and AttNS (ICML'24). ☆27 · May 21, 2024 · Updated last year
- An implementation of the PPO algorithm. ☆39 · Jun 5, 2024 · Updated last year
- [WIP] Better (FP8) attention for Hopper. ☆32 · Feb 24, 2025 · Updated 11 months ago
- The final project of the Advanced Machine Learning course at Tsinghua University. This project aims to make a color transfer of animes charact… ☆31 · Dec 21, 2020 · Updated 5 years ago
- A Model Context Protocol server for Dify. ☆41 · Feb 6, 2025 · Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs. ☆12 · Nov 14, 2025 · Updated 3 months ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte… ☆25 · Jan 6, 2026 · Updated last month