Alibaba Tongyi Qianwen (Qwen-7B-Chat/Qwen-7B): fine-tuning / LoRA / inference
☆140May 17, 2024Updated last year
Alternatives and similar repositories for Qwen-SFT
Users who are interested in Qwen-SFT are comparing it to the libraries listed below.
- Supervised fine-tuning (SFT) of the chatglm3 base model☆79Nov 5, 2023Updated 2 years ago
- A Qwen-VL model that can be successfully fine-tuned with LoRA☆16Oct 27, 2023Updated 2 years ago
- A script for merging an LLM and a LoRA☆13Jun 22, 2023Updated 2 years ago
- Data preprocessing for fine-tuning qwen models☆13Jan 8, 2024Updated 2 years ago
- ☆14Apr 19, 2024Updated last year
- LoRA SFT on medical-domain data, based on the deepseek and qwen3 large models☆15Updated this week
- Fine-tuning of an insurance-knowledge large model based on internlm-chat-7b☆20Apr 26, 2024Updated last year
- WWW'24, Mirror Gradient (MG) makes multimodal recommendation models approach flat local minima easier compared to models with normal trai…☆17Nov 1, 2024Updated last year
- Qwen1.5-SFT (Alibaba), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers) / LoRA (peft) / inference☆73May 17, 2024Updated last year
- LibreOJ Problem download tool☆21Oct 6, 2024Updated last year
- A free program with a user-friendly interface that allows you to download Office 365, 2024, 2021, 2019, 2016 as well as Project and Visio☆30Sep 29, 2025Updated 6 months ago
- ☆22May 7, 2025Updated 11 months ago
- A Python implementation of the Sequential Thinking MCP server using the official Model Context Protocol (MCP) Python SDK. This server fac…☆24Jun 1, 2025Updated 10 months ago
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- Chinese LLM fine-tuning (LLM-SFT), math instruction dataset MWP-Instruct, supported models (ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), supports (LoRA, QLoRA, DeepSpeed, UI, TensorboardX), supports (微…☆217May 17, 2024Updated last year
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- Qwen-Efficient-Tuning☆44Aug 16, 2023Updated 2 years ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆14Dec 17, 2023Updated 2 years ago
- [ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…☆11Apr 4, 2026Updated last week
- ☆14Dec 18, 2024Updated last year
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Oct 16, 2015Updated 10 years ago
- LoRA fine-tuning of deepseek on the medical-o1-sft dataset using simple code☆16Feb 25, 2025Updated last year
- FreeSwitch PHP API and Angular JS Mod_Callcenter Panel☆17May 17, 2014Updated 11 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆24Nov 1, 2025Updated 5 months ago
- Graph QABot Demo | Knowledge graph question-answering example☆15Apr 11, 2023Updated 3 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- Google extension: tab organizer☆11Apr 2, 2024Updated 2 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Jun 7, 2024Updated last year
- I trained a 130M Llama architecture that I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆17Mar 26, 2025Updated last year
- How fastapi async IO + threadpool (thread pool) works☆18Feb 12, 2024Updated 2 years ago
- This is the source code of our paper PALT in EMNLP2022.☆13Nov 19, 2022Updated 3 years ago
- ☆13Feb 1, 2024Updated 2 years ago
- Scraping JD.com product review data☆17Jul 2, 2025Updated 9 months ago
- ☆13Sep 26, 2025Updated 6 months ago
- ☆17Feb 16, 2025Updated last year
- ☆14Feb 26, 2024Updated 2 years ago
- code for EGSS☆12Dec 10, 2024Updated last year
- ☆10Mar 6, 2024Updated 2 years ago
- Using the Qwen-2.5 model for text classification (lora)☆24May 7, 2025Updated 11 months ago