Alibaba Tongyi Qianwen (Qwen-7B-Chat/Qwen-7B): fine-tuning, LoRA, inference
☆142 May 17, 2024 Updated 2 years ago
Alternatives and similar repositories for Qwen-SFT
Users that are interested in Qwen-SFT are comparing it to the libraries listed below.
- Supervised fine-tuning (SFT) of the chatglm3 base model ☆80 Nov 5, 2023 Updated 2 years ago
- A Qwen-VL model that can be successfully fine-tuned with LoRA ☆16 Oct 27, 2023 Updated 2 years ago
- HMM\CRF\BERT-CRF\BILSTM-CRF\BERTBILSTMCRF\XLNETBILSTMCRF ☆33 Jul 30, 2022 Updated 3 years ago
- LoRA SFT on medical-industry data, based on the deepseek and qwen3 large models ☆15 Apr 10, 2026 Updated 2 months ago
- Multi-hop Question Generation with Graph Convolutional Network ☆30 Nov 2, 2022 Updated 3 years ago
- Fine-tuning an insurance-knowledge large model based on internlm-chat-7b ☆20 Apr 26, 2024 Updated 2 years ago
- A collection of foundational LLM-related projects: Pretrain, Posttrain, RAG, Agent, and more ☆40 Dec 7, 2025 Updated 6 months ago
- HEtero-Assists Distillation for Heterogeneous Object Detectors ☆10 Jul 3, 2023 Updated 2 years ago
- Qwen1.5-SFT (Alibaba): Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat fine-tuning (transformers), LoRA (peft), inference ☆73 May 17, 2024 Updated 2 years ago
- A free program with a user-friendly interface that allows you to download Office 365, 2024, 2021, 2019, 2016 as well as Project and Visio ☆36 Sep 29, 2025 Updated 8 months ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie… ☆11 Feb 7, 2026 Updated 4 months ago
- 2024 Financial Industry Large Model Challenge: solution by team 人生海海 ☆24 May 31, 2025 Updated last year
- A Python implementation of the Sequential Thinking MCP server using the official Model Context Protocol (MCP) Python SDK. This server fac… ☆25 Jun 1, 2025 Updated last year
- Chinese LLM fine-tuning (LLM-SFT), math instruction dataset MWP-Instruct, supported models (ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), supports (LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微… ☆217 May 17, 2024 Updated 2 years ago
- Use the tokenizer in parallel to achieve superior acceleration ☆20 Mar 21, 2024 Updated 2 years ago
- A QA chatbot over local documents, built with an LLM and LangChain ☆35 Aug 13, 2023 Updated 2 years ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model ☆13 Dec 17, 2023 Updated 2 years ago
- 🚀 A curated collection of papers focusing on LLM-based quantitative trading. ☆126 May 25, 2026 Updated 3 weeks ago
- Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models" ☆12 Oct 15, 2024 Updated last year
- ☆14 Dec 18, 2024 Updated last year
- LoRA fine-tuning of deepseek on the medical-o1-sft dataset using simple code ☆17 Feb 25, 2025 Updated last year
- A repo for LLM jailbreak ☆14 Sep 5, 2023 Updated 2 years ago
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell ☆19 Oct 16, 2015 Updated 10 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning" ☆24 Nov 1, 2025 Updated 7 months ago
- simple decoder-only GPT model in pytorch ☆44 May 19, 2024 Updated 2 years ago
- A WeChat auto-chat bot driven by mouse and keyboard operations ☆13 Nov 26, 2024 Updated last year
- E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition ☆25 Jul 18, 2023 Updated 2 years ago
- Reproducing R1 for Code with Reliable Rewards ☆12 Apr 9, 2025 Updated last year
- This is the source code of our paper PALT in EMNLP2022. ☆13 Nov 19, 2022 Updated 3 years ago
- Official Implementation of CAPEAM (ICCV'23) ☆16 Nov 30, 2024 Updated last year
- How fastapi's async IO + threadpool works ☆18 Feb 12, 2024 Updated 2 years ago
- ☆13 Feb 1, 2024 Updated 2 years ago
- smplify code for point cloud based HMR ☆10 Jan 11, 2022 Updated 4 years ago
- ☆13 Sep 26, 2025 Updated 8 months ago
- ☆21 Feb 16, 2025 Updated last year
- ☆14 Feb 26, 2024 Updated 2 years ago
- code for EGSS ☆12 Dec 10, 2024 Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL… ☆14,496 Updated this week
- A service discovery framework built on ByteX ☆14 Feb 3, 2020 Updated 6 years ago