阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理
☆140May 17, 2024Updated last year
Alternatives and similar repositories for Qwen-SFT
Users that are interested in Qwen-SFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 通义千问 SFT试验☆83Jan 6, 2024Updated 2 years ago
- Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec…☆38Jan 6, 2026Updated 4 months ago
- chatglm3base模型的有监督微调SFT☆80Nov 5, 2023Updated 2 years ago
- 可以成功Lora微调的Qwen-VL模型☆16Oct 27, 2023Updated 2 years ago
- HMM\CRF\BERT-CRF\BILSTM-CRF\BERTBILSTMCRF\XLNETBILSTMCRF☆33Jul 30, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 针对qwen微调模型进行数据预处理☆13Jan 8, 2024Updated 2 years ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆40Jan 13, 2025Updated last year
- ☆14Apr 19, 2024Updated 2 years ago
- 基于deepseek、qwen3大模型,lora sft 医疗行业数据☆15Apr 10, 2026Updated 3 weeks ago
- WWW'24, Mirror Gradient (MG) makes multimodal recommendation models approach flat local minima easier compared to models with normal trai…☆17Nov 1, 2024Updated last year
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆73May 17, 2024Updated last year
- A free program with a user-friendly interface that allows you to download Office 365, 2024, 2021, 2019, 2016 as well as Project and Visio☆32Sep 29, 2025Updated 7 months ago
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated 3 months ago
- A freeswitch esl server for make a callcenter core,ex:ACD,IVR and so on......☆12Sep 26, 2016Updated 9 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"☆10Jul 5, 2022Updated 3 years ago
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆216May 17, 2024Updated last year
- LibreOJ Problem download tool☆22Oct 6, 2024Updated last year
- Code for our paper titled "Lens: Rethinking Multilingual Enhancement for Large Language Models"☆11Oct 15, 2024Updated last year
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Oct 16, 2015Updated 10 years ago
- 利用简单的代码完成deepseek基于medical-o1-sft数据集的lora微调☆16Feb 25, 2025Updated last year
- The official Pytorch implementation of the paper Neural Compositional Rule Learning for Knowledge Graph Reasoning☆36Jul 7, 2023Updated 2 years ago
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition☆25Jul 18, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- fastapi异步IO+threadpool线程池的工作原理☆18Feb 12, 2024Updated 2 years ago
- An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation☆27Jun 7, 2024Updated last year
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- A container to deploy Drupal on Docker within an Alpine container. Contains PHP and Nginx.☆12Mar 24, 2020Updated 6 years ago
- ☆19Feb 16, 2025Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆14,032Updated this week
- code for EGSS☆12Dec 10, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 5 months ago
- Using the Qwen-2.5 model for text classification (lora)☆24May 7, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 构建一个基于大模型的智能客服系统,可提供静态知识问答(静态数据)、动态知识问答(数据库),业务办理(api调用)等功能,同时系统具有自我学习能力。定期的反思可让系统变得更强大。☆92Nov 5, 2025Updated 6 months ago
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated last year
- Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理☆27Jul 26, 2023Updated 2 years ago
- A light-weight python version of moses BLEU.☆13Jan 24, 2019Updated 7 years ago
- Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport (Findings-ACL 2023)☆13May 4, 2023Updated 3 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"☆11Aug 15, 2024Updated last year