阿里通义千问(Qwen-7B-Chat/Qwen-7B), 微调/LORA/推理
☆142May 17, 2024Updated 2 years ago
Alternatives and similar repositories for Qwen-SFT
Users that are interested in Qwen-SFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 通义千问 SFT试验☆83Jan 6, 2024Updated 2 years ago
- chatglm3base模型的有监督微调SFT☆80Nov 5, 2023Updated 2 years ago
- 可以成功Lora微调的Qwen-VL模型☆16Oct 27, 2023Updated 2 years ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆40Jan 13, 2025Updated last year
- ☆14Apr 19, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.☆20Jun 16, 2023Updated 2 years ago
- Multi-hop Question Generation with Graph Convolutional Network☆30Nov 2, 2022Updated 3 years ago
- 基于internlm-chat-7b的保险知识大模型微调☆20Apr 26, 2024Updated 2 years ago
- HEtero-Assists Distillation for Heterogeneous Object Detectors☆10Jul 3, 2023Updated 2 years ago
- Qwen1.5-SFT(阿里, Ali), Qwen_Qwen1.5-2B-Chat/Qwen_Qwen1.5-7B-Chat微调(transformers)/LORA(peft)/推理☆73May 17, 2024Updated 2 years ago
- A Python implementation of the Sequential Thinking MCP server using the official Model Context Protocol (MCP) Python SDK. This server fac…☆24Jun 1, 2025Updated 11 months ago
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆215May 17, 2024Updated 2 years ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Qwen-Efficient-Tuning☆44Aug 16, 2023Updated 2 years ago
- 基于LLM和LangChain实现基于本地文档的QA chatbot☆35Aug 13, 2023Updated 2 years ago
- 本课程主要介绍强化学习的基础知识,其目标是帮助同学们快速、顺利地进入强化学习及其应用领域的研究工作。课程主要内容包含有限马尔可夫决策过程,动态规划,无模型预测与控制(SASA,Q-Learning),价值函数逼近(DQN),策略梯度方法(REINFORCE),执行者/评论者…☆17Oct 17, 2022Updated 3 years ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆14Dec 17, 2023Updated 2 years ago
- [ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…☆11Apr 4, 2026Updated last month
- ☆14Dec 18, 2024Updated last year
- Official implementation of "Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning", ACL2022 main con…☆14Jul 23, 2022Updated 3 years ago
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell☆19Oct 16, 2015Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The official Pytorch implementation of the paper Neural Compositional Rule Learning for Knowledge Graph Reasoning☆36Jul 7, 2023Updated 2 years ago
- A span-based joint named entity recognition (NER) and relation extraction model.☆11Aug 5, 2020Updated 5 years ago
- fastapi异步IO+threadpool线程池的工作原理☆18Feb 12, 2024Updated 2 years ago
- This is the source code of our paper PALT in EMNLP2022.☆13Nov 19, 2022Updated 3 years ago
- ☆13Feb 1, 2024Updated 2 years ago
- smplify code for point cloud based HMR☆10Jan 11, 2022Updated 4 years ago
- ☆13Sep 26, 2025Updated 8 months ago
- ☆14Feb 26, 2024Updated 2 years ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆45Dec 20, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL…☆14,218May 22, 2026Updated last week
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆12Nov 14, 2025Updated 6 months ago
- 构建一个基于大模型的智能客服系统,可提供静态知识问答(静态数据)、动态知识问答(数据库),业务办理(api调用)等功能,同时系统具有自我学习能力。定期的反思可让系统变得更强大。☆94Nov 5, 2025Updated 6 months ago
- 第一名克莱登大学二队方案分享☆18Mar 5, 2021Updated 5 years ago
- Speech detect of freeSwitch. With standard ASR interface of freeSwitch and send voice data via ESL.☆12Apr 8, 2018Updated 8 years ago
- Another ChatGLM2 implementation for GPTQ quantization☆55Oct 15, 2023Updated 2 years ago