通义千问 SFT试验
☆82Jan 6, 2024Updated 2 years ago
Alternatives and similar repositories for qwen-sft
Users that are interested in qwen-sft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RAG向量召回示例☆152Feb 14, 2024Updated 2 years ago
- qwen ai agent☆149Feb 21, 2024Updated 2 years ago
- A simple deep learning framework inspired by Dezero and PyTorch☆31Jan 27, 2025Updated last year
- 通义千问的DPO训练☆64Sep 21, 2024Updated last year
- LLM Tokenizer with BPE algorithm☆48May 7, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆32Oct 22, 2023Updated 2 years ago
- Multimodal Neurons in Artificial Neural Networks☆16Oct 18, 2021Updated 4 years ago
- Independent Multi-Modal Segmentation☆12Jun 12, 2025Updated 10 months ago
- 通义千问VLLM推理部署DEMO☆646Mar 28, 2024Updated 2 years ago
- ☆23Jan 16, 2024Updated 2 years ago
- pytorch复现transformer☆92Jan 18, 2024Updated 2 years ago
- ☆11May 28, 2024Updated last year
- ☆13Apr 10, 2025Updated last year
- [SIGIR'24] The official implementation code of MOELoRA.☆37Aug 3, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- fastapi异步IO+threadpool线程池的工作原理☆18Feb 12, 2024Updated 2 years ago
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- ☆21Mar 2, 2023Updated 3 years ago
- (NBCE)Naive Bayes-based Context Extension on ChatGLM-6b☆15Jun 7, 2023Updated 2 years ago
- This is official code for ASFL.☆22Mar 3, 2025Updated last year
- The pytorch implementation of relational extraction models with PCNN feature extractor and multi-instance learning☆16Mar 8, 2018Updated 8 years ago
- 毕业设计开源代码 分别实现了抽取式中文文本摘要和生成式中文文本摘要☆18Feb 23, 2026Updated last month
- (包含完整代码和坑点记录)Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch☆31Jan 22, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- qwen models finetuning☆107Mar 9, 2025Updated last year
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆56Nov 4, 2025Updated 5 months ago
- MoE model with onnx runtime☆60May 5, 2024Updated last year
- ☆13Oct 20, 2020Updated 5 years ago
- 使用Qwen1.5-0.5B-Chat模型进行通用信息抽取任务的微调,旨在: 验证生成式方法相较于抽取式NER的效果; 为新手提供简易的模型微调流程,尽量减少代码量; 大模型训练的数据格式处理。☆15Sep 6, 2024Updated last year
- ☆18Jan 31, 2023Updated 3 years ago
- ☆15Apr 13, 2023Updated 2 years ago
- 使用多轮对话数据集对deepseek进行lora微调教程☆60Dec 26, 2024Updated last year
- 本项目带大家搭建一个完整的AI应用全栈项目,Flask+Vue.js+CrewAI☆25Oct 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 中文纠错-使用拼音树及编辑距离☆13Jul 19, 2019Updated 6 years ago
- codes for the paper "Super-Resolution for Hyperspectral and Multispectral Image Fusion Accounting for Seasonal Spectral Variability", IEE…☆12Mar 13, 2020Updated 6 years ago
- Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)☆32May 17, 2024Updated last year
- AutoML4ETC, a tool to automatically design efficient and high-performing neural architectures for encrypted traffic classification.☆12Feb 19, 2024Updated 2 years ago
- Dataset2024☆12Jun 12, 2025Updated 10 months ago
- 按照会话解包, 然后提取明文txt信息, 让ChatGPT来判断一下是否存在攻击行为☆14Mar 7, 2023Updated 3 years ago
- ☆20Jun 16, 2025Updated 9 months ago