Tongyi Qianwen (Qwen) SFT experiments
☆82Jan 6, 2024Updated 2 years ago
Alternatives and similar repositories for qwen-sft
Users that are interested in qwen-sft are comparing it to the libraries listed below.
- RAG vector retrieval example☆150Feb 14, 2024Updated 2 years ago
- qwen ai agent☆148Feb 21, 2024Updated 2 years ago
- A simple deep learning framework inspired by Dezero and PyTorch☆31Jan 27, 2025Updated last year
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- LLM Tokenizer with BPE algorithm☆48May 7, 2024Updated last year
- ☆32Oct 22, 2023Updated 2 years ago
- Tongyi Qianwen (Qwen) vLLM inference deployment demo☆644Mar 28, 2024Updated last year
- ☆23Jan 16, 2024Updated 2 years ago
- ☆13Apr 10, 2025Updated 11 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆36Aug 3, 2024Updated last year
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆10Jul 27, 2024Updated last year
- ☆13Jan 16, 2025Updated last year
- ☆21Mar 2, 2023Updated 3 years ago
- This is official code for ASFL.☆21Mar 3, 2025Updated last year
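- Face completion for mask-occluded regions. To improve completion quality, it modifies an existing generative adversarial network (GAN) face image completion algorithm by adding facial landmark constraints that carry facial structure information.☆10Oct 2, 2023Updated 2 years ago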
- ☆10Dec 17, 2023Updated 2 years ago
- ☆19Feb 24, 2025Updated last year
- Graph QABot Demo | Knowledge graph question-answering example☆15Apr 11, 2023Updated 2 years ago
- Code for CVPR2018 "Iterative Learning with Open-set Noisy Labels"☆12Mar 12, 2021Updated 5 years ago
- qwen models finetuning☆107Mar 9, 2025Updated last year
- Federated Few-shot Learning for Mobile NLP. Conditionally accepted by MobiCom'23.☆16Aug 18, 2023Updated 2 years ago
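- Fine-tunes Alibaba's open-source text detection model, using OCR results returned by the 合合 (Hehe) recognition service as initial training data, so that the model better fits a specific scenario of 10,000 images and text recognition accuracy improves.☆10Dec 9, 2024Updated last year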
- ☆12Mar 6, 2023Updated 3 years ago
- MoE model with onnx runtime☆59May 5, 2024Updated last year
- Tutorial on LoRA fine-tuning of deepseek with a multi-turn dialogue dataset☆60Dec 26, 2024Updated last year
- Chinese error correction using a pinyin tree and edit distance☆13Jul 19, 2019Updated 6 years ago
- LLMs Learn Task Heuristics from Demonstrations: A Heuristic-Driven Prompting Strategy for Document-Level Event Argument Extraction (ACL 2…☆14Aug 12, 2024Updated last year
- Qwen2.5 0.5B GRPO☆81Feb 16, 2025Updated last year
- Chinese named entity recognition with Bert-BiLstm-CRF; the dataset comes from https://aistudio.baidu.com/aistudio/competition/detail/802/0/datasets☆18Mar 1, 2024Updated 2 years ago
- PyTorch DDP Training Demo☆30Oct 20, 2024Updated last year
- A bot for Binance written in python☆10Jan 10, 2026Updated 2 months ago
- Beyond Known Clusters: Probe New Prototypes for Efficient Generalized Class Discovery☆16Apr 28, 2024Updated last year
- Gemma-SFT, gemma-2b/gemma-7b fine-tuning (transformers)/LoRA (peft)/inference☆32May 17, 2024Updated last year
- Optimized training for OCR recognition of rare characters☆16Feb 16, 2023Updated 3 years ago
- albert-fc for RE (Relation Extraction), Chinese relation extraction☆19Apr 24, 2023Updated 2 years ago
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 6 months ago
- Team UWA's visualisation app developed as part of the ICDM 2019 Knowledge Graph Contest.☆13Dec 8, 2022Updated 3 years ago
- Source code for MICCAI 2023 paper entitled: 'FeSViBS: Federated Split Learning of Vision Transformer with Block Sampling'☆20Sep 25, 2023Updated 2 years ago