Qwen1.5大模型微调、基于PEFT框架LoRA微调,在数据集HC3-Chinese上实现文本分类。
☆12Jun 29, 2024Updated last year
Alternatives and similar repositories for Qwen-fine-tune
Users that are interested in Qwen-fine-tune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆32Jun 25, 2025Updated 9 months ago
- 机器学习实战☆12Apr 17, 2019Updated 7 years ago
- Using the Qwen-2.5 model for text classification (lora)☆24May 7, 2025Updated 11 months ago
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated last year
- Dataset and codes for our paper "New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Cat…☆14Dec 14, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Apr 10, 2024Updated 2 years ago
- 收集整理大模型面试题☆12Aug 29, 2024Updated last year
- ☆17Apr 7, 2024Updated 2 years ago
- ☆13Jul 12, 2022Updated 3 years ago
- Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理☆27Jul 26, 2023Updated 2 years ago
- We implement an efficient mechanism for compressing large networks by {\em tensorizing\/} network layers: i.e. mapping layers on to high-…☆11Jul 10, 2018Updated 7 years ago
- SuperAnnotate HTTP service for Generated Text Detection☆16Dec 17, 2024Updated last year
- Native AI 是一个探索本地生活电商领域的多智能体系统,通过 AI 助手一站式解决用户吃喝玩乐住行等日常生活需求。系统基于大语言模型技术,主要为了探索Multi Agent的应用。☆12Apr 13, 2025Updated last year
- Simple code for the tutorial on Polynomial Nets.☆13Jan 19, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- accelerate generating vector by using onnx model☆18Jan 23, 2024Updated 2 years ago
- Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)☆31May 17, 2024Updated last year
- 在NLP领域中一些任务的Demo☆13Sep 11, 2023Updated 2 years ago
- Code for WisdoM: Improving Multimodal Sentiment Analysis by Fusing Contextual World Knowledge☆17Dec 31, 2024Updated last year
- 对 Java 语言的学习☆13Aug 22, 2018Updated 7 years ago
- GraphRAG 中文文档。GraphRAG是一种结构化的、分层的检索增强生成(RAG)方法,而不是使用纯文本片段的语义搜索方法。GraphRAG 过程包括从原始文本中提取出知识图谱,构建社群层级(这种结构通常用来描述个体、群体及它们之间的关系,帮助理解信息如何在社群内部传…☆19Jul 12, 2024Updated last year
- An AI project to provide `private` chat and RAG service. 一个提供私有化检索增强生成的AI项目☆11Jul 14, 2024Updated last year
- Attempt on a Kaggle competition, Personalized Web Search Challenge, hosted by Yandex (http://www.kaggle.com/c/yandex-personalized-web-sea…☆11Jan 3, 2014Updated 12 years ago
- tensorflow2.0 实现的 DCN (Deep & Cross Network) ,使用 Criteo 子数据集加以实践。☆15Aug 1, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- reproduce SimCSE in jupyter-notebook☆10Nov 28, 2021Updated 4 years ago
- Code release for "Learning from Missing Relations: Contrastive Learning with Commonsense Knowledge Graphs for Commonsense Inference"☆10Jun 25, 2022Updated 3 years ago
- MNIST experiment from Tensorizing neural networks (Novikov et al. 2015)☆14Oct 22, 2019Updated 6 years ago
- 使用Simhash对海量文本进行去重☆12Jun 2, 2018Updated 7 years ago
- [WWW '24] UnifiedSSR: A Unified Framework of Sequential Search and Recommendation☆12Feb 16, 2024Updated 2 years ago
- LlaMA3-SFT, Meta-Llama-3-8B/Meta-Llama-3-8B-Instruct微调(transformers)/LORA(peft)/推理, 支持中文(chinese, zh)☆34May 17, 2024Updated last year
- KDD淘宝长尾推荐见https://tianchi.aliyun.com/competition/entrance/231785/information☆11Jul 2, 2020Updated 5 years ago
- 📰 Named entitity recognition (NER) and Entity linking (EL) on the dataset of Patents☆16Jun 5, 2022Updated 3 years ago
- Code & data for IJCAI'22 paper "RecipeRec: A Heterogeneous Graph Learning Model for Recipe Recommendation".☆16Jul 24, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 通过lora对deepseek小模型进行微调☆22Nov 15, 2024Updated last year
- LLM powered AI multi agent platform that coordinate global to individual health through scaling each layer of healthcare☆26May 8, 2024Updated last year
- My solution for #12 in privat leaderboard. Score=0.0260809843625832☆11Sep 6, 2021Updated 4 years ago
- Experiment results using FM, FFM and DeepFM algorithms in Criteo Display Advertising Challenge(https://www.kaggle.com/c/criteo-display-ad…☆13Apr 15, 2020Updated 6 years ago
- Utilizing graphical neural networks and embeddings on a medical database KEGG to perform link predictions and drug similarity systems.☆17Oct 2, 2021Updated 4 years ago
- 使用graphsage 进行连边预测的实验☆13Mar 29, 2019Updated 7 years ago
- ☆16Jun 3, 2025Updated 10 months ago