An automated pipeline for scraping, processing, and visualizing medical Q&A data to build high-quality datasets. Includes a comprehensive guide for fine-tuning Qwen-7B-Chat.
☆23Dec 24, 2024Updated last year
Alternatives and similar repositories for DataScraping-LLMs-FineTuning
Users that are interested in DataScraping-LLMs-FineTuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 结合知识图谱做的有关诗词的问答demo☆11Mar 11, 2020Updated 6 years ago
- Implementation of the Influence Maximization Benchmarker (IMB)☆14Aug 10, 2023Updated 2 years ago
- 基于大语言模型的RAG项目,分别实现了基于文本和知识图谱的RAG☆29Dec 11, 2025Updated 3 months ago
- A tool to extract QQ chat history☆22Oct 15, 2024Updated last year
- Influence Maximization in Near-Linear Time: A Martingale Approach Scala implementation☆14Sep 3, 2018Updated 7 years ago
- 该仓库主要描述了CCAC2023多模态对话情绪识别评测第3名的实现过程☆12Aug 11, 2024Updated last year
- 基于Qwen2+SFT+DPO的医疗问答系统,项目中使用了自定义的 SFTTrainer/DPOTrainer/TRPOTrainer用于训练,其次,项目还调用各种知识库工具(neo4j, milvus, LDA, 等)进行自动化训练数据生成。另外,使用 vllm 用于推理…☆68Jan 4, 2026Updated 2 months ago
- python人脸识别和情绪识别☆17Oct 2, 2023Updated 2 years ago
- 2024年春节刘谦纸牌魔术 - 《守岁共此时》☆12Feb 25, 2024Updated 2 years ago
- 基于nodejs 的企业微信加解密库☆19May 29, 2024Updated last year
- NetMax is a python library that provides the implementation of several algorithms for the problem of Influence Maximization in Social Net…☆14Sep 17, 2025Updated 6 months ago
- SwornDisk是一个面向可信执行环境的、基于日志结构的安全块设备(全国大学生操作系统比赛2022)☆24Aug 14, 2022Updated 3 years ago
- ☆10Dec 8, 2022Updated 3 years ago
- 该项目专注于识别智能对话场景中的用户文本,自动判断情绪类别并给出相应的准确度。可以广泛应用于社交媒体评论情感分析、智能客服情绪分析等场景,成为情感支持工具,帮助用户 从情绪中解脱。多次Prompt提升后,GPT模型最终识别准确率高于人类Baseline水准。☆11Jul 25, 2023Updated 2 years ago
- 对qwen2.5进行微调以及知识蒸馏☆18Dec 24, 2024Updated last year
- 基于internlm-chat-7b的保险知识大模型微调☆20Apr 26, 2024Updated last year
- 新心数科,基于企业微信客户存留系统saas工具☆18Jan 6, 2023Updated 3 years ago
- ☆10Sep 7, 2021Updated 4 years ago
- This repository contains codes for Timeline model☆10Dec 18, 2018Updated 7 years ago
- ☆11Sep 6, 2019Updated 6 years ago
- ☆32Updated this week
- 大模型API企业网关,公司内部API管理,分发聚和系统,支持将多种大模型转换成统一的OpenAI兼容接口,尤其对国内开源模型deepseek,qwen,kimi,glm提供特别支持 可供个人或者企业内部大模型API统一管理和渠道分发使用(key管理与二次分发),长期更新,支…☆41Sep 12, 2025Updated 6 months ago
- Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.☆13Nov 17, 2020Updated 5 years ago
- 哈尔滨工业大学854计算机基础硕士研究生入学考试资料收集☆16Apr 12, 2021Updated 4 years ago
- UniHPF : Universal Healthcare Predictive Framework with Zero Domain Knowledge☆13Nov 16, 2023Updated 2 years ago
- 使用ACE2005创建以事件和实体为节点的事件知识图谱,用于智能问答☆18Feb 29, 2020Updated 6 years ago
- 基于 Apache Airflow 的微信智能应用编排框架,通过可视化工作流驱动 AI 与数据自动化任务。支持 智能客服(多轮对话/知识库)、AI 图文/短视频生成、智能提醒等应用,灵活扩展多模态交互与大模型能力。☆80Mar 15, 2026Updated last week
- ☆68May 16, 2025Updated 10 months ago
- 以InternLM2-chat-7为基座模型,以常用中药等为数据集,微调的大模型。中医聊天小助手。☆16Feb 29, 2024Updated 2 years ago
- 基于React + FastAPI + LangChain + 通义千问的智能医疗问答系统,支持基于检索增强生成(RAG)的医疗知识问答。☆66Mar 27, 2025Updated 11 months ago
- 为bubbliiiing的yolo系列代码进行onnx部署(C++),目前已完美适配yolov4>>yolov5>>yolov5-6.1>>yolov7☆12Nov 16, 2022Updated 3 years ago
- ☆15Sep 18, 2021Updated 4 years ago
- 多模态情绪识别☆28Aug 11, 2023Updated 2 years ago
- 一个基于大模型微调的中文医疗问答机器人应用☆25Jan 11, 2024Updated 2 years ago
- 实验室服务器的使用指南☆22Feb 9, 2020Updated 6 years ago
- 基于qlora对baichuan-7B大模型进行指令微调。☆23Jun 22, 2023Updated 2 years ago
- [AAAI 2024] PoseGen: Learning to Generate 3D Human Pose Datasets with NeRF☆10Dec 29, 2023Updated 2 years ago
- DescEmb - Unifying Heterogenous Electronic Health Records Systems via Text-Based Code Embedding☆22Apr 29, 2025Updated 10 months ago
- 全国大学生数学建模竞赛LaTeX模板,拿来即用。代码美化、参考文献符合国标、文件结构分明☆35Oct 8, 2021Updated 4 years ago