An automated pipeline for scraping, processing, and visualizing medical Q&A data to build high-quality datasets. Includes a comprehensive guide for fine-tuning Qwen-7B-Chat.
☆23Dec 24, 2024Updated last year
Alternatives and similar repositories for DataScraping-LLMs-FineTuning
Users who are interested in DataScraping-LLMs-FineTuning are comparing it to the repositories listed below
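- An enterprise gateway for LLM APIs: an internal API management, distribution, and aggregation system that converts multiple large models into a unified OpenAI-compatible interface, with special support for the Chinese open-source models deepseek, qwen, kimi, and glm. Suitable for individuals or enterprises that need unified LLM API management and channel distribution (key management and redistribution). Maintained long-term, …☆37Sep 12, 2025Updated 5 months ago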
- An LLM-based RAG project that implements both text-based and knowledge-graph-based RAG☆27Dec 11, 2025Updated 2 months ago
- An intelligent agent system based on FastAPI and React, supporting multi-agent management, MCP management, knowledge bases, chat conversations, and other features…☆21Jan 25, 2026Updated last month
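- Y-Agent Studio is an agent development kit for enterprise-grade applications, with Y-Agent as its core module. It includes features that vertical domains need most: agent orchestration, RAG, workflow logs, unit testing, workflow testing, and corpus production. Agent orchestration supports both multi-agent collaboration and hybrid workflow orchestration within a single workflow…☆25Oct 4, 2025Updated 5 months ago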
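- A medical Q&A system based on Qwen2 + SFT + DPO. The project uses custom SFTTrainer/DPOTrainer/TRPOTrainer classes for training, and also calls various knowledge-base tools (neo4j, milvus, LDA, etc.) to generate training data automatically. In addition, it uses vllm for inference…☆61Jan 4, 2026Updated 2 months ago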
- NLP on Korean news articles. Automatic topic extraction through dynamic clustering.☆12Sep 15, 2017Updated 8 years ago
- Deploying a stripped-down version of yolov5 on HiSilicon devices☆13Nov 22, 2021Updated 4 years ago
- A tool that makes it easy to run Intel OpenVINO demos and samples.☆11Jan 31, 2023Updated 3 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- NetMax is a python library that provides the implementation of several algorithms for the problem of Influence Maximization in Social Net…☆14Sep 17, 2025Updated 5 months ago
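- The OpenHIS hospital system (Xinchuang edition) integrates ten core modules, covering catalog management, basic data configuration, personalized settings, end-to-end outpatient/inpatient management, intelligent pharmacy and drug-warehouse control, fine-grained consumables management, financial accounting, medical-insurance compliance integration, and multidimensional report analysis, for a total of 372 standardized functions.☆13Feb 5, 2026Updated last month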
- Arabic Handwritten Characters Dataset☆13Jun 22, 2017Updated 8 years ago
- CLIP-based Adaptive Graph Attention Network for Large-Scale Unsupervised Multi-modal Hashing Retrieval☆10Mar 18, 2024Updated last year
- A speech-to-text tool based on Alibaba's large models from the ModelScope community☆10Feb 2, 2024Updated 2 years ago
- PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k☆11Mar 14, 2024Updated last year
- A Kivy tutorial for PyOhio 2013☆14Apr 30, 2014Updated 11 years ago
- A highly commented TensorFlow implementation of DCGAN and WGAN for images.☆10Dec 22, 2017Updated 8 years ago
- The official PyTorch implementation of "Boosting Semi-Supervised Face Recognition with Noise Robustness"☆11Jul 22, 2021Updated 4 years ago
- Official code release for paper "Improving Confidence Estimates for Unfamiliar Examples" https://arxiv.org/abs/1804.03166☆12Aug 16, 2020Updated 5 years ago
- Solution approaches, with Python implementations, for common big-data interview and written-test questions at BAT companies☆11Aug 17, 2015Updated 10 years ago
- Implementation of the Influence Maximization Benchmarker (IMB)☆14Aug 10, 2023Updated 2 years ago
- Code for the blog post on GAN stability☆10Oct 8, 2016Updated 9 years ago
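- A tool for standardizing ICD-10 medical diagnosis content, based on retrieval-augmented generation (RAG), that supports intelligent matching and standardization of Chinese medical terms.☆18Aug 12, 2025Updated 6 months ago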
- Retrieval and reranking implemented with Qwen3's Embedding and Reranker models☆20Jun 22, 2025Updated 8 months ago
- Face hashing using neural networks, mapping images to Hamming codes.☆10Dec 21, 2018Updated 7 years ago
- Simple rule-based grapheme-to-phoneme conversion in Python☆11Sep 2, 2017Updated 8 years ago
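- Crawling techniques including scrapy, pyspider, appium, beautiful soup, selenium, and uiautomator2. Crawlers for vulnerability information, threat intelligence, public-opinion analysis, self-media platform information, e-commerce product information, and more.☆10Oct 20, 2023Updated 2 years ago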
- GLFW3 application☆14Jan 25, 2026Updated last month
- An educational game. It won 3rd place in the Kivy App Contest 2014.☆16Nov 19, 2024Updated last year
- Mandelbulb rendered as a point cloud for iOS, using Swift and Metal☆13May 31, 2021Updated 4 years ago
- Create 3D point clouds from depth images captured with the lens blur feature of the Google Camera app for Android.☆19Apr 26, 2014Updated 11 years ago
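- This project focuses on recognizing user text in intelligent dialogue scenarios, automatically determining the emotion category and reporting the corresponding accuracy. It can be widely applied to sentiment analysis of social media comments, emotion analysis for intelligent customer service, and similar scenarios, serving as an emotional-support tool that helps users find relief from their emotions. After multiple rounds of prompt refinement, the GPT model's final recognition accuracy exceeded the human baseline.☆10Jul 25, 2023Updated 2 years ago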
- [AAAI 2024] PoseGen: Learning to Generate 3D Human Pose Datasets with NeRF☆10Dec 29, 2023Updated 2 years ago
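- This repository describes the implementation of the 3rd-place entry in the CCAC2023 multimodal conversational emotion recognition evaluation☆11Aug 11, 2024Updated last year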