A travel agent based on Qwen2.5, fine-tuned with SFT + DPO/PPO/GRPO on a travel question-answer dataset; a mind map can be generated from its responses. A RAG system is built on the tuned Qwen2 using prompt templates, tool use, a Chroma embedding database, and LangChain.
☆69Apr 16, 2026Updated this week
Alternatives and similar repositories for Travel-Agent-based-on-Qwen2-RLHF
Users that are interested in Travel-Agent-based-on-Qwen2-RLHF are comparing it to the libraries listed below.
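- A medical question-answering system based on Qwen2 + SFT + DPO. The project uses custom SFTTrainer/DPOTrainer/TRPOTrainer classes for training, and also calls various knowledge-base tools (neo4j, milvus, LDA, etc.) to generate training data automatically. In addition, vllm is used for inference…☆75Updated this week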
- Compares several LoRA fine-tuning approaches on Qwen3-VL-4B-Instruct, Qwen's latest multimodal image-text model, and deploys the result via LangChain + RAG + Multi-Agent☆42Dec 14, 2025Updated 4 months ago
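- An LLM application that integrates RAG, fine-tuning, and chain-of-thought! It has recently also incorporated knowledge graphs and agents, with many more updates to come!☆18Oct 12, 2024Updated last year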
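- A low-cost, easy-to-start learning project for multimodal large models. Built on Qwen3-0.6B and CLIP, using the LLaVA architecture and LoRA fine-tuning; training completes within a few hours on a consumer-grade 16 GB GPU☆48Sep 15, 2025Updated 7 months ago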
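- End-to-end development pipeline for a qwen3-based medical large model: 0. tokenizer training 1. continued pre-training 2. fine-tuning 3. reinforcement learning 4. quantization 5. distillation 6. evaluation 7. LoRA model merging 8. serving 9. deployment☆41Jan 3, 2026Updated 3 months ago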
- ☆15May 24, 2024Updated last year
- ☆16Apr 8, 2025Updated last year
- PyTorch implementation of the paper "NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning" (AAAI'24)☆14Jul 5, 2024Updated last year
- Generate personalized travel itineraries based on user preferences.☆56Aug 11, 2024Updated last year
- This project applies LoRA fine-tuning to Deepseek-R1-Distill-Qwen-7B on psychological-counseling CoT data to further improve the model's slow-thinking ability in the counseling domain.☆12Mar 11, 2025Updated last year
- Diffusion-based Negative Sampling on Graphs for Link Prediction☆14Feb 13, 2024Updated 2 years ago
- Intent recognition, slot filling, and slot-value error correction models for music-domain corpora☆18Mar 24, 2023Updated 3 years ago
- ☆18Nov 11, 2022Updated 3 years ago
- Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.☆24Nov 23, 2024Updated last year
- Chinese medical multimodal large model: Large Chinese Language-and-Vision Assistant for BioMedicine☆107May 22, 2024Updated last year
- ACM Transactions on Information Systems (TOIS), the code and datasets for CKML.☆13Aug 31, 2023Updated 2 years ago
- [ACM TOIS] Multi-Behavior Recommendation with Personalized Directed Acyclic Behavior Graphs☆14Dec 6, 2024Updated last year
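- 2024 CCF International AIOps Challenge, Track 2 (GLM4): solution write-up for the retrieval-augmented operations and maintenance knowledge Q&A challenge.☆14Jul 5, 2024Updated last year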
- ☆15Aug 26, 2024Updated last year
- Explanations and code usage for NLP in deep learning☆63Jan 5, 2024Updated 2 years ago
- MultiModal Rag with Colpali, Milvus and VLM☆15Dec 22, 2024Updated last year
- Retrieval-Augmented Generation System for Cardiovascular Disease Consultation☆17Dec 31, 2024Updated last year
- CodeReadingNote pro supports jetbrains22.1.4+, code remark, custom tags, tags grouping topic, ongoing maintenance☆12Apr 12, 2026Updated last week
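- A Chinese medical inquiry intent recognition method that fuses domain knowledge and syntactic structure using multiple graph neural networks☆17Nov 28, 2022Updated 3 years ago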
- sharedownload☆12Aug 14, 2020Updated 5 years ago
- Generate bpftrace eBPF programs online with GPT or LLM☆22Aug 7, 2024Updated last year
- An e-commerce intelligent customer service agent built on ReAct☆52Sep 19, 2024Updated last year
- ☆21Feb 6, 2024Updated 2 years ago
- ☆10Aug 14, 2019Updated 6 years ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 4 years ago
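- "DataGrand Cup" long-text intelligent processing challenge. DataGrand provided a batch of long-text data with classification labels, hoping contestants would combine their own ingenuity with the most advanced NLP and AI techniques to analyze the internal structure and semantic information of the texts in depth, build a text classification model, and achieve accurate classification.☆10Jul 20, 2018Updated 7 years ago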
- TensorFlow: learn and practice☆11Aug 30, 2018Updated 7 years ago
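- This repository records and shares my learning journey in the LLM and Agent fields, deepening my understanding of the technologies through hands-on projects. By building LLM- and Agent-based applications from scratch, I learn LLM principles and gain Agent development experience.☆25Mar 28, 2025Updated last year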
- Archive of Miniflux v1☆16Oct 3, 2020Updated 5 years ago
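- A deep-learning-based sentiment analysis system for drug reviews that automatically classifies review sentiment (positive, neutral, negative). The project uses a hybrid LSTM + BERT word-embedding architecture and provides a friendly web interface.☆14Dec 24, 2024Updated last year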
- A large-scale node-classification graph benchmark that brings together both the heterophily and heterogeneity properties of real-world gr…☆38Aug 4, 2025Updated 8 months ago
- Scraped reviews from OpenRice for sentiment analysis. Formatted to use with BERT.☆11Apr 9, 2020Updated 6 years ago
- This project hosts code for reproducing RankMixer☆51Apr 3, 2026Updated 2 weeks ago
- ☆12Jun 19, 2018Updated 7 years ago