A travel agent based on Qwen2.5, fine-tuned with SFT followed by DPO/PPO/GRPO on a travel question-answer dataset; a mind map can be generated from the response. A RAG system is built on the tuned Qwen2 model, using prompt templates, tool use, a Chroma embedding database, and LangChain.
☆78Apr 16, 2026Updated last month
Alternatives and similar repositories for Travel-Agent-based-on-Qwen2-RLHF
Users that are interested in Travel-Agent-based-on-Qwen2-RLHF are comparing it to the libraries listed below.
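- A medical question-answering system based on Qwen2 + SFT + DPO. The project uses custom SFTTrainer/DPOTrainer/TRPOTrainer classes for training, and also calls various knowledge-base tools (neo4j, milvus, LDA, etc.) to generate training data automatically. In addition, vllm is used for inference…☆88Apr 29, 2026Updated last month
- An LLM application that integrates RAG, fine-tuning, and chain-of-thought! It has recently added knowledge graphs and agents as well, and many more updates are planned!☆18Oct 12, 2024Updated last year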
- ☆15May 24, 2024Updated 2 years ago
- ☆17Apr 8, 2025Updated last year
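- This project applies LoRA fine-tuning to Deepseek-R1-Distill-Qwen-7B on psychological-counseling CoT data, to further improve the model's slow-thinking ability in the psychological counseling domain.☆12Mar 11, 2025Updated last year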
- Diffusion-based Negative Sampling on Graphs for Link Prediction☆14Feb 13, 2024Updated 2 years ago
- Intent recognition, slot filling, and slot-value correction models for music-domain corpora☆18Mar 24, 2023Updated 3 years ago
- ☆18Nov 11, 2022Updated 3 years ago
- ACM Transactions on Information Systems (TOIS), the code and datasets for CKML.☆13Aug 31, 2023Updated 2 years ago
- KG☆14Nov 26, 2022Updated 3 years ago
- ☆15Aug 26, 2024Updated last year
- Explanations and code usage for NLP in deep learning☆64Jan 5, 2024Updated 2 years ago
- MultiModal Rag with Colpali, Milvus and VLM☆15Dec 22, 2024Updated last year
- Retrieval-Augmented Generation System for Cardiovascular Disease Consultation☆17Dec 31, 2024Updated last year
- [ICLR'26] EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization☆30Aug 5, 2025Updated 9 months ago
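- A Chinese medical inquiry intent recognition method that fuses domain knowledge and syntactic structure using multi-graph neural networks☆17Nov 28, 2022Updated 3 years ago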
- sharedownload☆12Aug 14, 2020Updated 5 years ago
- Using GDPR to do GDPR Contract Review☆11Mar 14, 2025Updated last year
- ☆21Feb 6, 2024Updated 2 years ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 5 years ago
- TensorFlow: learn and practice☆11Aug 30, 2018Updated 7 years ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
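- This repository records and shares my learning journey in the LLM and Agent fields, using hands-on projects to understand the underlying techniques in depth. It builds LLM- and Agent-based applications from scratch to learn LLM principles and gain Agent development experience.☆27Mar 28, 2025Updated last year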
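- Multi-task learning with MMOE and PLE☆40Sep 8, 2021Updated 4 years ago
- A deep-learning-based drug review sentiment analysis system that automatically classifies the sentiment of drug reviews (positive, neutral, negative). The project uses a hybrid architecture of LSTM + BERT word embeddings and provides a user-friendly web interface.☆13Dec 24, 2024Updated last year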
- A large-scale node-classification graph benchmark that brings together both the heterophily and heterogeneity properties of real-world gr…☆40Aug 4, 2025Updated 9 months ago
- ☆12Jun 19, 2018Updated 7 years ago
- ☆14Apr 19, 2022Updated 4 years ago
- ☆24Jun 13, 2023Updated 2 years ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆27Oct 13, 2025Updated 7 months ago
- ☆15Feb 26, 2024Updated 2 years ago
- Qwen GRPO Graph Extraction RL Finetune☆69Apr 2, 2025Updated last year
- ☆26Mar 29, 2025Updated last year
- Official code for "Automated Scoring for Reading Comprehension via In-context BERT Tuning" (AIED 2022)☆13May 23, 2022Updated 4 years ago
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆70Jul 24, 2024Updated last year
- Official source code for the book 《图解设计模式》 (Illustrated Design Patterns) by Hiroshi Yuki☆13Nov 28, 2018Updated 7 years ago
- A legal LLM based on the Qwen2.5 model, fine-tuned on the DISC-Law-SFT-Pair dataset☆12Dec 29, 2024Updated last year
- Codes for paper "Contextual Information and Commonsense Based Prompt for Emotion Recognition in Conversation" published in ECML-PKDD 2022…☆17Jul 6, 2022Updated 3 years ago
- [Paper][ICDE2023] Relational Message Passing for Fully Inductive Knowledge Graph Completion☆26Sep 30, 2022Updated 3 years ago