A travel agent based on Qwen2.5, fine-tuned with SFT followed by DPO/PPO/GRPO on a travel question-answer dataset; a mind map can be generated from the model's response. A RAG system is built on top of the tuned Qwen2 model, using Prompt-Template + Tool-Use + Chroma embedding database + LangChain.
☆63Nov 14, 2025Updated 4 months ago
Alternatives and similar repositories for Travel-Agent-based-on-Qwen2-RLHF
Users that are interested in Travel-Agent-based-on-Qwen2-RLHF are comparing it to the libraries listed below.
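- A medical question-answering system based on Qwen2 + SFT + DPO. The project uses custom SFTTrainer/DPOTrainer/TRPOTrainer classes for training and also calls various knowledge-base tools (neo4j, milvus, LDA, etc.) to generate training data automatically. In addition, vllm is used for inference…☆69Jan 4, 2026Updated 2 months ago
- Compares several LoRA fine-tuning approaches on Qwen3-VL-4B-Instruct, the latest Qwen multimodal image-text model, and deploys it with langchain + RAG + Multi-Agent☆34Dec 14, 2025Updated 3 months ago
- An LLM application that integrates RAG, fine-tuning, and chain-of-thought! It has recently also incorporated knowledge graphs and agents, with many more updates to come!☆18Oct 12, 2024Updated last year
- Full development pipeline for a qwen3-based medical LLM: 0. tokenizer training 1. continued pre-training 2. fine-tuning 3. reinforcement learning 4. quantization 5. distillation 6. evaluation 7. LoRA model merging 8. serving 9. deployment☆37Jan 3, 2026Updated 2 months ago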
- FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing☆31Sep 9, 2025Updated 6 months ago
- ☆16Apr 8, 2025Updated 11 months ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated last year
- PyTorch implementation of the paper "NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning" (AAAI'24)☆14Jul 5, 2024Updated last year
- [CVPR 2025] MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities☆32Apr 6, 2025Updated 11 months ago
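- This project fine-tunes Deepseek-R1-Distill-Qwen-7B with LoRA on psychological-counseling CoT data to further improve its slow-thinking ability in the psychological counseling domain.☆12Mar 11, 2025Updated last year
- Intent recognition, slot filling, and slot-value correction models for music-domain corpora☆18Mar 24, 2023Updated 3 years ago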
- ☆18Nov 11, 2022Updated 3 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- Chinese medical multimodal large model: Large Chinese Language-and-Vision Assistant for BioMedicine☆107May 22, 2024Updated last year
- ACM Transactions on Information Systems (TOIS), the code and datasets for CKML.☆13Aug 31, 2023Updated 2 years ago
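- 2024 CCF International AIOps Challenge, Track 2 (GLM4): solution write-up for the retrieval-augmented operations-knowledge question-answering challenge.☆14Jul 5, 2024Updated last year
- An intelligent academic research assistant platform built on the LangChain ecosystem and hybrid retrieval, aiming to provide efficient and accurate in-depth literature analysis. The system supports uploading academic documents in PDF/TXT/DOCX formats and, through a hybrid retrieval strategy that fuses the BGE-Small-ZH embedding model with BM25 keyword retrieval, achieves cross-document semantic association…☆18Aug 28, 2025Updated 7 months ago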
- [ACM TOIS] Multi-Behavior Recommendation with Personalized Directed Acyclic Behavior Graphs☆14Dec 6, 2024Updated last year
- ☆15Aug 26, 2024Updated last year
- [ICML 2024] KnowFormer: Revisiting Transformers for Knowledge Graph Reasoning☆25Sep 20, 2024Updated last year
- MultiModal Rag with Colpali, Milvus and VLM☆15Dec 22, 2024Updated last year
- Retrieval-Augmented Generation System for Cardiovascular Disease Consultation☆17Dec 31, 2024Updated last year
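- A Chinese medical inquiry intent recognition method that fuses domain knowledge and syntactic structure using multiple graph neural networks☆17Nov 28, 2022Updated 3 years ago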
- An e-commerce intelligent customer-service agent built on ReAct☆49Sep 19, 2024Updated last year
- ☆21Feb 6, 2024Updated 2 years ago
- ☆10Aug 14, 2019Updated 6 years ago
- Source Data of ACL2021 paper "Syntax-Enhanced Pre-trained Model"☆11Jun 1, 2021Updated 4 years ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
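- The "Daguan Cup" long-text intelligent processing challenge. Daguan Data provided a set of long-text data with classification labels and asked participants to apply state-of-the-art NLP and AI techniques to analyze the internal structure and semantics of the texts and build a text classification model for accurate classification.☆10Jul 20, 2018Updated 7 years ago
- Multi-task learning with MMOE and PLE☆40Sep 8, 2021Updated 4 years ago
- A deep-learning-based sentiment analysis system for drug reviews that automatically classifies review sentiment (positive, neutral, negative). The project uses a hybrid architecture of LSTM + BERT word embeddings and provides a user-friendly web interface.☆14Dec 24, 2024Updated last year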
- Scraped reviews from OpenRice for sentiment analysis. Formatted to use with BERT.☆11Apr 9, 2020Updated 5 years ago
- ☆12Jun 19, 2018Updated 7 years ago
- Code for Mathematical Foundations of Reinforcement Learning☆12Mar 31, 2025Updated last year
- ☆14Apr 19, 2022Updated 3 years ago
- ☆15Feb 26, 2024Updated 2 years ago
- ☆21Jul 3, 2023Updated 2 years ago
- Official code for "Automated Scoring for Reading Comprehension via In-context BERT Tuning" (AIED 2022)☆13May 23, 2022Updated 3 years ago
- ☆25Mar 29, 2025Updated last year