A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using the response. A RAG system is build upon the tuned qwen2, using Prompt-Template + Tool-Use + Chroma embedding database + LangChain
☆78Apr 16, 2026Updated 2 months ago
Alternatives and similar repositories for Travel-Agent-based-on-Qwen2-RLHF
Users that are interested in Travel-Agent-based-on-Qwen2-RLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Qwen2+SFT+DPO的医疗问答系统,项目中 使用了自定义的 SFTTrainer/DPOTrainer/TRPOTrainer用于训练,其次,项目还调用各种知识库工具(neo4j, milvus, LDA, 等)进行自动化训练数据生成。另外,使用 vllm 用于推理…☆87Apr 29, 2026Updated last month
- 在千问最新的多模态image-text模型Qwen3-VL-4B-Instruct 进行多种lora微调对比效果,通过langchain+RAG+多智能体(Multi-Agent)进行部署☆50Dec 14, 2025Updated 6 months ago
- 这是一份集成了RAG和微调以及思维链的LLM应用!最近也结合了知识图谱以及智能体agent~后续还会有很多更新!☆18Oct 12, 2024Updated last year
- 一个低成本、易于上手的多模态大模型学习项目。基于Qwen3-0.6B和CLIP构建,使用LLaVA架构和LoRA微调,在消费级16G显卡上数小时即可完成训练☆51Sep 15, 2025Updated 9 months ago
- 基于qwen3的医疗大模型研发全流程 0.分词训练 1.增量预训练 2.微调 3.强化 4.量化 5.蒸馏 6.评估 7.lora模型合并 8.服务 9.部署☆45Jan 3, 2026Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15May 24, 2024Updated 2 years ago
- ☆17Apr 8, 2025Updated last year
- 基于PaddleNLP的对话意图识别☆10Apr 11, 2023Updated 3 years ago
- PyTorch implementation of the paper "NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning" (AAAI'24)☆14Jul 5, 2024Updated last year
- 本项目对Deepseek-R1-Distill-Qwen-7B进行心理咨询CoT数据的LoRA微调,以进一步提升Deepseek-R1-Distill-Qwen-7B在心理咨询领域的慢思考能力。☆12Mar 11, 2025Updated last year
- Diffusion-based Negative Sampling on Graphs for Link Prediction☆14Feb 13, 2024Updated 2 years ago
- AiMed面向中文医学的人工智能大语言模型期望实现有效处理医学知识问答、医学论文阅读、医学文献检索等任务和在医学科研中的应用。☆13Feb 8, 2025Updated last year
- ☆19Nov 11, 2022Updated 3 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ACM TOIS] Multi-Behavior Recommendation with Personalized Directed Acyclic Behavior Graphs☆14Dec 6, 2024Updated last year
- KG☆14Nov 26, 2022Updated 3 years ago
- 2024CCF国际AIOps挑战赛-赛道二(GLM4):基于检索增强的运维知识问答挑战赛解决方案分享。☆14Jul 5, 2024Updated last year
- ☆15Aug 26, 2024Updated last year
- 对深度学习中的NLP进行解释和代码使用☆64Jan 5, 2024Updated 2 years ago
- MultiModal Rag with Colpali, Milvus and VLM☆15Dec 22, 2024Updated last year
- Retrieval-Augmented Generation System for Cardiovascular Disease Consultation☆17Dec 31, 2024Updated last year
- CodeReadingNote pro supports jetbrains22.1.4+, code remark, custom tags, tags grouping topic, ongoing maintenance☆13Apr 12, 2026Updated 2 months ago
- Using GDPR to do GDPR Contract Review☆11Mar 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Go 版本的 Redisson 欢迎大家使用☆12Jan 20, 2025Updated last year
- ☆10Aug 14, 2019Updated 6 years ago
- 基于 LangChain 生态与混合检索技术构建的智能化学术研究辅助平台,旨在提供高效、精准的文献深度分析能力。系统支持 PDF/TXT/DOCX 多格式学术文献上传,通过 BGE-Small-ZH 向量嵌入模型与 BM25 关键词检索融合的混合检索策略,实现跨文档语义关联…☆20Aug 28, 2025Updated 9 months ago
- “达观杯”长文本智能处理挑战赛。达观数据提供了一批长文本数据和分类信息,希望选手动用自己的智慧,结合当下最先进的NLP和人工 智能技术,深入分析文本内在结构和语义信息,构建文本分类模型,实现精准分类。☆10Jul 20, 2018Updated 7 years ago
- TensorFlow: learn and practice☆11Aug 30, 2018Updated 7 years ago
- Archive of Miniflux v1☆16Oct 3, 2020Updated 5 years ago
- Scraped reviews from OpenRice for sentiment analysis. Formatted to use with BERT.☆11Apr 9, 2020Updated 6 years ago
- 本项目存放RankMixer复现相关代码☆68Apr 3, 2026Updated 2 months ago
- ☆12Jun 19, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Jun 13, 2023Updated 3 years ago
- ☆15Feb 26, 2024Updated 2 years ago
- Qwen GRPO Graph Extraction RL Finetune☆70Apr 2, 2025Updated last year
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆70Jul 24, 2024Updated last year
- ☆11Nov 21, 2024Updated last year
- A curated list of resources dedicated to word segmentation☆12Jan 9, 2019Updated 7 years ago
- 基于Qwen2.5模型、使用DISC-Law-SFT-Pair数据集微调的法律大模型☆12Dec 29, 2024Updated last year