NJUxlj / Travel-Agent-based-on-Qwen2-RLHFLinks

A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using the response. A RAG system is build upon the tuned qwen2, using Prompt-Template + Tool-Use + Chroma embedding database + LangChain

☆22

Alternatives and similar repositories for Travel-Agent-based-on-Qwen2-RLHF

Users that are interested in Travel-Agent-based-on-Qwen2-RLHF are comparing it to the libraries listed below

Sorting:

NJUxlj / Chinese-MedQA-Qwen2
基于Qwen2+SFT+DPO的医疗问答系统，项目中使用了LLaMA-Factory用于训练，fastllm和vllm用于推理，
☆15Updated last month
Dylan9897 / LLM-TextClassification
集成Qwen与DeepSeek等先进大语言模型，支持纯LLM+分类层模式及LLM+LoRA+分类层模式，使用transformers模块化设计和训练便于根据需要调整或替换组件。
☆13Updated 4 months ago
percent4 / embedding_model_exp
本项目用于Embedding模型的相关实验，包括Embedding模型评估、Embedding模型微调、Embedding模型量化等。
☆58Updated last year
NJUxlj / An-Academic-Paper-Chatbot-based-on-LLama3.1-and-Knowledge-Graph
基于知识图谱和大模型的对话系统
☆10Updated 2 months ago
Lightblues / AgentRE
Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".
☆69Updated last year
NanGePlus / PromptLangChainTest
本项目主要介绍prompt工程相关用例。包括模拟智能推荐客服系统构建和问答、思维链、自洽性、思维树等相关进阶demo，旨在帮助大家理解prompt。通过一份代码实现了同时支持多种大模型（如OpenAI、阿里通义千问等）并使用FastAPI对应用进行API封装。
☆31Updated 10 months ago
chaoql / rag-best-practices
大模型检索增强生成技术最佳实践。
☆80Updated 11 months ago
Logistic98 / rag-omni
基于大语言模型的检索增强生成RAG示例
☆153Updated 3 months ago
Ginjing-Yuan / QWen2-from_ground_up
☆20Updated last year
cwxndl / LLM
大语言模型应用：RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛
☆66Updated 5 months ago
limafang / tiny-graphrag
☆41Updated 2 months ago
heyblackC / BetterMixture-Top1-Solution
天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案
☆31Updated last year
yanqiangmiffy / Agent-Tutorials-ZH
大模型智能体Agent中文教程，博客代码仓库
☆26Updated last month
qianniuspace / llm_notebooks
AI 应用示例合集
☆103Updated last year
owenliang / qwen-dpo
通义千问的DPO训练
☆51Updated 10 months ago
liucongg / LLMsBook
大型语言模型实战指南：应用实践与场景落地
☆75Updated 10 months ago
Shy2593666979 / Agent_Multiple-Talk
基于LLM的多轮问答系统。结合了意图识别和词槽填充技术
☆21Updated last week
Alibaba-NLP / CoFE-RAG
☆37Updated 3 months ago
826568389 / GRPO-R1
☆13Updated 4 months ago
cjymz886 / LLM-RAG-QA
LLM+RAG for QA
☆22Updated last year
linancn / TianGong-AI-Unstructure
TianGong-AI-Unstructure
☆68Updated last month
VovyH / MultiAgent-Search
[2025-上海人工智能实验室书生实训营十佳、优秀项目]
☆31Updated 2 weeks ago
taishan1994 / Llama3.1-Finetuning
对llama3进行全参微调、lora微调以及qlora微调。
☆204Updated 10 months ago
LancelotXWX / MAG-SQL
MAG-SQL: Multi-Agent Generative Approach with Soft Schema Linking and Iterative Sub-SQL Refinement for Text-to-SQL
☆16Updated 3 weeks ago
fufankeji / ReAct_AI_Agent
基于ReAct构建的电商智能客服代理
☆29Updated 10 months ago
yanqiangmiffy / tree2retriever
Recursive Abstractive Processing for Tree-Organized Retrieval
☆10Updated last year
Yazooliu / agent_from_0t1
手把手带你从0到1实现大模型agent
☆118Updated last year
pydaxing / clip_blip_embedding_rag
在RAG技术中，嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务，该服务支持文本和图像的嵌入生成与相似度计算，为多模态信息检索提供了基础能力。
☆32Updated 7 months ago
Trae1ounG / DyPRAG
Official code for Dynamic Parametric RAG.
☆141Updated 2 months ago
ZBayes / poc_project
通用简单工具项目
☆20Updated 10 months ago