NJUxlj / Travel-Agent-based-on-Qwen2-RLHF

A travel agent based on Qwen2.5, fine-tuned with SFT followed by DPO/PPO/GRPO on a travel question-answer dataset. Its responses can be rendered as a mind map. A RAG system is built on top of the tuned Qwen2 model, using prompt templates, tool use, a Chroma embedding database, and LangChain.
11 · Updated last month

Alternatives and similar repositories for Travel-Agent-based-on-Qwen2-RLHF

Users who are interested in Travel-Agent-based-on-Qwen2-RLHF are comparing it to the libraries listed below.
