NJUxlj / Travel-Agent-based-on-Qwen2-RLHFView external linksLinks
A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using the response. A RAG system is build upon the tuned qwen2, using Prompt-Template + Tool-Use + Chroma embedding database + LangChain
☆56Nov 14, 2025Updated 3 months ago
Alternatives and similar repositories for Travel-Agent-based-on-Qwen2-RLHF
Users that are interested in Travel-Agent-based-on-Qwen2-RLHF are comparing it to the libraries listed below
Sorting:
- 基于Qwen2+SFT+DPO的医疗问答系统,项目中使用了自定义的 SFTTrainer/DPOTrainer/TRPOTrainer用于训练,其次,项目还调用各种知识库工具(neo4j, milvus, LDA, 等)进行自动化训练数据生成。另外,使用 vllm 用于推理…☆60Jan 4, 2026Updated last month
- 2024CCF国际AIOps挑战赛-赛道二(GLM4):基于检索增强的运维知识问答挑战赛解决方案分享。☆14Jul 5, 2024Updated last year
- 基于Qwen2.5模型、使用DISC-Law-SFT-Pair数据集微调的法律大模型☆10Dec 29, 2024Updated last year
- CFT-RAG: An Entity Tree Based Retrieval Augmented Generation Algorithm With Cuckoo Filter☆22May 28, 2025Updated 8 months ago
- 基于Langchain的学术论文RAG知识库系统☆16Sep 25, 2024Updated last year
- Yet Another Papers With Code☆35Sep 7, 2025Updated 5 months ago
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆31Nov 7, 2025Updated 3 months ago
- 用大模型批量处理数据,现支持各种大模型做OCR,支持通义千问, 月之暗面, 百度飞桨OCR, OpenAI 和LLAVA。Use LLM to generate or clean data for academic use. Support OCR with qwen, m…☆16Sep 15, 2024Updated last year
- ☆11Updated this week
- 多任务学习MMOE和PLE☆39Sep 8, 2021Updated 4 years ago
- 这是一份集成了RAG和微调以及思维链的LLM应用!最近也结合了知识图谱以及智能体agent~后续还会有很多更新!☆18Oct 12, 2024Updated last year
- 记录☆19Nov 29, 2025Updated 2 months ago
- ☆26May 11, 2025Updated 9 months ago
- Claude code 镜像 / Claude API 的二次分发反向代理服务器,可以分发为多个key,同时转换给CC或者任何Anthropic/OpenAI API兼容应用使用☆40Sep 1, 2025Updated 5 months ago
- 基于 LangChain 生态与混合检索技术构建的智能化学术研究辅助平台,旨在提供高效、精准的文献深度分析能力。系统支持 PDF/TXT/DOCX 多格式学术文献上传,通过 BGE-Small-ZH 向量嵌入模型与 BM25 关键词检索融合的混合检索策略,实现跨文档语义关联…☆15Aug 28, 2025Updated 5 months ago
- 2024百度商业AI技术创新大赛赛道一:基于大模型的广告检索全国一等奖获奖方案☆17Feb 23, 2025Updated 11 months ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆63Dec 12, 2024Updated last year
- ☆29Feb 27, 2025Updated 11 months ago
- ☆28Oct 14, 2024Updated last year
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 8 months ago
- A simple WeChat Official Account layout tool based on Dify☆16Jun 27, 2025Updated 7 months ago
- code for piccolo embedding model from SenseTime☆145May 21, 2024Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32May 29, 2024Updated last year
- Write the database metadata into the dify knowledge☆12Dec 30, 2025Updated last month
- 本网站时一个 网上鲜花销售系统,来源于本人的毕业设计项目,开发目的是旨在为用户提供便捷、高效的在线鲜花购买服务,同时帮助商家高效管理订单和库存。系统具备完善的功能,包括但不限于鲜花展示模块、鲜花信息模块,能让顾客清晰浏览各类鲜花的品种、颜色、价格、花语等详细信息。拥有便捷的购…☆17Dec 2, 2024Updated last year
- Workflow automation, but you just describe what you want and it happens.☆26Nov 22, 2025Updated 2 months ago
- ☆11Aug 29, 2025Updated 5 months ago
- 100 Production-Ready Claude Code Skills - The most comprehensive collection of AI skills for sales, business automation, content creation…☆35Oct 22, 2025Updated 3 months ago
- ☆28Dec 4, 2025Updated 2 months ago
- ☆39Feb 9, 2026Updated last week
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆25Jan 6, 2026Updated last month
- Maximizing the Performance of a Simple RAG using RL☆90Mar 20, 2025Updated 10 months ago
- ☆41Apr 11, 2025Updated 10 months ago
- A multi-agent framework to help with your homework.☆10Mar 1, 2025Updated 11 months ago
- dify 知识库检索工具☆13Apr 3, 2025Updated 10 months ago
- 2020湖南省第一届人工智能大赛参赛作品☆11Feb 17, 2022Updated 4 years ago