A travel agent based on Qwen2.5, fine-tuned by SFT + DPO/PPO/GRPO using traveling question-answer dataset, a mindmap can be output using the response. A RAG system is build upon the tuned qwen2, using Prompt-Template + Tool-Use + Chroma embedding database + LangChain
☆72Apr 16, 2026Updated 3 weeks ago
Alternatives and similar repositories for Travel-Agent-based-on-Qwen2-RLHF
Users that are interested in Travel-Agent-based-on-Qwen2-RLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 基于Qwen2+SFT+DPO的医疗问答系统,项目中使用了自定义 的 SFTTrainer/DPOTrainer/TRPOTrainer用于训练,其次,项目还调用各种知识库工具(neo4j, milvus, LDA, 等)进行自动化训练数据生成。另外,使用 vllm 用于推理…☆81Apr 29, 2026Updated last week
- 这是一份集成了RAG和微调以及思维链的LLM应用!最近也结合了知识图谱以及智能体agent~后续还会有很多更新!☆18Oct 12, 2024Updated last year
- FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing☆32Sep 9, 2025Updated 8 months ago
- [TCDS] The implementation of SpikingVit for object detection on event-based datasets.☆11Jul 6, 2024Updated last year
- 基于PaddleNLP的对话意图识别☆10Apr 11, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆16Apr 8, 2025Updated last year
- An object detection model for NMNIST larger video frame☆12Feb 24, 2022Updated 4 years ago
- Fast instruction tuning with Llama2☆11Apr 8, 2024Updated 2 years ago
- Implementation code for: Semantic Segmentation and Depth Estimation with RGB and DVS Sensor Fusion for Multi-view Driving Perception, Pro…☆10Apr 12, 2026Updated 3 weeks ago
- 本项目对Deepseek-R1-Distill-Qwen-7B进行心理咨询CoT数据的LoRA微调,以进一步提升Deepseek-R1-Distill-Qwen-7B在心理咨询领域的慢思考能力。☆12Mar 11, 2025Updated last year
- Paper Reproduction for "Learning Monocular Dense Depth from Events" | CS4245 Computer Vision by Deep Learning course project☆13Mar 9, 2023Updated 3 years ago
- [ICLR 2026] Beyond a Million Tokens: Benchmarking and Enhancing Long-Term Memory in LLMs☆49Feb 2, 2026Updated 3 months ago
- 音乐类语料的意图识别填槽以及槽值纠错模型☆18Mar 24, 2023Updated 3 years ago
- ☆18Nov 11, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- KG☆14Nov 26, 2022Updated 3 years ago
- 2024CCF国际AIOps挑战赛-赛道二(GLM4):基于检索增强的运维知识问答挑战赛解决方案分享。☆14Jul 5, 2024Updated last year
- [ICML 2024] KnowFormer: Revisiting Transformers for Knowledge Graph Reasoning☆25Sep 20, 2024Updated last year
- ☆15Aug 26, 2024Updated last year
- 对深度学习中的NLP进行解释和代码使用☆64Jan 5, 2024Updated 2 years ago
- MultiModal Rag with Colpali, Milvus and VLM☆15Dec 22, 2024Updated last year
- Retrieval-Augmented Generation System for Cardiovascular Disease Consultation☆17Dec 31, 2024Updated last year
- 基于多图神经网络的领域知识和语法结构融合的中文医疗问询意图识别方法☆17Nov 28, 2022Updated 3 years ago
- ☆10Aug 14, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于 LangChain 生态与混合检索技术构建的智能化学术研究辅助平台,旨在提供高效、精准的文献深度分析能力。系统支持 PDF/TXT/DOCX 多格式学术文献上传,通过 BGE-Small-ZH 向量嵌入模型与 BM25 关键词检索融合的混合检索策略,实现跨文档语义关联…☆20Aug 28, 2025Updated 8 months ago
- “达观杯”长文本智能处理挑战赛。达观数据提供了一批长文本数据和分类信息,希望选手动用自己的智慧,结合当下最先进的NLP和人工智能技术,深入分析文本内在结构和语义信息,构建文本分类模型,实现精准分类。☆10Jul 20, 2018Updated 7 years ago
- TensorFlow: learn and practice☆11Aug 30, 2018Updated 7 years ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
- 本仓库旨在记录和分享我在 LLM 和 Agent 领域的学习历程,并通过实践项目深入理解相关技术。通过从零开始构建基于 LLM 和 Agent 的应用,学习LLM原理和Agent开发经验。☆27Mar 28, 2025Updated last year
- 多任务学习MMOE和PLE☆40Sep 8, 2021Updated 4 years ago
- A large-scale node-classification graph benchmark that brings together both the heterophily and heterogeneity properties of real-world gr…☆39Aug 4, 2025Updated 9 months ago
- 基于深度学习的药品评论情感分析系统,可以自动分析药品评论的情感倾向(积极、中性、消极)。本项目采用 LSTM + BERT 词向量的混合架构,并提供了友好的 Web 界面。☆14Dec 24, 2024Updated last year
- 本项目存放RankMixer复现相关代码☆58Apr 3, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆28Oct 13, 2025Updated 6 months ago
- ☆15Feb 26, 2024Updated 2 years ago
- Qwen GRPO Graph Extraction RL Finetune☆69Apr 2, 2025Updated last year
- ☆11Nov 21, 2024Updated last year
- A curated list of resources dedicated to word segmentation☆12Jan 9, 2019Updated 7 years ago
- 基于Qwen2.5模型、使用DISC-Law-SFT-Pair数据集微调的法律大模型☆12Dec 29, 2024Updated last year
- Codes for paper "Contextual Information and Commonsense Based Prompt for Emotion Recognition in Conversation" published in ECML-PKDD 2022…☆17Jul 6, 2022Updated 3 years ago