zoe-yyx / Awesome-AIAgent-ProtocolLinks
☆14Updated 2 months ago
Alternatives and similar repositories for Awesome-AIAgent-Protocol
Users that are interested in Awesome-AIAgent-Protocol are comparing it to the libraries listed below
Sorting:
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆33Updated 9 months ago
- ☆31Updated 9 months ago
- MARFT stands for Multi-Agent Reinforcement Fine-Tuning. This repository implements an LLM-based multi-agent reinforcement fine-tuning fra…☆43Updated last week
- Implementation of the paper "ReLLa: Retrieval-enhanced Large Language Models for Lifelong Sequential Behavior Comprehension in Recommenda…☆10Updated last year
- ☆98Updated last year
- Natural Language Reinforcement Learning☆89Updated 6 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆81Updated 10 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated 3 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆45Updated 8 months ago
- ☆31Updated 7 months ago
- ☆58Updated 7 months ago
- Yelp Simulator for WWW'25 AgentSociety Challenge☆80Updated 2 months ago
- Baseline for NeurIPS_Auto_Bidding_General_Track☆33Updated 10 months ago
- A Survey of Personalization: From RAG to Agent☆47Updated 2 months ago
- Kuaishou Online RL Benchmark☆18Updated last year
- ☆29Updated 9 months ago
- ☆49Updated 8 months ago
- A general framework for bridging LLMs and recommendation systems via reinforcement learning. https://arxiv.org/pdf/2503.24289☆93Updated 3 weeks ago
- ☆114Updated 5 months ago
- ☆59Updated 6 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆105Updated last year
- ☆27Updated 8 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆62Updated 2 months ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆14Updated last year
- code for RIM☆22Updated 2 years ago
- ☆143Updated 7 months ago
- ☆33Updated 9 months ago
- Recommender systems with large language models (Paper list)☆61Updated last year
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆68Updated 2 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆179Updated 2 months ago