zoe-yyx / Awesome-AIAgent-ProtocolLinks
☆18Updated 2 months ago
Alternatives and similar repositories for Awesome-AIAgent-Protocol
Users that are interested in Awesome-AIAgent-Protocol are comparing it to the libraries listed below
Sorting:
- MARFT stands for Multi-Agent Reinforcement Fine-Tuning. This repository implements an LLM-based multi-agent reinforcement fine-tuning fra…☆49Updated last month
- Official implementation of the paper "Chain-of-Experts: When LLMs Meet Complex Operation Research Problems"☆103Updated 5 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆106Updated last year
- The LLMOPT project offers a comprehensive set of resources, including the model, dataset, training framework, and inference code, enablin…☆73Updated 2 months ago
- [ICLR 2025] Official code of "Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization"☆16Updated last year
- A Survey of Personalization: From RAG to Agent☆54Updated this week
- ☆31Updated 8 months ago
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆92Updated last year
- Natural Language Reinforcement Learning☆90Updated 6 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆83Updated 10 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆45Updated 8 months ago
- ☆114Updated 5 months ago
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆64Updated 3 months ago
- ☆28Updated 9 months ago
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.☆70Updated 4 months ago
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆188Updated last year
- ☆144Updated 7 months ago
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆33Updated 9 months ago
- ☆142Updated last week
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆181Updated 3 months ago
- On Memorization of Large Language Models in Logical Reasoning☆69Updated 3 months ago
- ☆58Updated 8 months ago
- ☆147Updated 5 months ago
- ☆38Updated 4 months ago
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆35Updated last week
- e☆38Updated 2 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆110Updated 3 months ago
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'☆118Updated last month
- ☆33Updated 10 months ago
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.☆219Updated last week