GHupppp / InteractiveMemorySharingLLMLinks
☆17Updated 8 months ago
Alternatives and similar repositories for InteractiveMemorySharingLLM
Users that are interested in InteractiveMemorySharingLLM are comparing it to the libraries listed below
Sorting:
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆21Updated 8 months ago
- ☆71Updated 9 months ago
- ☆47Updated 2 weeks ago
- ☆24Updated 9 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆93Updated 2 weeks ago
- ☆49Updated 3 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆23Updated last year
- ☆41Updated this week
- ☆29Updated last year
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation☆22Updated 3 weeks ago
- MARFT stands for Multi-Agent Reinforcement Fine-Tuning. This repository implements an LLM-based multi-agent reinforcement fine-tuning fra…☆43Updated last week
- ☆43Updated 8 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆108Updated 8 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆59Updated 4 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆45Updated last year
- ☆103Updated 6 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆65Updated last month
- ☆59Updated last month
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆95Updated 2 weeks ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- The official repo for the code and data of paper SMART☆26Updated 4 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated last month
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆28Updated 6 months ago
- A trainable user simulator☆34Updated 9 months ago
- ☆67Updated 3 weeks ago
- This is the code of MMOA-RAG.☆53Updated last month
- ☆66Updated 3 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness☆72Updated last week
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year