ziyuwan / ReMA-publicView external linksLinks
Reinforced Multi-LLM Agents training
☆70Jan 18, 2026Updated 3 weeks ago
Alternatives and similar repositories for ReMA-public
Users that are interested in ReMA-public are comparing it to the libraries listed below
Sorting:
- Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"☆11Nov 18, 2022Updated 3 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- A simple 2D ball collision engine.☆12Jun 15, 2023Updated 2 years ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Aug 9, 2024Updated last year
- [ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.☆24Mar 5, 2024Updated last year
- ☆45Jan 21, 2026Updated 3 weeks ago
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs☆19Mar 20, 2025Updated 10 months ago
- Offical Repository of MetaAgent Program☆40Dec 2, 2025Updated 2 months ago
- ☆26Mar 17, 2025Updated 10 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Apr 15, 2025Updated 10 months ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆21Dec 10, 2024Updated last year
- ☆32Aug 11, 2025Updated 6 months ago
- BackTime: Backdoor Attacks on Multivariate Time Series Forecasting☆30Apr 14, 2025Updated 10 months ago
- ☆31Jan 26, 2026Updated 3 weeks ago
- Emoji Attack [ICML 2025]☆41Jul 15, 2025Updated 7 months ago
- ☆77Nov 6, 2025Updated 3 months ago
- ☆46Oct 28, 2025Updated 3 months ago
- Enterprise AI Security Platform - Real-time firewall protection for LLM applications against prompt injection, data leakage, and function…☆23Sep 14, 2025Updated 5 months ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆32Jan 7, 2026Updated last month
- ☆43Aug 15, 2025Updated 6 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Dec 12, 2024Updated last year
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆67Mar 27, 2025Updated 10 months ago
- ☆32Jun 5, 2025Updated 8 months ago
- This the implementation of LeCo☆31Jan 20, 2025Updated last year
- MrlX: A Multi-Agent Reinforcement Learning Framework☆190Jan 19, 2026Updated 3 weeks ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆78Aug 17, 2024Updated last year
- ☆12Jan 31, 2024Updated 2 years ago
- [ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system☆105Feb 3, 2026Updated last week
- Repository of IPBench☆19Jan 4, 2026Updated last month
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆43Mar 14, 2024Updated last year
- Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024☆34May 23, 2024Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆96Aug 20, 2024Updated last year
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 3 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆53Aug 28, 2025Updated 5 months ago
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.☆36Nov 13, 2024Updated last year
- Official repository of the paper: Who Wrote this Code? Watermarking for Code Generation (ACL 2024)☆39May 28, 2024Updated last year
- [SCIS] MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆44Nov 19, 2025Updated 2 months ago
- ☆164Jan 21, 2025Updated last year