ziyuwan/ReMA-public

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ziyuwan/ReMA-public)

ziyuwan / ReMA-public

Reinforced Multi-LLM Agents training

☆86

Alternatives and similar repositories for ReMA-public

Users that are interested in ReMA-public are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Lux0926 / ASPRM
View on GitHub
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
☆10Mar 2, 2025Updated last year
Jielin-Qiu / MMWatermark-Robustness
View on GitHub
Evaluating Durability: Benchmark Insights into Multimodal Watermarking
☆12Jun 7, 2024Updated 2 years ago
SaFo-Lab / MetaAgent
View on GitHub
Offical Repository of MetaAgent Program
☆52Dec 2, 2025Updated 6 months ago
XuandongZhao / pf-decoding
View on GitHub
[ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs
☆19Mar 20, 2025Updated last year
allenai / sso
View on GitHub
Repository for Skill Set Optimization
☆14Jul 26, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
dengmengjie / ToolScope
View on GitHub
Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
☆30Nov 4, 2025Updated 7 months ago
csitfun / ConTRoL-dataset
View on GitHub
Dataset for AAAI paper "Natural Language Inference in Context - Investigating Contextual Reasoning over Long Texts"
☆11Nov 18, 2022Updated 3 years ago
jidiai / olympics_engine
View on GitHub
A simple 2D ball collision engine.
☆12Jun 15, 2023Updated 3 years ago
Zillwang / StepSearch
View on GitHub
EMNLP MAIN 2025 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization
☆72Sep 13, 2025Updated 9 months ago
THU-BPM / Watermarked_LLM_Identification
View on GitHub
Code and data for paper "Can Watermarked LLMs be Identified by Users via Crafted Prompts?" Accepted by ICLR 2025 (Spotlight)
☆28Dec 28, 2024Updated last year
wolfdroid / x86_Chess
View on GitHub
Classic Chess game using x86 Assembly Language
☆11Apr 23, 2019Updated 7 years ago
jwliao-ai / MARFT
View on GitHub
☆83May 14, 2026Updated last month
miniHuiHui / SimpleRL-reason-GRPO
View on GitHub
☆12Feb 27, 2025Updated last year
backprop07 / Self-Certainty
View on GitHub
Implementation of self-certainty as an extention of ZeroEval Project
☆36May 31, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
NY1024 / RACE
View on GitHub
☆27Mar 17, 2025Updated last year
tim-lawson / skip-middle
View on GitHub
Learning to Skip the Middle Layers of Transformers
☆17Aug 7, 2025Updated 10 months ago
lichengliu03 / unary-feedback
View on GitHub
☆44Mar 31, 2026Updated 2 months ago
hkust-nlp / mstar
View on GitHub
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆75Jul 13, 2025Updated 11 months ago
princeton-nlp / unintentional-unalignment
View on GitHub
[ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
☆32Jan 7, 2026Updated 5 months ago
hurunyi / VideoShield
View on GitHub
[ICLR 2025] VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking (Official Implementation)
☆56May 30, 2025Updated last year
chenyiqun / Agentic-RAG
View on GitHub
This is the code of a agentic rag method with dynamic workflow.
☆14Jan 22, 2026Updated 4 months ago
zhipeng-wei / EmojiAttack
View on GitHub
Emoji Attack [ICML 2025]
☆44Jul 15, 2025Updated 11 months ago
lukahhcm / Awesome_Environment_Scaling
View on GitHub
Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …
☆70Jan 28, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Tongyi-CCAI / Complex-IF
View on GitHub
☆33Jan 26, 2026Updated 4 months ago
JingyangYi / ShorterBetter
View on GitHub
☆17Jul 31, 2025Updated 10 months ago
THU-BPM / unforgeable_watermark
View on GitHub
[ICLR 2024] Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models"
☆34May 23, 2024Updated 2 years ago
ashutosh1919 / data2vec-pytorch
View on GitHub
Ready to run PyTorch implementation of Data2Vec 2.0: Highly efficient self-supervised representation learning for vision, speech and text…
☆16Mar 29, 2023Updated 3 years ago
Fu-Dayuan / PreAct
View on GitHub
PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)
☆30Dec 12, 2024Updated last year
sheryc / resonance_rope
View on GitHub
[ACL 24 Findings] Implementation of Resonance RoPE and the PosGen synthetic dataset.
☆24Mar 5, 2024Updated 2 years ago
zsgvivo / VideoZoomer
View on GitHub
☆34Feb 12, 2026Updated 4 months ago
CubasMike / plagiarism_detection_pan2015
View on GitHub
Plagiarism Detection Approach for PAN 2015 Text Alignment task
☆11May 11, 2018Updated 8 years ago
INK-USC / PE2
View on GitHub
Code for paper "Prompt Engineering a Prompt Engineer" (https://arxiv.org/abs/2311.05661)
☆12Aug 1, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
THU-BPM / Robust_Watermark
View on GitHub
[ICLR 2024] Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models".
☆38Nov 13, 2024Updated last year
TIGER-AI-Lab / GenAI-Arena
View on GitHub
Interface for GenAI-Arena [NeurIPS24]
☆17Feb 27, 2024Updated 2 years ago
raj-sahu / Sahi_Hai
View on GitHub
Chrome Extension to detect Malicious Websites
☆11May 29, 2024Updated 2 years ago
CogComp / Salient-Event-Detection
View on GitHub
The repository for the paper "Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection"
☆10Jul 5, 2022Updated 3 years ago
microsoft / DKI_LLM
View on GitHub
This is a repository for DKI group concerning the LLM-related papers alongside with code.
☆40May 20, 2026Updated last month
S-Abdelnabi / awt
View on GitHub
Code for our S&P'21 paper: Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding
☆53Nov 15, 2022Updated 3 years ago
RUC-NLPIR / ARPO
View on GitHub
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
☆1,048Apr 13, 2026Updated 2 months ago