DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
☆34Feb 26, 2025Updated last year
Alternatives and similar repositories for DuoGuard
Users that are interested in DuoGuard are comparing it to the libraries listed below.
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆59Apr 6, 2025Updated last year
- ☆18Jul 25, 2025Updated 11 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆157Dec 24, 2024Updated last year
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆40Jan 16, 2026Updated 5 months ago
- BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing☆59Mar 11, 2024Updated 2 years ago
- ☆29May 4, 2024Updated 2 years ago
- Official repository for ICLR 2025 paper "Amulet: ReAlignment During Test Time for Personalized Preference Adaptation of LLMs"☆20Mar 18, 2025Updated last year
- Code base for internal reward models and PPO training☆24Oct 1, 2023Updated 2 years ago
- ☆14Apr 16, 2024Updated 2 years ago
- ☆16Mar 11, 2022Updated 4 years ago
- ☆40Oct 2, 2024Updated last year
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆118Mar 28, 2026Updated 3 months ago
- Short-video content understanding and recommendation competition☆12Feb 18, 2019Updated 7 years ago
- This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023).☆15Nov 22, 2023Updated 2 years ago
- ☆12May 21, 2019Updated 7 years ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆42Feb 7, 2026Updated 4 months ago
- [NeurIPS 2024] Can Language Models Learn to Skip Steps?☆21Jan 25, 2025Updated last year
- A from-scratch, framework-free Python implementation of a convolutional neural network☆13Aug 24, 2020Updated 5 years ago
- Official Code Repository for the paper "Key-value memory in the brain"☆32Feb 25, 2025Updated last year
- ☆16Jul 25, 2022Updated 3 years ago
- Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning☆31Sep 29, 2025Updated 9 months ago
- The Ever-Evolving Science Exam☆52Jan 18, 2026Updated 5 months ago
- ☆20Dec 14, 2024Updated last year
- [ICLR 2025] Pad: Personalized alignment of llms at decoding-time☆20Mar 19, 2025Updated last year
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Jul 19, 2025Updated 11 months ago
- ☆10Jul 4, 2024Updated last year
- ☆20May 16, 2024Updated 2 years ago
- Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)☆22Oct 17, 2022Updated 3 years ago
- [NeurIPS'23] Binary Classification with Confidence Difference☆10May 13, 2024Updated 2 years ago
- [ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models☆20Jul 17, 2024Updated last year
- ☆11Mar 31, 2022Updated 4 years ago
- This is the official implementation of our paper 'Untargeted Backdoor Watermark: Towards Harmless and Stealthy Dataset Copyright Protecti…☆58May 1, 2026Updated 2 months ago
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- LiveSecBench: a dynamic safety leaderboard for Chinese large language models☆28Mar 9, 2026Updated 3 months ago
- Data augmentation using OpenCV☆11Jan 12, 2017Updated 9 years ago
- ☆12Oct 23, 2022Updated 3 years ago
- [ICML 2023] Protecting Language Generation Models via Invisible Watermarking☆13Sep 8, 2023Updated 2 years ago
- ☆52Mar 31, 2026Updated 3 months ago
- ☆11Mar 20, 2023Updated 3 years ago