The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".
☆17Jun 20, 2024Updated last year
Alternatives and similar repositories for gpo
Users that are interested in gpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Feb 11, 2021Updated 5 years ago
- ☆20Oct 15, 2022Updated 3 years ago
- ☆12Dec 9, 2020Updated 5 years ago
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆33Oct 5, 2025Updated 5 months ago
- How Robust are Randomized Smoothing based Defenses to Data Poisoning? (CVPR 2021)☆14Jul 16, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Code for the paper "Deep Partition Aggregation: Provable Defenses against General Poisoning Attacks"☆13Aug 22, 2022Updated 3 years ago
- 👀 VITRina: VIsual Token Representations☆11Jun 15, 2023Updated 2 years ago
- ☆13Jun 4, 2024Updated last year
- ☆16Jul 17, 2022Updated 3 years ago
- zero-vocab or low-vocab embeddings☆18Jul 17, 2022Updated 3 years ago