Adaptive Weight Scheduling for Multi-Objective GRPO in Code Generation. Fixed multi-objective rewards cause reward hacking (short but broken code). Our curriculum approach (correctness first, then gradually adding efficiency and brevity) preserves 81.7% on HumanEval while generating 11% shorter code.
☆49Apr 14, 2026Updated last month
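The curriculum idea in the description (reward correctness alone at first, then phase in efficiency and brevity) can be illustrated with a minimal sketch. This is not the repository's code: the function names `curriculum_weights` and `combined_reward`, the linear ramp, and the parameters `warmup_frac` and `max_aux` are all assumptions chosen for illustration.

```python
def curriculum_weights(step, total_steps, warmup_frac=0.3, max_aux=0.2):
    """Return (w_correct, w_efficiency, w_brevity) for a training step.

    Hypothetical schedule: only correctness is rewarded during warmup,
    then the two auxiliary weights ramp linearly up to max_aux each.
    """
    warmup = int(total_steps * warmup_frac)
    if step < warmup:
        aux = 0.0  # correctness-only phase
    else:
        # Fraction of the post-warmup phase completed, clamped to 1.
        progress = (step - warmup) / max(1, total_steps - warmup)
        aux = max_aux * min(1.0, progress)
    # Weights always sum to 1; correctness keeps the largest share.
    return 1.0 - 2 * aux, aux, aux


def combined_reward(correct, efficiency, brevity, step, total_steps):
    """Scalar reward as a weighted sum of per-objective scores in [0, 1]."""
    w_c, w_e, w_b = curriculum_weights(step, total_steps)
    return w_c * correct + w_e * efficiency + w_b * brevity
```

Under these assumptions, a short but broken completion (`correct=0`) earns zero reward during warmup and at most `2 * max_aux` afterwards, so brevity alone cannot outscore a correct solution.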
Alternatives and similar repositories for adaptive-mogrpo
Users that are interested in adaptive-mogrpo are comparing it to the libraries listed below.
- Cross-platform Game Engine | Based on Cocos Creator☆83Jan 19, 2026Updated 4 months ago
- The code for the Trans4trade Timeseries NeuroNet☆52Oct 16, 2025Updated 7 months ago
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c…☆105Mar 17, 2025Updated last year
- End-to-end MTCNN face detection and alignment workflow with reproducible PyTorch implementation.☆31Jan 11, 2026Updated 4 months ago
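- PrerenderShield is an enterprise-grade web application middleware that combines firewall security protection with prerendering, designed to solve the pain points of publishing websites under a decoupled front-end/back-end architecture. Existing firewall products (such as SafeLine, 雷池) do not support prerendering, while prerendering products (such as Rendertron) lack firewall capabilities; PrerenderShield…☆45Mar 23, 2026Updated 2 months ago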
- CAPTCHA recognition☆12Aug 24, 2022Updated 3 years ago
- MCP Server for Codex CLI integration - stateful code writing and review workflows☆164Feb 23, 2026Updated 3 months ago
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆33Oct 5, 2025Updated 8 months ago
- ☆23Jan 16, 2026Updated 4 months ago
- UpTop is a BNB Chain-based liquidity protocol that allows users to unilaterally add BNB to liquidity pools, earn high yields, and support…☆76Jun 11, 2025Updated 11 months ago
- [ICLR 2026] Discrete Diffusion Divergence Instruct (DiDi-Instruct)☆153Mar 4, 2026Updated 3 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆1,066Dec 3, 2025Updated 6 months ago
- AI is changing how cyberattacks are launched.☆127Jan 4, 2026Updated 5 months ago
- ☆13Sep 14, 2022Updated 3 years ago
- 🏥 A Modern and Resilient DHIS2 Client with Built-in WHO-DQR Validation for LMICs☆32Oct 26, 2025Updated 7 months ago
- ReLoop: RetailOpt-190 Benchmark and Codebase☆110Apr 29, 2026Updated last month
- ☆84Nov 3, 2025Updated 7 months ago
- Official implementation repository of Holistic Data Schedule☆199Jan 2, 2026Updated 5 months ago
- EmbodyHub☆79Feb 4, 2025Updated last year
- This project uses yolov8 combined with bytetrack to achieve multi-target tracking☆69Oct 17, 2024Updated last year
- AIGC Creative Suite☆202May 12, 2025Updated last year
- ☆101Apr 9, 2025Updated last year
- ☆48Nov 11, 2025Updated 6 months ago
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆1,053Aug 6, 2025Updated 10 months ago
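- Spring project: a personalized navigation service that supports configurable weights for time, price, and distance, and updates road conditions and estimated arrival times based on the driving status of a large number of users☆22Apr 24, 2025Updated last year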
- AI Phone Agent: A starter kit to build AI agents that answer real phone calls and talk to customers in real time (OpenAI Realtime). Node.…☆103Apr 18, 2026Updated last month
- A self-prior point cloud reconstruction model for tree crown volume calculation, 3D reconstruction of tree crowns, and tree crown project…☆53Nov 3, 2025Updated 7 months ago
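- Scrapes short reviews of anime series on Bilibili, segments them with jieba, and displays them with wordcloud. Because sometimes a high rating doesn't mean the anime is actually good.☆25Mar 8, 2024Updated 2 years ago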
- [AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic al…☆145Nov 19, 2025Updated 6 months ago
- ☆29May 3, 2025Updated last year
- [3DV 2026] Reflect3r: Single-View 3D Stereo Reconstruction Aided by Mirror Reflections☆96Mar 21, 2026Updated 2 months ago
- Focus on Linux C2. The open source part is reverse shell management.☆665Aug 16, 2025Updated 9 months ago
- Image Stitching with perspective transformation in python☆19Jul 3, 2024Updated last year
- The core protocol checks the inference proof and distributes $HYPT as rewards to one of the Hyper Nodes that completed the AI inference t…☆53Feb 4, 2026Updated 4 months ago
- Transfer Optimization System for Black-box Optimization☆250Nov 21, 2024Updated last year
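- This project is a B2C e-commerce platform built on the Golang Gin framework. It uses an MVC (Model-View-Controller) architecture for modular design, can be extended to a decoupled front end and back end, and supports back-office product management, a user system, order transactions, payment integration, data analytics, and other features, systematically…☆1,040Oct 12, 2025Updated 7 months ago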
- This repository is used to record some simulation-implemented solutions, mainly covering areas such as post-quantum cryptography, zer…☆135Apr 20, 2026Updated last month
- [CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆3,698May 18, 2026Updated 3 weeks ago
- This is the code for Visual Reasoning Sequential Attack, which is a method to jailbreak Multimodal Large Language Models Based on their v…☆64Mar 16, 2026Updated 2 months ago