Adaptive Weight Scheduling for Multi-Objective GRPO in Code Generation. Fixed multi-objective rewards cause reward hacking (short but broken code). Our curriculum approach—correctness first, then gradually adding efficiency/brevity—preserves 81.7% HumanEval while generating 11% shorter code.
☆49Apr 14, 2026Updated 2 weeks ago
Alternatives and similar repositories for adaptive-mogrpo
Users that are interested in adaptive-mogrpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cross-platform Game Engine | Based on Cocos Creator☆83Jan 19, 2026Updated 3 months ago
- The code for the Trans4trade Timeseries NeuroNet☆52Oct 16, 2025Updated 6 months ago
- This repo collects research papers that use AI tools and are in the field of scientific research (including computer science, agronomy, c…☆103Mar 17, 2025Updated last year
- End-to-end MTCNN face detection and alignment workflow with reproducible PyTorch implementation.☆31Jan 11, 2026Updated 3 months ago
- PrerenderShield 是一款集防火墙安全防护与预渲染功能于一体的企业级 Web 应用中间件,专为解决前后端分离架构下网站发布的痛点而设计。现有防火墙产品(如雷池)无法支持预渲染,而预渲染产品(如 Rendertron)缺乏防火墙能力,PrerenderShield…☆45Mar 23, 2026Updated last month
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 验证码识别☆12Aug 24, 2022Updated 3 years ago
- MCP Server for Codex CLI integration - stateful code writing and review workflows☆163Feb 23, 2026Updated 2 months ago
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆33Oct 5, 2025Updated 6 months ago
- UpTop is a BNB Chain-based liquidity protocol that allows users to unilaterally add BNB to liquidity pools, earn high yields, and support…☆75Jun 11, 2025Updated 10 months ago
- ☆23Jan 16, 2026Updated 3 months ago
- [ICLR 2026] Discrete Diffusion Divergence Instruct (DiDi-Instruct)☆153Mar 4, 2026Updated last month
- [3DV 2026] Reflect3r: Single-View 3D Stereo Reconstruction Aided by Mirror Reflections☆51Mar 21, 2026Updated last month
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆1,064Dec 3, 2025Updated 4 months ago
- AI is changing the way how to launch cyberattack.☆126Jan 4, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Sep 14, 2022Updated 3 years ago
- 🏥 A Modern and Resilient DHIS2 Client with Built-in WHO-DQR Validation for LMICs☆32Oct 26, 2025Updated 6 months ago
- ReLoop: RetailOpt-190 Benchmark and Codebase☆108Feb 19, 2026Updated 2 months ago
- ☆83Nov 3, 2025Updated 5 months ago
- Official implementation repository of Holistic Data Schedule☆199Jan 2, 2026Updated 3 months ago
- EmbodyHub☆79Feb 4, 2025Updated last year
- A research-oriented framework for building optimization-driven decision intelligence systems, integrating causal inference, risk modeling…☆154Mar 21, 2026Updated last month
- This project uses yolov8 combined with bytetrack to achieve multi-target tracking☆69Oct 17, 2024Updated last year
- AIGC Creative Suite☆202May 12, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆101Apr 9, 2025Updated last year
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease.☆1,046Aug 6, 2025Updated 8 months ago
- [AAAI 2026 Oral] Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic al…☆143Nov 19, 2025Updated 5 months ago
- ☆48Nov 11, 2025Updated 5 months ago
- Spring项目:支持设置时间、价格、距离权重的个性化导航服务,并支持根据大量用户行驶状态更新道路情况和预计到达时间☆22Apr 24, 2025Updated last year
- A self-prior point cloud reconstruction model for tree crown volume calculation, 3D reconstruction of tree crowns, and tree crown project…☆52Nov 3, 2025Updated 5 months ago
- 爬取b站番剧短评,利用jieba分词,wordcloud展示。因为有的时候评分高并不代表这部动画真的好看。☆25Mar 8, 2024Updated 2 years ago
- ☆28May 3, 2025Updated 11 months ago
- Focus on Linux C2. The open source part is reverse shell management.☆668Aug 16, 2025Updated 8 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Image Stitching with perspective transformation in python☆19Jul 3, 2024Updated last year
- Tansfer Optimization System for Black-box Optimization☆250Nov 21, 2024Updated last year
- The core protocol check the inference proof and distributes $HYPT as rewards to one of the Hyper Nodes that completed the AI inference t…☆53Feb 4, 2026Updated 2 months ago
- [ICLRW 2026 Best Short Paper Award] Visual Exclusivity Attacks: Automatic Multimodal Red Teaming via Agentic Planning☆65Apr 15, 2026Updated last week
- 本项目是一个基于 Golang Gin 框架 开发的 B2C 电商平台,采用 MVC(Model-View-Controller)架构 进行模块化设计,能够扩展为实现前后端分离,支持后台商品管理、用户系统、订单交易、支付集成、数据分析等功能,系统地展…☆1,026Oct 12, 2025Updated 6 months ago
- This repository is used to record some simulation - implemented solutions, mainly covering areas such as post - quantum cryptography, zer…☆135Apr 20, 2026Updated last week
- [CVPR 2026] PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.☆3,669Jan 26, 2026Updated 3 months ago