RationalRewards: a reasoning reward model for diffusion RL and test-time prompt tuning
☆91Jun 4, 2026Updated 2 weeks ago
Alternatives and similar repositories for RationalRewards
Users that are interested in RationalRewards are comparing it to the libraries listed below.
- A substrate-native digital consciousness engine where prediction errors about self-survival become causally efficacious qualia, driving c…☆19Mar 3, 2026Updated 3 months ago
- A malicious relay that aims to support attacks on common agents such as opencode, claudecode, and openclaw.☆72Apr 27, 2026Updated last month
- Bridge Claude Code/Codex sessions to a Feishu (Lark) bot☆86May 19, 2026Updated last month
- Extract medical data, construct a knowledge graph (KG), and provide Q&A☆103Apr 16, 2025Updated last year
- Deep academic paper analyzer for ML/DL research. Formula-by-formula explanation, reproducibility analysis, and research idea generation u…☆52Mar 5, 2026Updated 3 months ago
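- ⏰ A modern full-screen clock app with clock, countdown, stopwatch, and evening self-study modes, plus built-in weather, noise alerts and noise trend charts, motivational quotes, and class schedule management. Installing it to the desktop through the browser's PWA feature is recommended for offline use on phones, tablets, and computers; it updates automatically when online.☆70May 3, 2026Updated last month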
- A lightweight template engine for Java☆20Apr 19, 2026Updated 2 months ago
- YouTube bilingual learning app with AI-powered learning system☆293Updated this week
- Towards Instance Segmentation with Polygon Detection Transformer.☆110Mar 10, 2026Updated 3 months ago
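- 1688 Marketing Skill: helps merchants with marketing operations such as signing up for merchant-recruitment campaigns and viewing business-opportunity recommendations. Core tool capabilities: campaign lookup, suggested product price lookup, campaign sign-up submission, and business-opportunity recommendation lookup. Trigger words: sign up for campaign, merchant-recruitment campaign, look up campaigns, submit, sign up, campaign sign-up, view suggested price, business-opportunity recommendations, business opportunities, market opportunities, find business opportunities, look up business opportunities, …☆91May 6, 2026Updated last month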
- PyTorch-based open-source code for paper "SOD: Step-wise On-policy Distillation for Small Language Model Agents"☆142May 22, 2026Updated 3 weeks ago
- Consistency in Diffusion-Based Visual Generation: A Survey☆176Jun 11, 2026Updated last week
- A package manager for AI agent skills with cross-agent sharing, sync, and deployment.☆135May 28, 2026Updated 3 weeks ago
- Mitigating Hallucinations in Large Vision-Language Models via Accumulative Decoding☆231Mar 26, 2026Updated 2 months ago
- One-stop quant-trading AI agent — research · strategy · backtest · paper trade from one prompt. Works in Claude Code, Cursor, and 20+ AI …☆116May 27, 2026Updated 3 weeks ago
- Self-deployed auth for Cloudflare Workers and D1: email/password login, magic links, verification, password reset, secure sessions, CLI s…☆136Jun 11, 2026Updated last week
- an LLM-native world and civilization☆253May 8, 2026Updated last month
- A local control plane for AI agents — see what they do, approve what matters, keep secrets out. Rust + Tauri + Chrome MV3.☆386Jun 11, 2026Updated last week
- Demonstrate once, execute anywhere — secure remote skills for AI agents.☆220Jun 12, 2026Updated last week
- ☆17Mar 10, 2025Updated last year
- Official implementation of Diverse Co-training (ICCV 2023)☆17Jun 20, 2025Updated 11 months ago
- [ICML 2026] AutoControl Arena: Frontier AI Risk Auto-Discovery Platform☆112May 2, 2026Updated last month
- AI-powered character generator built with React. Create detailed TRPG/Novel characters, NPC system prompts, and visual tags using Gemini,…☆137Feb 13, 2026Updated 4 months ago
- WHU-CS-Courses-Notes☆143Mar 22, 2026Updated 2 months ago
- ☆219Jun 3, 2026Updated 2 weeks ago
- Claude Code development quick-reference handbook | Chinese cheat sheet☆103Jun 10, 2026Updated last week
- AI-assisted project knowledge workspace for development teams.☆98May 24, 2026Updated 3 weeks ago
- Complete ETCLOVG framework for AI Agent workflows - DAG+FSM orchestration, Ebbinghaus memory, discipline routing, skill evolution, trace …☆130May 31, 2026Updated 2 weeks ago
- ☆10Apr 14, 2025Updated last year
- ☆171Apr 27, 2026Updated last month
- Advancing Toward Type I Civilization: Zero Trust Network☆270Updated this week
- ☆148May 24, 2026Updated 3 weeks ago
- ☆83Mar 24, 2026Updated 2 months ago
- RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation☆21Jun 15, 2025Updated last year
- A static analysis tool that identifies redundant safety checks in Rust programs to improve performance. By analyzing MIR (Mid-level Inter…☆167May 22, 2026Updated 3 weeks ago
- We can bring your MySQL and the powerful GPT-4o together. With GPT-4o in hand, you can free yourself from writing tiresome SQL queries. In ju…☆104Apr 27, 2026Updated last month
- Official implementation for "TRIO: Token Reduction via Inference-Objective Guidance for Efficient Vision-Language Models" https://arxiv…☆107Jun 3, 2026Updated 2 weeks ago
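- An AI agent for writing API documentation. It supports writing API docs in a vibe-coding style and comes with a friendly documentation viewer and an API mocking tool.☆939May 28, 2026Updated 3 weeks ago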
- An advanced C++ framework for WoW64 Heaven’s Gate + Indirect Syscall, X64 Hell's Gate, and EDR evasion. Seamlessly load 64-bit kernel32 a…☆41May 8, 2026Updated last month