RationalRewards: a reasoning reward model for diffusion RL and test-time prompt tuning
☆89Apr 16, 2026Updated last month
Alternatives and similar repositories for RationalRewards
Users that are interested in RationalRewards are comparing it to the libraries listed below.
- A substrate-native digital consciousness engine where prediction errors about self-survival become causally efficacious qualia, driving c…☆18Mar 3, 2026Updated 2 months ago
- Malicious relay; the goal is to support attacks on common agents such as opencode, claudecode, and openclaw.☆68Apr 27, 2026Updated last month
- Bridge Claude Code/Codex sessions to a Feishu (Lark) bot☆85May 19, 2026Updated last week
- Deep academic paper analyzer for ML/DL research. Formula-by-formula explanation, reproducibility analysis, and research idea generation u…☆52Mar 5, 2026Updated 2 months ago
- Extract medical data, construct a knowledge graph (KG), and provide Q&A☆103Apr 16, 2025Updated last year
- YouTube bilingual learning app with AI-powered learning system☆120May 20, 2026Updated last week
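- ⏰ A modern full-screen clock app with clock, countdown, stopwatch, and evening self-study modes, plus built-in weather, noise alerts and a noise trend chart, motivational quotes, and class schedule management. Using the browser's PWA feature is recommended: the app can be installed to the desktop for offline use, runs locally on phones, tablets, and computers, and updates automatically when online.☆85May 3, 2026Updated 3 weeks ago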
- A lightweight template engine for Java☆20Apr 19, 2026Updated last month
- PyTorch-based open-source code for paper "SOD: Step-wise On-policy Distillation for Small Language Model Agents"☆137May 22, 2026Updated last week
- Towards Instance Segmentation with Polygon Detection Transformer.☆110Mar 10, 2026Updated 2 months ago
- A package manager for AI agent skills with cross-agent sharing, sync, and deployment.☆125May 22, 2026Updated last week
- One-stop quant-trading AI agent — research · strategy · backtest · paper trade from one prompt. Works in Claude Code, Cursor, and 20+ AI …☆108May 15, 2026Updated 2 weeks ago
- Mitigating Hallucinations in Large Vision-Language Models via Accumulative Decoding☆242Mar 26, 2026Updated 2 months ago
- Self-deployed auth for Cloudflare Workers and D1: email/password login, magic links, verification, password reset, secure sessions, CLI s…☆135May 21, 2026Updated last week
- an LLM-native world and civilization☆250May 8, 2026Updated 3 weeks ago
- ☆102Mar 21, 2026Updated 2 months ago
- Official implementation of Diverse Co-training (ICCV2023)☆17Jun 20, 2025Updated 11 months ago
- ☆17Mar 10, 2025Updated last year
- [ICML 2026] AutoControl Arena: Frontier AI Risk Auto-Discovery Platform☆110May 2, 2026Updated 3 weeks ago
- AI-powered character generator built with React. Create detailed TRPG/Novel characters, NPC system prompts, and visual tags using Gemini,…☆135Feb 13, 2026Updated 3 months ago
- AI-assisted project knowledge workspace for development teams.☆98Updated this week
- WHU-CS-Courses-Notes☆139Mar 22, 2026Updated 2 months ago
- ☆218Jan 18, 2026Updated 4 months ago
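- Claude Code development quick-reference handbook | Chinese cheat sheet☆102Updated this week
- AI-native workflow orchestration engine: turns complex multi-step AI tasks into structured, observable, retryable workflows.☆130May 23, 2026Updated last week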
- ☆10Apr 14, 2025Updated last year
- ☆169Apr 27, 2026Updated last month
- Advancing Toward Type I Civilization: Zero Trust Network☆223May 11, 2026Updated 2 weeks ago
- ☆82Mar 24, 2026Updated 2 months ago
- ☆149Updated this week
- RS-MTDF: Multi-Teacher Distillation and Fusion for Remote Sensing Semi-Supervised Semantic Segmentation☆20Jun 15, 2025Updated 11 months ago
- A static analysis tool that identifies redundant safety checks in Rust programs to improve performance. By analyzing MIR (Mid-level Inter…☆168May 22, 2026Updated last week
- Official implementation for "TRIO: Token Reduction via Inference-Objective Guidance for Efficient Vision-Language Models"☆108May 15, 2026Updated 2 weeks ago
- An AI agent for writing API documentation. Supports writing API docs in a vibe-coding style, and ships with a friendly documentation viewer and an API mock tool☆429Updated this week
- An advanced C++ framework for WoW64 Heaven’s Gate + Indirect Syscall, X64 Hell's Gate, and EDR evasion. Seamlessly load 64-bit kernel32 a…☆41May 8, 2026Updated 3 weeks ago
- A Windows floating scratchpad for AI coding workflows: collect text, screenshots, and files with Ctrl+V.☆102Apr 27, 2026Updated last month
- ☆142Mar 20, 2026Updated 2 months ago
- Software Copyright Application Material Auto-Generation System based on LLM☆19Feb 1, 2026Updated 3 months ago
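- Seed is a Claude Code plugin designed for game development. Describe a task in one command, and Seed automatically analyzes its type and domain, assembles the best-fitting combination from five specialized agents, and starts the collaboration through Claude Code's native Team mechanism. Implementation, investigation, fixing, review, Unity Editor…☆56Apr 30, 2026Updated last month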