tengwang0318/hierarchial_reward_model

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tengwang0318/hierarchial_reward_model)

tengwang0318 / hierarchial_reward_model

[ACL2026 Findings] "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"

☆20

Alternatives and similar repositories for hierarchial_reward_model

Users that are interested in hierarchial_reward_model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Visual-AI / Category-Discovery
View on GitHub
[ArXiv2025] Category Discovery: An Open-World Perspective
☆15Mar 17, 2026Updated 4 months ago
Visual-AI / SEAL
View on GitHub
[NeurIPS 2025] SEAL: Semantic-Aware Hierarchical Learning for Generalized Category Discovery
☆16Apr 4, 2026Updated 3 months ago
Visual-AI / HiLo
View on GitHub
[ICLR2025] HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts
☆22Aug 1, 2025Updated 11 months ago
zhenqi-he / transnuseg
View on GitHub
MICCAI2023 - TransNuSeg: A Lightweight Multi-Task Transformer for Nuclei Segmentation
☆26Feb 5, 2024Updated 2 years ago
LiuChuang0059 / StructMAE
View on GitHub
Code for IJCAI'24 paper: Where to Mask: Structure-Guided Masking for Graph Masked Autoencoders
☆14Apr 30, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sgvaze / SSB
View on GitHub
Python package to download and use the SSB datasets
☆11Aug 3, 2023Updated 2 years ago
Visual-AI / Dissect-OOD-OSR
View on GitHub
[IJCV 2024] Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks
☆15Aug 30, 2024Updated last year
charlesXu86 / Chatbot_Doc
View on GitHub
Chatbot_CN项目的Chatbot_Doc模块
☆19May 17, 2020Updated 6 years ago
zhaoxlpku / HKU-DASC7606-A2
View on GitHub
☆17Mar 28, 2024Updated 2 years ago
flowersteam / EAGER
View on GitHub
☆10Oct 11, 2022Updated 3 years ago
Visual-AI / DebGCD
View on GitHub
[ICLR 2025] DebGCD: Debiased Learning with Distribution Guidance for Generalized Category Discovery
☆16Sep 27, 2025Updated 10 months ago
MangoKiller / SimOAR_OAR
View on GitHub
☆11Nov 8, 2023Updated 2 years ago
Arnav-Gr0ver / ICAIF_FinRL-2024
View on GitHub
Code for the paper "FinRLlama: A Solution to LLM-Engineered Signals Challenge at FinRL Contest 2024"
☆12Feb 14, 2025Updated last year
zongqianwu / ST-COT
View on GitHub
(ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training
☆13Feb 15, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
paolomandica / HALO
View on GitHub
Official PyTorch implementation of the ICML 2024 paper "Hyperbolic Active Learning for Semantic Segmentation under Domain Shift"
☆25Nov 26, 2024Updated last year
GraphPKU / CoI
View on GitHub
Chain of Images for Intuitively Reasoning
☆10Nov 29, 2023Updated 2 years ago
tabzhangjx / MixupExplainer
View on GitHub
☆10Jun 11, 2023Updated 3 years ago
laurabravo97 / dynamic_functional_connectivity_analysis
View on GitHub
Python script to obtain dynamic functional connectivity metrics, after using a sliding window approach, statistical analyses to test for …
☆12Sep 10, 2024Updated last year
ALT-JS / OthelloSAE
View on GitHub
CS194-196 Course Project
☆14Feb 20, 2025Updated last year
ranlongyu / pycloudsim
View on GitHub
云任务调度仿真平台
☆13Mar 11, 2020Updated 6 years ago
CECNL / XBrainLab
View on GitHub
We introduce XBrainLab, an open-source user-friendly software, for accelerated interpretation of neural patterns from EEG data based on c…
☆14Dec 5, 2025Updated 7 months ago
Visual-AI / JoVA
View on GitHub
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
☆33Dec 22, 2025Updated 7 months ago
IBM / transformers-struct-guidance
View on GitHub
Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"
☆15Sep 17, 2025Updated 10 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
5a7man / eeg_fConn
View on GitHub
Python library to compute functional connectivity measures from EEG
☆12Oct 14, 2023Updated 2 years ago
DaBiGu / Bi_Gu-bot
View on GitHub
Personal QQ chatbot based on Nonebot2 and NapCatQQ
☆12Updated this week
lemon-little / BetterSynth
View on GitHub
天池Better Synth多模态大模型数据合成挑战赛-打赢baseline就算成功方案
☆30Oct 30, 2025Updated 8 months ago
snu-larr / ibc_official
View on GitHub
Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)
☆10Jul 6, 2023Updated 3 years ago
Chen-GX / SEER
View on GitHub
☆15Feb 10, 2025Updated last year
leonardosposina / mt5-distance-from-moving-average
View on GitHub
MetaTrader 5 indicator that measures the largest distance between a price (high or low) and a moving average.
☆11Oct 9, 2020Updated 5 years ago
Hoyyyaard / NavGPT
View on GitHub
☆10Nov 16, 2023Updated 2 years ago
OpenCSGs / Awesome-SLMs
View on GitHub
survery of small language models
☆18Jul 23, 2024Updated 2 years ago
Visual-AI / PromptCCD
View on GitHub
[ECCV2024] PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery
☆31Apr 3, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
wsysx123 / AliyunContest2020
View on GitHub
首届阿里云弹性计算挑战赛云资源调度赛道作品
☆15Mar 31, 2021Updated 5 years ago
mansicer / self-verification
View on GitHub
☆18Dec 23, 2025Updated 7 months ago
zzh-SJTU / CRT-QA
View on GitHub
The official data and code for EMNLP 2023 main conference paper: CRT-QA: A Dataset of Complex Reasoning Question Answering over Tabular D…
☆13May 19, 2025Updated last year
ShanghaitechGeekPie / LatexTemplate
View on GitHub
上海科技大学非官方Latex模版库
☆16Apr 12, 2018Updated 8 years ago
WilliamZR / ProTrix
View on GitHub
Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context
☆17Nov 15, 2024Updated last year
nzjin / awesome_moe
View on GitHub
The collections of MOE (Mixture Of Expert) papers, code and tools, etc.
☆12Mar 15, 2024Updated 2 years ago
935963004 / PhysioOmni
View on GitHub
☆16Oct 19, 2025Updated 9 months ago