PKU-RL/AdaRefiner

AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)

☆19

Alternatives and similar repositories for AdaRefiner

Users who are interested in AdaRefiner are comparing it to the libraries listed below.

BeingBeyond / VIPA-VLA
Spatial-Aware VLA Pretraining through Visual-Physical Alignment from Human Videos (CVPR 2026)
☆26 · Dec 16, 2025 · Updated 7 months ago
nuomizai / T2VLM
[ICCV'25] T2-VLM: Training-Free Generation of Temporally Consistent Rewards from VLMs
☆16 · Jul 8, 2025 · Updated last year
BeingBeyond / UniTacHand
UniTacHand: Unified Spatio-Tactile Representation for Human-to-Dexterous-Hand Skill Transfer
☆26 · Dec 25, 2025 · Updated 7 months ago
zawnpn / Steam-Bot_Market
Simple tool to help find good prices on the Steam market.
☆13 · Jul 14, 2020 · Updated 6 years ago
BeingBeyond / Being-M0.5
Being-M0.5: A Real-Time Controllable Vision-Language-Motion Model (ICCV 2025)
☆37 · Sep 4, 2025 · Updated 10 months ago
ai4ce / INT-ACT
Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models
☆33 · Nov 2, 2025 · Updated 8 months ago
OpenCausaLab / ADAM
We introduce ADAM, An emboDied causal Agent in Minecraft, that can autonomously navigate the open world, perceive multimodal contexts, le…
☆33 · Apr 7, 2025 · Updated last year
PKU-RL / MBOM
☆13 · Oct 11, 2022 · Updated 3 years ago
PKU-RL / Creative-Agents
☆50 · Dec 11, 2023 · Updated 2 years ago
gmme1996 / PolynomialFitting
Harbin Institute of Technology machine learning assignment 1: polynomial curve fitting
☆10 · Oct 19, 2016 · Updated 9 years ago
zhoubohan0 / STG-Transformer
[NeurIPS 2023] Official implementation of "Learning from Visual Observation via Offline Pretrained State-to-Go Transformer"
☆17 · Oct 1, 2023 · Updated 2 years ago
tuyunbin / SRDRL
[ACL 2021] This is the PyTorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".
☆13 · Jan 16, 2022 · Updated 4 years ago
sunkevin1214 / codes
☆11 · Aug 16, 2018 · Updated 7 years ago
Gzh0821 / Optimization_project
An optimization-methods experiment that solves the flow-shop scheduling problem using simulated annealing and hill climbing.
☆14 · Mar 7, 2024 · Updated 2 years ago
kiddyboots216 / FedRL
Federated Reinforcement Learning
☆12 · Jun 20, 2019 · Updated 7 years ago
AgentGuo / Backdoor_Attack_LeNet5_MNIST
Backdoor attack on the LeNet-5 network via data poisoning, using the MNIST handwritten-digit dataset
☆14 · Feb 5, 2021 · Updated 5 years ago
Abluceli / H2G-MAAC
Code for the paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …
☆16 · Jul 17, 2021 · Updated 5 years ago
PKU-RL / PTGM
[ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning
☆30 · Mar 1, 2024 · Updated 2 years ago
jiechuanjiang / I2Q
I2Q: A Fully Decentralized Q-Learning Algorithm
☆19 · Nov 10, 2022 · Updated 3 years ago
renatolfc / sched-rl-gym
☆20 · Jul 1, 2026 · Updated 3 weeks ago
daniel-merrick / Learning-from-Simulated-and-Unsupervised-Images-through-Adversarial-Training-SimGAN-PyTorch
PyTorch implementation of 'Learning from Simulated and Unsupervised Images through Adversarial Training'
☆16 · Jun 16, 2020 · Updated 6 years ago
PKU-RL / RoadnetSZ
☆17 · Feb 17, 2023 · Updated 3 years ago
DA2I2-SLM / DAR
Source code for the paper: Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention
☆18 · Apr 16, 2026 · Updated 3 months ago
tuyunbin / NCT
[IEEE TMM 2023] This is the PyTorch code for our paper "Neighborhood Contrastive Transformer for Change Captioning".
☆13 · Aug 30, 2023 · Updated 2 years ago
Eydcao / Yan-CG-SnowSim
My final project, Snow Simulation, for Prof. Lingqi Yan's online open course GAMES101: Intro to Modern Computer Graphics
☆12 · Mar 12, 2021 · Updated 5 years ago
HansenHua / MFPO-INFOCOM24
An online federated reinforcement learning algorithm published at INFOCOM 2024
☆16 · Dec 1, 2024 · Updated last year
doeun-235 / Cucker-Smale-Model
Work on the Cucker-Smale model and its extensions. Keywords: ODE, Runge-Kutta methods, SDE, Euler-Maruyama method, NumPy, Matplotlib
☆12 · Feb 14, 2024 · Updated 2 years ago
uwsbel / low-fidelity-dynamic-models
A library of fast and accurate low-fidelity dynamic models for applications in robotics
☆14 · Jul 12, 2024 · Updated 2 years ago
Scientific-Computing-Lab / MPI-rigen
MPI Code Generation through Domain-Specific Language Models
☆16 · Nov 19, 2024 · Updated last year
lizhemin18 / pymarl_LLM
This repo supports integrating LLMs and communication algorithms with MARL using SMAC as the platform. It provides an end-to-end workflow…
☆20 · Mar 8, 2025 · Updated last year
tensorinfinitysip / a-PyTorch-Project-to-Image-Caption
Image Caption with Attention | a PyTorch Project to Image Caption
☆17 · Jul 14, 2019 · Updated 7 years ago
M3RG-IITD / benchmarking_graph
☆12 · Nov 30, 2022 · Updated 3 years ago
sampleargmax / schedulingattentionmodel
Lightweight code for the paper published in JMS, 'Solving task scheduling problems in cloud manufacturing via attention mechanism and…
☆20 · May 15, 2023 · Updated 3 years ago
agentification / Language-Integrated-VI
☆21 · Apr 12, 2024 · Updated 2 years ago
BeingBeyond / DemoGrasp
DemoGrasp: Universal Dexterous Grasping from a Single Demonstration (ICLR 2026)
☆81 · Feb 14, 2026 · Updated 5 months ago
yuqingd / ellm
☆91 · Aug 21, 2023 · Updated 2 years ago
RTkenny / RiskPO
Official implementation of 'RiskPO: Risk-based Policy Optimization via Verifiable Reward for LLM Post-Training', accepted by ICLR 2026
☆18 · Oct 15, 2025 · Updated 9 months ago
WeihaoTan / TWOSOME
Implementation of TWOSOME
☆82 · Jan 11, 2025 · Updated last year
PKU-RL / Plan4MC
[NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks
☆200 · Mar 6, 2024 · Updated 2 years ago