AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆19Aug 9, 2024Updated last year
Alternatives and similar repositories for AdaRefiner
Users that are interested in AdaRefiner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple tool to help get information in NKU-EAMIS(NKU Education Affairs Management Information System).☆10Jul 27, 2020Updated 5 years ago
- Being-VL-0.5: Unified Multimodal Understanding via Byte-Pair Visual Encoding (ICCV 2025, Highlight)☆52Dec 22, 2025Updated 5 months ago
- ☆49Dec 11, 2023Updated 2 years ago
- Being-M0.5: A Real-Time Controllable Vision-Language-Motion Model (ICCV 2025)☆36Sep 4, 2025Updated 8 months ago
- CEVAE with VampPrior☆11Jul 18, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Mar 5, 2023Updated 3 years ago
- CEVAE(Causal Effect Variational AutoEncoder) written with pytorch and pyro.☆10Feb 15, 2021Updated 5 years ago
- This repo supports integrating LLMs and communication algorithms with MARL using SMAC as the platform. It provides an end-to-end workflow…☆19Mar 8, 2025Updated last year
- Official code for "Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping" (ICLR 2025)☆28Oct 25, 2025Updated 7 months ago
- MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks.☆27Jul 9, 2025Updated 10 months ago
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆16Jul 17, 2021Updated 4 years ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆77Apr 23, 2026Updated last month
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆30Mar 1, 2024Updated 2 years ago
- ☆18Sep 11, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Michael Collins NLP课程资料☆14Dec 13, 2020Updated 5 years ago
- Federated Reinforcement Learning☆12Jun 20, 2019Updated 6 years ago
- ☆28Nov 6, 2023Updated 2 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆21May 26, 2021Updated 5 years ago
- An online federated reinforcement learning algorithm published in INFOCOM2024☆17Dec 1, 2024Updated last year
- MPI Code Generation through Domain-Specific Language Models☆16Nov 19, 2024Updated last year
- ☆11Aug 16, 2018Updated 7 years ago
- Generative Exploration and Exploitation☆24Nov 27, 2021Updated 4 years ago
- ☆16Feb 23, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This repository contains code examples for the paper "Learning to sequence and blend robotics skills via differentiable optimization".☆13Sep 11, 2022Updated 3 years ago
- The light codes for the paper published in JMS named 'Solving task scheduling problems in cloud manufacturing via attention mechanism and…☆20May 15, 2023Updated 3 years ago
- Code the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"☆12Jun 25, 2024Updated last year
- ☆12Nov 30, 2022Updated 3 years ago
- Playing Pokemon Red with Reinforcement Learning☆21Jul 28, 2025Updated 9 months ago
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆200Mar 6, 2024Updated 2 years ago
- LITEN: Learning from Inference Time Execution for VLAs☆27Oct 23, 2025Updated 7 months ago
- Works about Cucker-Smale model and its extensions. =Keywords: ODE, Runge-Kutta methods, SDE, Euler-Maruyama method, NumPy, Matplotlib☆12Feb 14, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Term Project. Use Simulated Annealing Algorithm for the basic Job Shop Scheduling Problem (模拟退火解决车间调度问题)☆19Aug 2, 2021Updated 4 years ago
- ☆11Jul 4, 2024Updated last year
- ☆13Apr 28, 2019Updated 7 years ago
- ☆91Aug 21, 2023Updated 2 years ago
- Performing Symbolic Regression via Monte Carlo Tree Search (MCTS)☆14Nov 2, 2018Updated 7 years ago
- Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"☆33May 10, 2026Updated 2 weeks ago
- ☆16Apr 14, 2026Updated last month