AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆19Aug 9, 2024Updated last year
Alternatives and similar repositories for AdaRefiner
Users who are interested in AdaRefiner are comparing it to the libraries listed below.
- Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation (ICML 2024)☆12Aug 9, 2024Updated last year
- A simple tool to help get information from NKU-EAMIS (NKU Education Affairs Management Information System).☆10Jul 27, 2020Updated 5 years ago
- An AI framework for card games based on the DQN algorithm☆13Jul 25, 2024Updated last year
- Being-VL-0.5: Unified Multimodal Understanding via Byte-Pair Visual Encoding (ICCV 2025, Highlight)☆53Dec 22, 2025Updated 3 months ago
- A simple tool to help find good prices on the Steam market.☆14Jul 14, 2020Updated 5 years ago
- My Blog (https://www.zhangwp.com).☆30Jan 11, 2024Updated 2 years ago
- ☆13Oct 11, 2022Updated 3 years ago
- Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills☆62Jun 19, 2025Updated 9 months ago
- Being-M0.5: A Real-Time Controllable Vision-Language-Motion Model (ICCV 2025)☆35Sep 4, 2025Updated 6 months ago
- This repo supports integrating LLMs and communication algorithms with MARL using SMAC as the platform. It provides an end-to-end workflow…☆17Mar 8, 2025Updated last year
- Official code for "Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping" (ICLR 2025)☆29Oct 25, 2025Updated 5 months ago
- MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks.☆27Jul 9, 2025Updated 8 months ago
- A backdoor attack on the LeNet-5 network via data poisoning, using the MNIST handwritten digit dataset☆14Feb 5, 2021Updated 5 years ago
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆16Jul 17, 2021Updated 4 years ago
- CVPR 2026 - MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation☆35Mar 17, 2026Updated last week
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆30Mar 1, 2024Updated 2 years ago
- ☆18Sep 11, 2022Updated 3 years ago
- PyTorch implementation of 'Learning from Simulated and Unsupervised Images through Adversarial Training'☆16Jun 16, 2020Updated 5 years ago
- ☆17Feb 17, 2023Updated 3 years ago
- DemoGrasp: Universal Dexterous Grasping from a Single Demonstration (ICLR 2026)☆46Feb 14, 2026Updated last month
- ☆39Aug 10, 2025Updated 7 months ago
- This notebook presents an example of the equal risk pricing framework with deep hedging from my paper Carbonneau, A. and Godin, F. (2020)…☆15Oct 15, 2021Updated 4 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch☆21May 26, 2021Updated 4 years ago
- An online federated reinforcement learning algorithm published in INFOCOM2024☆17Dec 1, 2024Updated last year
- Using Large Language Models for Hyperparameter Optimization☆28May 13, 2024Updated last year
- Generative Exploration and Exploitation☆24Nov 27, 2021Updated 4 years ago
- Image Caption with Attention | a PyTorch Project to Image Caption☆17Jul 14, 2019Updated 6 years ago
- This repository contains code examples for the paper "Learning to sequence and blend robotics skills via differentiable optimization".☆12Sep 11, 2022Updated 3 years ago
- ☆16Feb 23, 2024Updated 2 years ago
- The light codes for the paper published in JMS named 'Solving task scheduling problems in cloud manufacturing via attention mechanism and…☆20May 15, 2023Updated 2 years ago
- Code for the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"☆12Jun 25, 2024Updated last year
- 💬 Send iMessages using Python through the Shortcuts app.☆18May 25, 2024Updated last year
- ☆12Nov 30, 2022Updated 3 years ago
- ☆21Apr 12, 2024Updated last year
- Playing Pokemon Red with Reinforcement Learning☆21Jul 28, 2025Updated 7 months ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆34Feb 9, 2026Updated last month
- [KDD 2023] Causal Inference via Style Transfer for Out-of-distribution Generalisation☆28Feb 29, 2024Updated 2 years ago
- Implementation of TWOSOME☆82Jan 11, 2025Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks☆199Mar 6, 2024Updated 2 years ago