AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆19 · Aug 9, 2024 · Updated last year
Alternatives and similar repositories for AdaRefiner
Users who are interested in AdaRefiner are comparing it to the libraries listed below.
- Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation (ICML 2024) ☆12 · Aug 9, 2024 · Updated last year
- Simple tool to help find good price on steam market. ☆14 · Jul 14, 2020 · Updated 5 years ago
- Fast Fourier Transform Acceleration Algorithm. (Accelerated by CUDA) ☆12 · Jul 8, 2018 · Updated 7 years ago
- Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills ☆63 · Jun 19, 2025 · Updated 10 months ago
- Being-M0.5: A Real-Time Controllable Vision-Language-Motion Model (ICCV 2025) ☆35 · Sep 4, 2025 · Updated 8 months ago
- ☆14 · Mar 5, 2023 · Updated 3 years ago
- [ACL 2021] This is the PyTorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning". ☆13 · Jan 16, 2022 · Updated 4 years ago
- Official code for "Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping" (ICLR 2025) ☆28 · Oct 25, 2025 · Updated 6 months ago
- MENTOR is a highly efficient visual RL algorithm that excels in both simulation and real-world complex robotic learning tasks. ☆27 · Jul 9, 2025 · Updated 9 months ago
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive … ☆16 · Jul 17, 2021 · Updated 4 years ago
- I2Q: A Fully Decentralized Q-Learning Algorithm ☆19 · Nov 10, 2022 · Updated 3 years ago
- ☆17 · Feb 17, 2023 · Updated 3 years ago
- Federated Reinforcement Learning ☆12 · Jun 20, 2019 · Updated 6 years ago
- DemoGrasp: Universal Dexterous Grasping from a Single Demonstration (ICLR 2026) ☆62 · Feb 14, 2026 · Updated 2 months ago
- My final project, Snow Simulation, for Prof. Lingqi Yan's online open course games 101-Intro to Modern Computer Graphics ☆12 · Mar 12, 2021 · Updated 5 years ago
- Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch ☆21 · May 26, 2021 · Updated 4 years ago
- Using Large Language Models for Hyperparameter Optimization ☆29 · May 13, 2024 · Updated last year
- An online federated reinforcement learning algorithm published in INFOCOM2024 ☆17 · Dec 1, 2024 · Updated last year
- MPI Code Generation through Domain-Specific Language Models ☆15 · Nov 19, 2024 · Updated last year
- ☆11 · Aug 16, 2018 · Updated 7 years ago
- A customized docker for headless GPU rendering without host-side configuration ☆11 · Aug 22, 2022 · Updated 3 years ago
- Generative Exploration and Exploitation ☆24 · Nov 27, 2021 · Updated 4 years ago
- A library of fast and accurate low fidelity dynamic models for applications in robotics ☆11 · Jul 12, 2024 · Updated last year
- ☆16 · Feb 23, 2024 · Updated 2 years ago
- Image Caption with Attention | a PyTorch Project to Image Caption ☆17 · Jul 14, 2019 · Updated 6 years ago
- This repository contains code examples for the paper "Learning to sequence and blend robotics skills via differentiable optimization". ☆12 · Sep 11, 2022 · Updated 3 years ago
- 💬 Send iMessages using Python through the Shortcuts app. ☆18 · May 25, 2024 · Updated last year
- ☆12 · Nov 30, 2022 · Updated 3 years ago
- ☆21 · Apr 12, 2024 · Updated 2 years ago
- Implementation of TWOSOME ☆82 · Jan 11, 2025 · Updated last year
- [NeurIPS 2023 FMDM Workshop] Skill Reinforcement Learning and Planning for Open-World Long-Horizon Tasks ☆200 · Mar 6, 2024 · Updated 2 years ago
- LITEN: Learning from Inference Time Execution for VLAs ☆26 · Oct 23, 2025 · Updated 6 months ago
- Work on the Cucker-Smale model and its extensions. Keywords: ODE, Runge-Kutta methods, SDE, Euler-Maruyama method, NumPy, Matplotlib ☆12 · Feb 14, 2024 · Updated 2 years ago
- [ECCV] HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning ☆26 · Sep 6, 2025 · Updated 8 months ago
- ☆90 · Aug 21, 2023 · Updated 2 years ago
- [SIGGRAPH 2024] Official Implementation of "Toonify3D: StyleGAN-based 3D Stylized Face Generator" ☆37 · Apr 29, 2025 · Updated last year
- Hamiltonian neural network implementation for Henon Heiles dynamical system learning mix of order and chaos ☆11 · Dec 2, 2023 · Updated 2 years ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning ☆39 · Feb 9, 2026 · Updated 2 months ago
- Performing Symbolic Regression via Monte Carlo Tree Search (MCTS) ☆14 · Nov 2, 2018 · Updated 7 years ago