AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆18Aug 9, 2024Updated last year
Alternatives and similar repositories for AdaRefiner
Users that are interested in AdaRefiner are comparing it to the libraries listed below
Sorting:
- 一款基于DQN算法的牌类游戏AI框架 / An AI framework for card games based on DQN algorithm☆13Jul 25, 2024Updated last year
- Being-VL-0.5: Unified Multimodal Understanding via Byte-Pair Visual Encoding (ICCV 2025, Highlight)☆49Dec 22, 2025Updated 2 months ago
- A library of fast and accurate low fidelity dynamic models for applications in robotics☆11Jul 12, 2024Updated last year
- ☆11Jul 4, 2024Updated last year
- 哈工大机器学习作业一——多项式拟合曲线☆10Oct 19, 2016Updated 9 years ago
- biorbd + casadi + variational integrator☆10Apr 30, 2024Updated last year
- Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills☆62Jun 19, 2025Updated 8 months ago
- Simplifying the autonomous vehicle development process.☆19Jul 22, 2025Updated 7 months ago
- MPI Code Generation through Domain-Specific Language Models☆14Nov 19, 2024Updated last year
- This notebook presents an example of the equal risk pricing framework with deep hedging from my paper Carbonneau, A. and Godin, F. (2020)…☆15Oct 15, 2021Updated 4 years ago
- ☆12Nov 30, 2022Updated 3 years ago
- 一个简易的不自动化的autodl部署自己的代理的指南,帮助下载huggingface的模型(鉴于官方学术加速以及hfmirror很不好用)☆16Sep 3, 2025Updated 6 months ago
- ☆47Dec 11, 2023Updated 2 years ago
- Hamiltonian neural network implementation for Henon Heiles dynamical system learning mix of order and chaos☆11Dec 2, 2023Updated 2 years ago
- ☆13Feb 21, 2025Updated last year
- Hangman Game implementation using n-gram language model in NLP, achieved an accuracy of more than 50%☆13Jul 18, 2023Updated 2 years ago
- ☆18Jun 26, 2024Updated last year
- ☆14Mar 5, 2023Updated 3 years ago
- Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"☆24Feb 5, 2026Updated last month
- ☆13Oct 11, 2022Updated 3 years ago
- LITEN: Learning from Inference Time Execution for VLAs☆26Oct 23, 2025Updated 4 months ago
- Implementation of Lagrangian Neural Networks in PyTorch☆14Nov 21, 2025Updated 3 months ago
- Code for "Symplectic Adjoint Method for Exact Gradient of Neural ODE with Minimal Memory," NeurIPS, 2021.☆18Jan 8, 2022Updated 4 years ago
- Text world based on Minecraft rules.☆17May 13, 2024Updated last year
- The code of paper "Learning Heterogeneous Strategies via Graph-based Multi-agent Reinforcement Learning in Mixed Cooperative-Competitive …☆15Jul 17, 2021Updated 4 years ago
- This is the source repository containing all information necessary to reproduce the Cambridge RoboMaster platform.☆18Oct 16, 2024Updated last year
- ☆15Jan 18, 2026Updated last month
- [NeurIPS 2023] Official implementation of "Learning from Visual Observation via Offline Pretrained State-to-Go Transformer"☆18Oct 1, 2023Updated 2 years ago
- Symplectic integration of Hamiltonian systems. Zymplectic is a pre-compiled GUI and engine with 2D/3D-graphics bundled with more than 80 …☆15Feb 25, 2026Updated last week
- code of ICLR 2024 paper Reinforcement Symbolic Regression Machine☆15Feb 19, 2024Updated 2 years ago
- Deep Optimal Stopping Project☆15Jun 8, 2019Updated 6 years ago
- ☆22Jun 6, 2025Updated 9 months ago
- ☆24Jan 1, 2025Updated last year
- ☆21Nov 5, 2024Updated last year
- ☆15Dec 15, 2020Updated 5 years ago
- 官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。☆17Sep 25, 2023Updated 2 years ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆32Nov 2, 2025Updated 4 months ago
- Multibody dynamics project for rigid-flexible systems with actuated joints. Flexible bodies use Cosserat Rod Theory formulated on a Lie G…☆23Jun 19, 2023Updated 2 years ago
- The pastebin for mathematicians☆35Jun 9, 2014Updated 11 years ago