PKU-RL / AdaRefiner
AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)
☆18Aug 9, 2024Updated last year
Alternatives and similar repositories for AdaRefiner
Users who are interested in AdaRefiner are comparing it to the libraries listed below
- Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation (ICML 2024)☆11Aug 9, 2024Updated last year
- A simple tool to help get information from NKU-EAMIS (NKU Education Affairs Management Information System).☆10Jul 27, 2020Updated 5 years ago
- The first comprehensive developer’s guide to Bitcoin's most powerful upgrade — Taproot, fully open-access, reproducible, and testnet-veri…☆29Updated this week
- Code for the paper: "Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows".☆12Feb 24, 2024Updated last year
- Code for the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"☆11Jun 25, 2024Updated last year
- Work on the Cucker-Smale model and its extensions. Keywords: ODE, Runge-Kutta methods, SDE, Euler-Maruyama method, NumPy, Matplotlib☆11Feb 14, 2024Updated last year
- biorbd + casadi + variational integrator☆10Apr 30, 2024Updated last year
- Mixed Integer Linear Programming (MILP) Model for solving airport gate assignment problem.☆13Oct 18, 2022Updated 3 years ago
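- A simple, non-automated guide to deploying your own proxy on autodl to help download huggingface models (given that the official academic acceleration and hfmirror work poorly)☆16Sep 3, 2025Updated 5 months ago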
- This repository contains code examples for the paper "Learning to sequence and blend robotics skills via differentiable optimization".☆12Sep 11, 2022Updated 3 years ago
- Simplifying the autonomous vehicle development process.☆19Jul 22, 2025Updated 6 months ago
- This notebook presents an example of the equal risk pricing framework with deep hedging from my paper Carbonneau, A. and Godin, F. (2020)…☆15Oct 15, 2021Updated 4 years ago
- My final project, Snow Simulation, for Prof. Lingqi Yan's open online course GAMES101: Intro to Modern Computer Graphics☆12Mar 12, 2021Updated 4 years ago
- MPI Code Generation through Domain-Specific Language Models☆14Nov 19, 2024Updated last year
- ☆46Dec 11, 2023Updated 2 years ago
- Hangman game implementation using an n-gram language model, achieving an accuracy of more than 50%☆13Jul 18, 2023Updated 2 years ago
- Matlab bindings and interface for Haskell☆13Aug 24, 2020Updated 5 years ago
- ☆18Jun 26, 2024Updated last year
- A customized docker for headless GPU rendering without host-side configuration☆10Aug 22, 2022Updated 3 years ago
- Repo for Paper "OpenHA: A Series of Open-Source Hierarchical Agentic Models in Minecraft"☆24Feb 5, 2026Updated last week
- ☆13Oct 11, 2022Updated 3 years ago
- Official implementation of "Multi-armed Bandit Algorithm against Strategic Replication"☆14May 17, 2022Updated 3 years ago
- 💬 Send iMessages using Python through the Shortcuts app.☆18May 25, 2024Updated last year
- [ECCV] HYDRA: A Hyper Agent for Dynamic Compositional Visual Reasoning☆24Sep 6, 2025Updated 5 months ago
- Physics-informed learning of governing equations from scarce data☆13Feb 16, 2021Updated 4 years ago
- Backdoor attack on the LeNet-5 network via data poisoning, using the MNIST handwritten digit dataset☆14Feb 5, 2021Updated 5 years ago
- Deep Optimal Stopping Project☆15Jun 8, 2019Updated 6 years ago
- ☆22Jun 6, 2025Updated 8 months ago
- ☆15Jan 18, 2026Updated 3 weeks ago
- CBML The Code Block Markup Language☆14Jul 28, 2018Updated 7 years ago
- [NeurIPS 2023] Official implementation of "Learning from Visual Observation via Offline Pretrained State-to-Go Transformer"☆18Oct 1, 2023Updated 2 years ago
- Code for the ICLR 2024 paper "Reinforcement Symbolic Regression Machine"☆15Feb 19, 2024Updated last year
- Monte Carlo Method to Solve Laplace and Poisson Equations with example for EE447 High Voltage Engineering☆16Oct 4, 2023Updated 2 years ago
- Using Large Language Models for Hyperparameter Optimization☆25May 13, 2024Updated last year
- LaTeX document containing all solutions to M.A. Armstrong's "Basic Topology"☆13Jan 3, 2023Updated 3 years ago
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models☆32Nov 2, 2025Updated 3 months ago
- ☆18Sep 11, 2022Updated 3 years ago
- AgentX - Agent Extension: MCP Servers, Agent Skills and Plugins Manager☆47Feb 1, 2026Updated last week
- ☆16Aug 5, 2019Updated 6 years ago