ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"
☆33Nov 30, 2023Updated 2 years ago
Alternatives and similar repositories for MC-LAVE-RL
Users that are interested in MC-LAVE-RL are comparing it to the libraries listed below
Sorting:
- ☆17Dec 30, 2024Updated last year
- ☆11Dec 28, 2023Updated 2 years ago
- Repository (preliminary codes) for DSTC10 SIMMC track.☆19Dec 9, 2022Updated 3 years ago
- Official PyTorch implementation of AlberDICE☆23Dec 8, 2023Updated 2 years ago
- Code for GFlowNet-DPO (Direct Preference Optimization) EMNLP 2024 Main☆19Feb 22, 2026Updated 2 weeks ago
- MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series☆17Sep 5, 2025Updated 6 months ago
- OpenPipe Reinforcement Learning Experiments☆32Mar 14, 2025Updated 11 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Aug 20, 2025Updated 6 months ago
- A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent …☆30Jun 2, 2025Updated 9 months ago
- ☆26Apr 12, 2018Updated 7 years ago
- Parallel Monte Carlo Tree Search with Batched Rigid-body Simulations☆31Aug 9, 2024Updated last year
- Financial Analysis and Algorithmic Trading Strategies in Python☆11Feb 16, 2023Updated 3 years ago
- FPGA Low latency 10GBASE-R PCS☆12May 23, 2023Updated 2 years ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- Some implementations from the paper robust risk aware reinforcement learning☆36Dec 15, 2021Updated 4 years ago
- RL algorithm for stock trading with multiple reward functions☆11Apr 21, 2024Updated last year
- Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)☆40Oct 30, 2023Updated 2 years ago
- ☆16Feb 22, 2025Updated last year
- NEFF Calculator and MSA File Converter☆13Sep 16, 2025Updated 5 months ago
- Implementations of the renormalization group-based diffusion model (RGDM).☆16Mar 10, 2025Updated last year
- ☆10Jul 21, 2019Updated 6 years ago
- LLM Skirmish☆44Feb 3, 2026Updated last month
- ☆14Mar 21, 2024Updated last year
- FinanceGPT-B☆10Mar 26, 2024Updated last year
- ☆11Jan 11, 2022Updated 4 years ago
- About Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- Disordered protein ensemble prediction☆12Feb 19, 2026Updated 2 weeks ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆30Updated this week
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- code for polite☆11Feb 28, 2024Updated 2 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.☆15Dec 20, 2021Updated 4 years ago
- Open Source Tsetlin Machine framework☆17Oct 15, 2018Updated 7 years ago
- Isaac Gym Reinforcement Learning Environments for humanoid robot Bez☆10Jul 27, 2022Updated 3 years ago
- ReLAx - Reinforcement Learning Applications Library☆15Feb 19, 2023Updated 3 years ago
- Gathers machine learning and deep learning models for Reinforcement Learning☆10Sep 8, 2018Updated 7 years ago
- unifloc on python☆15Nov 14, 2020Updated 5 years ago
- Bitcoin blockchain to avro file☆12Feb 8, 2018Updated 8 years ago