jys5609/MC-LAVE-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jys5609/MC-LAVE-RL)

jys5609 / MC-LAVE-RL

ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"

☆33

Alternatives and similar repositories for MC-LAVE-RL

Users that are interested in MC-LAVE-RL are comparing it to the libraries listed below

Sorting:

KAIST-AILab / imitation-dice
View on GitHub
☆17Dec 30, 2024Updated last year
KAIST-AILab / palr
View on GitHub
☆11Dec 28, 2023Updated 2 years ago
KAIST-AILab / DSTC10-SIMMC
View on GitHub
Repository (preliminary codes) for DSTC10 SIMMC track.
☆19Dec 9, 2022Updated 3 years ago
dematsunaga / alberdice
View on GitHub
Official PyTorch implementation of AlberDICE
☆23Dec 8, 2023Updated 2 years ago
ggoggam / gdpo
View on GitHub
Code for GFlowNet-DPO (Direct Preference Optimization) EMNLP 2024 Main
☆19Feb 22, 2026Updated 2 weeks ago
aaron-wheeler / MarketGPT
View on GitHub
MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series
☆17Sep 5, 2025Updated 6 months ago
OpenPipe / rl-experiments
View on GitHub
OpenPipe Reinforcement Learning Experiments
☆32Mar 14, 2025Updated 11 months ago
tsinghua-fib-lab / SmartAgent
View on GitHub
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
☆27Aug 20, 2025Updated 6 months ago
RDLLab / posggym
View on GitHub
A collection of environments and reference agents for planning and reinforcement learning research in partially observable, multi-agent …
☆30Jun 2, 2025Updated 9 months ago
activatedgeek / qmix
View on GitHub
☆26Apr 12, 2018Updated 7 years ago
arc-l / pmbs
View on GitHub
Parallel Monte Carlo Tree Search with Batched Rigid-body Simulations
☆31Aug 9, 2024Updated last year
canl / algo-trading
View on GitHub
Financial Analysis and Algorithmic Trading Strategies in Python
☆11Feb 16, 2023Updated 3 years ago
theSergeyGusev / simple10GbaseR
View on GitHub
FPGA Low latency 10GBASE-R PCS
☆12May 23, 2023Updated 2 years ago
kyegomez / AlphaDev
View on GitHub
Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…
☆11Aug 29, 2023Updated 2 years ago
sebjai / robust-risk-aware-rl
View on GitHub
Some implementations from the paper robust risk aware reinforcement learning
☆36Dec 15, 2021Updated 4 years ago
somsagar07 / RL-stock-trading-
View on GitHub
RL algorithm for stock trading with multiple reward functions
☆11Apr 21, 2024Updated last year
LeapLabTHU / FamO2O
View on GitHub
Repository of "Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning" (NeurIPS 2023 Spotlight)
☆40Oct 30, 2023Updated 2 years ago
Improbable-AI / orso
View on GitHub
☆16Feb 22, 2025Updated last year
Maryam-Haghani / NEFFy
View on GitHub
NEFF Calculator and MSA File Converter
☆13Sep 16, 2025Updated 5 months ago
kantamasuki / RGDM
View on GitHub
Implementations of the renormalization group-based diffusion model (RGDM).
☆16Mar 10, 2025Updated last year
sjdee / Research-Stock-Prediction
View on GitHub
☆10Jul 21, 2019Updated 6 years ago
llmskirmish / skirmish
View on GitHub
LLM Skirmish
☆44Feb 3, 2026Updated last month
ReidarRiveland / Instruct-RNN
View on GitHub
☆14Mar 21, 2024Updated last year
Neviim96 / FinanceGPT-B
View on GitHub
FinanceGPT-B
☆10Mar 26, 2024Updated last year
inboxedshoe / RP-DQN
View on GitHub
☆11Jan 11, 2022Updated 4 years ago
OuAzusaKou / imagination_mechanism
View on GitHub
About Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"
☆13Oct 7, 2023Updated 2 years ago
THGLab / IDPForge
View on GitHub
Disordered protein ensemble prediction
☆12Feb 19, 2026Updated 2 weeks ago
LARK-AI-Lab / CodeScaler
View on GitHub
The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"
☆30Updated this week
vint-1 / dreamsmooth
View on GitHub
DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)
☆12May 6, 2024Updated last year
wassname / rl_2d_walker.js
View on GitHub
Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)
☆10Sep 7, 2020Updated 5 years ago
holken / polite
View on GitHub
code for polite
☆11Feb 28, 2024Updated 2 years ago
nirgreshler / bayesian-online-planning
View on GitHub
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13Jun 17, 2024Updated last year
CLEANit / heatenginegym
View on GitHub
A collection of heat engines, based on the OpenAI Gym environment framework for use with reinforcement learning applications.
☆15Dec 20, 2021Updated 4 years ago
cair / open-tsetlin-machine
View on GitHub
Open Source Tsetlin Machine framework
☆17Oct 15, 2018Updated 7 years ago
utra-robosoccer / Bez_IsaacGym
View on GitHub
Isaac Gym Reinforcement Learning Environments for humanoid robot Bez
☆10Jul 27, 2022Updated 3 years ago
nslyubaykin / relax
View on GitHub
ReLAx - Reinforcement Learning Applications Library
☆15Feb 19, 2023Updated 3 years ago
huseinzol05 / Reinforcement-Learning-Agents
View on GitHub
Gathers machine learning and deep learning models for Reinforcement Learning
☆10Sep 8, 2018Updated 7 years ago
unifloc / unifloc_py
View on GitHub
unifloc on python
☆15Nov 14, 2020Updated 5 years ago
cobookman / blockchainToAvro
View on GitHub
Bitcoin blockchain to avro file
☆12Feb 8, 2018Updated 8 years ago