yanxue7/RL-LLM-Prior

yanxue7 / RL-LLM-Prior

☆26

Alternatives and similar repositories for RL-LLM-Prior

Users interested in RL-LLM-Prior are comparing it to the repositories listed below.

yanxue7 / E3T-Overcooked
☆15 · May 4, 2024 · Updated 2 years ago
leor-c / REM
Improving Token-Based World Models with Parallel Observation Prediction (ICML 2024)
☆14 · Feb 23, 2026 · Updated 5 months ago
flowersteam / lamorel
Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).
☆249 · Dec 11, 2025 · Updated 7 months ago
nevikw39 / oj
My low-quality and poor-performance codes submitted to several online judges, such as ZeroJudge, GreenJudge, UVa, TIOJ, AtCoder, CSES pro…
☆14 · Apr 6, 2026 · Updated 3 months ago
ziyadsheeba / qfat
[NeurIPS 2025, Spotlight] An official implementation of the paper Quantization-Free Autoregressive Action Transformer
☆12 · Mar 3, 2026 · Updated 4 months ago
zhourunlong / Reflect-RL
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
☆18 · Jul 19, 2025 · Updated last year
Ziwei89 / FBOD
Flying bird object detection in surveillance video
☆17 · Apr 24, 2025 · Updated last year
thu-ml / Efficient-Diffusion-Alignment
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15 · Oct 29, 2024 · Updated last year
kayuksel / pytorch-ars
PyTorch Implementations of Augmented Random Search
☆17 · Feb 28, 2019 · Updated 7 years ago
polixir / NeoRL2
☆20 · Oct 27, 2025 · Updated 9 months ago
PacktPublishing / Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL
Hands-On Reinforcement Learning with TensorFlow & TRFL
☆14 · Jan 18, 2021 · Updated 5 years ago
guyuntian / CoT_benchmark
Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"
☆21 · Jul 16, 2023 · Updated 3 years ago
mail-ecnu / Text-Gym-Agents
This project provides a set of translators to convert OpenAI Gym environments into text-based environments. It is designed to investigate…
☆22 · May 29, 2024 · Updated 2 years ago
jidiai / GRF_MARL
Google Research Football MARL Benchmark and Research Toolkit
☆61 · May 19, 2024 · Updated 2 years ago
allenai / sso
Repository for Skill Set Optimization
☆14 · Jul 26, 2024 · Updated 2 years ago
jidiai / TaxAI
☆71 · Jul 15, 2024 · Updated 2 years ago
burchim / TWISTER
[ICLR 2025] Learning Transformer-based World Models with Contrastive Predictive Coding (TWISTER)
☆57 · Mar 9, 2025 · Updated last year
ALRhub / MTS3
Implementation of Neurips 2023 Paper "Multi Time Scale World Models"
☆18 · Nov 8, 2024 · Updated last year
clvrai / leaps
Code for Learning to Synthesize Programs as Interpretable and Generalizable Policies in NeurIPS 2021
☆40 · Sep 16, 2025 · Updated 10 months ago
jidiai / olympics_engine
A simple 2D ball collision engine.
☆12 · Jun 15, 2023 · Updated 3 years ago
ajyl / mech_int_othelloGPT
☆10 · Nov 6, 2024 · Updated last year
NRL-Plasma-Physics-Division / turbopy
A lightweight computational physics framework, based on the organization of turboWAVE. Implements a "Simulation, PhysicsModule, ComputeTo…
☆12 · Updated this week
guosyjlu / OEMA
Official PyTorch code for "Sample Efficient Offline-to-Online Reinforcement Learning" in TKDE'23.
☆16 · Aug 14, 2023 · Updated 2 years ago
chang-github-00 / Predictive-Decoding
Repo for Anonymous purpose, pls don't distribute
☆10 · Oct 2, 2024 · Updated last year
david-lindner / idrl
Code accompanying the paper "Information Directed Reward Learning for Reinforcement Learning" (NeurIPS 2021).
☆13 · Nov 16, 2021 · Updated 4 years ago
lasgroup / rewarduq
Code for "RewardUQ: A Unified Framework for Uncertainty-Aware Reward Models"
☆17 · Apr 21, 2026 · Updated 3 months ago
zisikons / deep-rl
Deep Learning (FS 2020)
☆17 · Oct 10, 2022 · Updated 3 years ago
MeghnaKhaturia / 5G-Flow-RAN
☆12 · May 17, 2021 · Updated 5 years ago
boschresearch / stuttgart-sumo-traffic-scenario
A synthetic 24 hour traffic scenario for a 45 km section of the German highway A81 between Stuttgart Feuerbach - Heilbronn (Baden-Württem…
☆13 · Oct 5, 2020 · Updated 5 years ago
aidanscannell / GPJax
Minimal Gaussian process library in JAX with a simple (custom) approach to state management.
☆12 · Dec 20, 2023 · Updated 2 years ago
BNN-UPC / DRL-ES-OTN
☆20 · Feb 18, 2022 · Updated 4 years ago
labicon / CurricuLLM
Official code repository for CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
☆28 · Sep 26, 2025 · Updated 10 months ago
mh-cad / vistarsier
VisTarsier opensource implementation.
☆14 · Jan 16, 2024 · Updated 2 years ago
evanthebouncy / icml2018_selecting_representative_examples
code for icml paper: https://arxiv.org/abs/1711.03243v3
☆12 · Jul 8, 2018 · Updated 8 years ago
thuml / SPOT
Code release for "Supported Policy Optimization for Offline Reinforcement Learning" (NeurIPS 2022), https://arxiv.org/abs/2202.06239
☆22 · Jun 24, 2023 · Updated 3 years ago
gdamaskinos / fleet
Online Federated Learning.
☆16 · Apr 26, 2021 · Updated 5 years ago
billtubbs / gym-CartPole-bt-v0
A modified version of the cart-pole OpenAI Gym environment for testing different control policies
☆13 · May 4, 2026 · Updated 2 months ago
nerdimite / maml
Model-Agnostic Meta-Learning in PyTorch
☆12 · Jul 31, 2020 · Updated 5 years ago
ambujtewari / stats701-winter2021
Theory of Reinforcement Learning
☆18 · Apr 20, 2021 · Updated 5 years ago