alexlioralexli/rllab-finetuning

☆32

Alternatives and similar repositories for rllab-finetuning

Users who are interested in rllab-finetuning are comparing it to the repositories listed below.

4rChon / NL-FuN
N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations
☆19 · Sep 17, 2019 · Updated 6 years ago
MWATelescope / Birli
A Rust library for preprocessing tasks in the Murchison Widefield Array (MWA) data pipeline.
☆18 · Updated this week
tristandeleu / jax-meta-learning
A collection of meta-learning algorithms in Jax
☆24 · Sep 3, 2022 · Updated 3 years ago
seungyulhan / disc
☆10 · Aug 17, 2022 · Updated 3 years ago
vitchyr / torch-rl
A reinforcement learning package implemented in Torch
☆11 · Jan 24, 2016 · Updated 10 years ago
alexlioralexli / learned-fourier-features
Code for the paper "Functional Regularization for Reinforcement Learning via Learned Fourier Features"
☆20 · Oct 2, 2022 · Updated 3 years ago
bhairavmehta95 / data-efficient-hrl
Implementation of Data Efficient Reinforcement Learning in Pytorch
☆20 · Aug 6, 2019 · Updated 6 years ago
htdt / lwm
Latent World Models For Intrinsically Motivated Exploration | Official repository
☆23 · Apr 28, 2021 · Updated 5 years ago
GTLIDAR / digit_controller
☆10 · Mar 9, 2023 · Updated 3 years ago
chenhch8 / rl4rs-dqn
This is a DQN-based recommendation system for item-list recommendation and it finally achieved second place in the competition of RL-base…
☆11 · Oct 8, 2021 · Updated 4 years ago
makokal / MDPN
Unified notation for Markov Decision Processes PO(MDP)s
☆24 · Apr 27, 2018 · Updated 8 years ago
SeungeonBaek / discrete-agents-test
Utilizing RL and GNN for trajectory planning (co-work)
☆13 · Jul 28, 2023 · Updated 2 years ago
hemilpanchiwala / Hindsight-Experience-Replay
Implementation of HindSight Experience Replay paper with Pytorch
☆31 · Apr 28, 2021 · Updated 5 years ago
lsst / lsst
Configures environment for LSST software (newinstall.sh)
☆16 · Updated this week
nishantkr18 / guided-cost-learning
Implementation of the paper https://arxiv.org/abs/1603.00448.
☆38 · Dec 31, 2020 · Updated 5 years ago
irom-princeton / Invariant-Policy-Optimization
Code for Invariant Policy Optimization
☆15 · Jul 22, 2020 · Updated 5 years ago
kpaonaut / HAAR-A-Hierarchical-RL-Algorithm
Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
☆31 · Jan 19, 2023 · Updated 3 years ago
jrl-umi3218 / mc_force_sensor_calibration_controller
Controller to calibrate force sensors and let mc_rtc remove the effect of gravity due to links attached to the force sensors (grippers/f…
☆10 · Jan 26, 2026 · Updated 5 months ago
kkhetarpal / ioc
Options of Interest: Temporal Abstraction with Interest Functions AAAI 2020
☆25 · Jul 31, 2020 · Updated 5 years ago
KhalilDMK / DebiasedBERT4Rec
Pytorch implementation of the paper "Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers".
☆12 · Jan 22, 2023 · Updated 3 years ago
mw9385 / Hierarchical_Coverage_Path_Planning
☆12 · Apr 22, 2022 · Updated 4 years ago
ztjhz / t5-jax
JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
☆24 · Jun 10, 2023 · Updated 3 years ago
yardenas / jax-dreamer
Dreamer on JAX
☆16 · Jan 19, 2022 · Updated 4 years ago
jacobandreas / bibs
Annotated bibliographies.
☆40 · Aug 25, 2019 · Updated 6 years ago
Alestaubin / stable-imitation-policy-with-waypoints
Learning globally stable dynamical systems policies through imitation. A modification of the original work, focussing on waypoint-based i…
☆14 · Oct 12, 2024 · Updated last year
ranlongyu / pycloudsim
Cloud task scheduling simulation platform
☆13 · Mar 11, 2020 · Updated 6 years ago
alversafa / option-critic-arch
Implementation of the Option-Critic Architecture
☆42 · Dec 9, 2018 · Updated 7 years ago
julianje / Bishop
Mental state inference from observable behavior
☆15 · Dec 3, 2021 · Updated 4 years ago
ixaxaar / pytorch-dni
Decoupled Neural Interfaces Using Synthetic Gradients - under development
☆11 · Jun 27, 2025 · Updated last year
ben-eysenbach / sac
Soft Actor-Critic
☆160 · Mar 13, 2018 · Updated 8 years ago
mala-lab / STEN
Official implementation of ECML PKDD'24 paper 'Self-Supervised Spatial-Temporal Normality Learning for Time Series Anomaly Detection'.
☆17 · Aug 17, 2024 · Updated last year
pokaxpoka / rad_procgen
RAD: Reinforcement Learning with Augmented Data (code for procgen experiments)
☆19 · Mar 29, 2021 · Updated 5 years ago
BerkeleyAutomation / tsc
Implements experiments to evaluate transition state clustering
☆13 · Jul 12, 2016 · Updated 10 years ago
mklissa / PPOC
Proximal Policy Option-Critic
☆26 · Jan 4, 2019 · Updated 7 years ago
xukai92 / WeightsAndBiasLogger.jl
Log to W&B from Julia
☆12 · Jun 13, 2022 · Updated 4 years ago
rocketman123456 / GAMES-105
☆14 · Oct 21, 2024 · Updated last year
deterministic-algorithms-lab / Jax-Journey
A pathway and collection of resources to learning Jax from beginning to advance.
☆11 · Jan 2, 2021 · Updated 5 years ago
Kizmelvin / aws-amplify-figmatocode
☆11 · Apr 2, 2022 · Updated 4 years ago
kvgarimella / dagger
Training a car to drive in the CarRacing-v0 Gym Environment using imitation learning.
☆21 · Oct 18, 2020 · Updated 5 years ago