☆12Apr 26, 2022Updated 4 years ago
Alternatives and similar repositories for rl-toolkit
Users that are interested in rl-toolkit are comparing it to the libraries listed below.
- Implementation of BC-IRL and other IRL baselines☆30Jun 6, 2023Updated 2 years ago
- Deep reinforcement learning book https://hiddenbeginner.github.io/Deep-Reinforcement-Learnings☆11May 10, 2024Updated last year
- A collection of various LLM tuning methods and data preprocessing code.☆14Apr 19, 2026Updated 2 weeks ago
- Korean Sub for CS285 2021 fall lecture☆14Apr 2, 2022Updated 4 years ago
- Reservoir Simulation environment for Reinforcement Learning. Eclipse Integration for Gym toolkit.☆21Dec 8, 2022Updated 3 years ago
- Diagram templates for Graphviz☆19Apr 11, 2021Updated 5 years ago
- Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)☆20Feb 29, 2020Updated 6 years ago
- [EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning☆36Oct 22, 2025Updated 6 months ago
- An OpenAI Gym to benchmark AI Reinforcement Learning algorithms in fisheries-related control problems☆10Jul 17, 2023Updated 2 years ago
- Fork of https://github.com/xbpeng/DeepMimic☆14Sep 10, 2020Updated 5 years ago
- Scalable Probabilistic Estimates of Electric Vehicle Charging (SPEECh)☆13Nov 12, 2024Updated last year
- ☆15Sep 21, 2020Updated 5 years ago
- ☆60Updated this week
- An implementation of popular Inverse Reinforcement Learning algorithms for various tasks.☆21Jul 26, 2017Updated 8 years ago
- Collection of awesome Call for Papers to submit your Reinforcement Learning papers☆18Sep 13, 2021Updated 4 years ago
- This project provides a data set with bounding boxes, body poses, 3D face meshes & captions of people from our LAION-2.2B. Additionally i…☆14Jan 2, 2022Updated 4 years ago
- Wrappers and utilities for Nvidia IsaacGym☆99Apr 16, 2022Updated 4 years ago
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"☆13Apr 19, 2023Updated 3 years ago
- ☆13Mar 13, 2024Updated 2 years ago
- Curated implementation notebooks and scripts of deep learning based natural language processing tasks and challenges in TensorFlow.☆11Apr 24, 2020Updated 6 years ago
- Playing the game “合成大西瓜” (Merge Big Watermelon) with the DQN reinforcement learning algorithm using the PARL framework☆14Mar 5, 2021Updated 5 years ago
- Least Squares GANs in Tensorflow☆17Apr 20, 2017Updated 9 years ago
- code to help with tsne plotting☆16May 19, 2020Updated 5 years ago
- [COLING 2025] "Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models"☆22Dec 18, 2024Updated last year
- Proximal Policy Optimization in PyTorch☆39Dec 10, 2017Updated 8 years ago
- ☆11Dec 9, 2017Updated 8 years ago
- Deep Reinforcement Learning for Routing a Heterogeneous Fleet of Vehicles☆18Jan 15, 2020Updated 6 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- Power-Law Distribution Analysis☆27Jul 1, 2019Updated 6 years ago
- Examples for using Java clients to interact with Amazon ElastiCache for Redis using the open-source Redis client Jedis and Amazon ElastiC…☆11Apr 24, 2021Updated 5 years ago
- A lightweight RL library inspired by salina☆19Jan 27, 2026Updated 3 months ago
- ☆44May 30, 2024Updated last year
- ☆41Apr 2, 2026Updated last month
- Word Embedding Annealing Using Sequence-to-sequence Model☆16Dec 2, 2020Updated 5 years ago
- D3QN implementation using pytorch☆15Jun 4, 2021Updated 4 years ago
- Jupyter notebook examples for EXAONE Atelier in AWS Marketplace☆14Dec 8, 2023Updated 2 years ago
- Double pendulum on a cart (dpc) simulation model☆14Aug 12, 2019Updated 6 years ago
- A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions☆22May 5, 2019Updated 7 years ago
- ☆24Jun 26, 2022Updated 3 years ago