ai-glimpse / toyrl

Reinforcement learning is awesome!

☆13

Alternatives and similar repositories for toyrl

Users that are interested in toyrl are comparing it to the libraries listed below

Rose-STL-Lab / AutoSTPP
Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…
☆24 Updated 7 months ago
ictorv / Large-Language-Pretraining
Building large language foundational model
☆9 Updated 2 months ago
catid / spectral_ssm
Implementation of Spectral State Space Models
☆16 Updated last year
EzgiKorkmaz / generalization-reinforcement-learning
A Survey Analyzing Generalization in Deep Reinforcement Learning
☆32 Updated 6 months ago
codingfisch / flashrl
Fast reinforcement learning 💨
☆24 Updated 2 months ago
google / werewolf_arena
☆14 Updated 9 months ago
ComputationalRobotics / TRAC
This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …
☆25 Updated 2 weeks ago
grasp-lyrl / low-dimensional-deepnets
☆16 Updated last year
uvadlc / uvadlc_practicals_2021
Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition
☆10 Updated 2 years ago
cambridge-mlg / jolt
☆11 Updated 2 months ago
smearle / autoverse
Generative cellular automaton-like learning environments for RL.
☆19 Updated 3 months ago
ml-jku / DIffUCO
☆47 Updated 2 months ago
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆31 Updated last year
facebookresearch / ssorl
Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories
☆42 Updated last year
lab-v2 / pyreason-gym
An OpenAI wrapper for PyReason to use in a Grid World reinforcement learning setting
☆31 Updated last year
cwj22 / BeT-AIL
☆11 Updated last year
facebookresearch / macta
MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection
☆46 Updated 2 years ago
catid / dataloader
High-performance tokenized language data-loader for Python C++ extension
☆13 Updated 9 months ago
NVlabs / gbrl_sb3
GBRL-based Actor-Critic algorithms implemented in stable-baselines3
☆34 Updated last month
kyegomez / MobileVLM
Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …
☆16 Updated last year
1danielr / auto-gb
Automatic Differentiation for Gradient Boosted Decision Trees.
☆13 Updated 2 years ago
nikitadhawan / natural
☆43 Updated 6 months ago
theOGognf / rl8
A high throughput, end-to-end RL library for infinite horizon tasks.
☆20 Updated 11 months ago
CLAIRE-Labo / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆78 Updated 3 weeks ago
jhoon-cho / MBTL
Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024)
☆24 Updated 4 months ago
RylanSchaeffer / Stanford-AI-Alignment-Double-Descent-Tutorial
Code for Arxiv Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle
☆25 Updated last year
google-deepmind / agent_debugger
Causal Analysis of Agent Behavior for AI Safety
☆18 Updated last year
HazyResearch / Accelerated-PCA
Accelerated Stochastic Power Iteration with Momentum
☆9 Updated 7 years ago
CausalML-Lab / PCMCI-Omega
Code for PCMCI-Ω algorithm from the NeurIPS'23 paper "Causal Discovery in Semi-Stationary Time Series"
☆17 Updated 7 months ago
RandallBalestriero / SplineLLM
☆16 Updated last year