☆18Aug 3, 2022Updated 3 years ago
Alternatives and similar repositories for Stabilizing-Off-Policy-RL
Users that are interested in Stabilizing-Off-Policy-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…☆14Feb 27, 2023Updated 3 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO☆16May 19, 2023Updated 3 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆19Mar 1, 2021Updated 5 years ago
- Code for "TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning"☆27May 19, 2024Updated 2 years ago
- A collection of meta-learning algorithms in Jax☆24Sep 3, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking).☆13Mar 24, 2023Updated 3 years ago
- Self-Supervised Attention-Aware Reinforcement Learning☆18May 20, 2022Updated 4 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆229May 19, 2024Updated 2 years ago
- Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning☆50Mar 16, 2026Updated 2 months ago
- LED : Light Enhanced Depth Estimation at Night☆15Mar 24, 2026Updated 2 months ago
- The homework of robos learning base.☆11May 23, 2023Updated 3 years ago
- Unlock smooth and continuous data generation for robotics with Flow Matching! Transform simple noise into precise, fluid robot actions an…☆23Jan 17, 2025Updated last year
- DrQ-v2: Improved Data-Augmented Reinforcement Learning☆436May 31, 2022Updated 3 years ago
- The official Python library for Formulaic☆18Apr 25, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Truncated Normal Distribution in PyTorch☆87Dec 19, 2023Updated 2 years ago
- COMP760 Lecture Notes☆33Jan 13, 2023Updated 3 years ago
- Official Release of NeurIPS 2020 Spotlight paper "Generative Neurosymbolic Machines"☆37Mar 9, 2024Updated 2 years ago
- ☆40Jun 17, 2023Updated 2 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆73Feb 2, 2023Updated 3 years ago
- ☆56Jan 20, 2023Updated 3 years ago
- ☆13Jul 25, 2023Updated 2 years ago
- ☆11Dec 13, 2021Updated 4 years ago
- A raytracer written with webgpu (wgpu-rs).☆13Nov 19, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.☆18Apr 20, 2023Updated 3 years ago
- DrQ: Data regularized Q☆422Jan 13, 2023Updated 3 years ago
- Codebase of NeurIPS 2022 paper ''Planning for Sample Efficient Imitation Learning''☆41Oct 25, 2022Updated 3 years ago
- ratsnlp, KOGPT2와 recipegpt github를 참고하여 음식명과 식재료명을 입력하면 레시피를 생성해주는 모델을 제작하였습니다!!☆11Dec 28, 2021Updated 4 years ago
- Official python implementation of ASGRL in ICML 2022 paper: Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill D…☆20Oct 5, 2022Updated 3 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II☆13May 16, 2019Updated 7 years ago
- ☆59Jun 6, 2023Updated 2 years ago
- ☆47Sep 24, 2024Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Annotated implementation of vanilla Transformers to guide through all the ambiguities.☆10Jun 20, 2025Updated 11 months ago
- PlaNet: Learning Latent Dynamics for Planning from Pixels☆10Feb 13, 2020Updated 6 years ago
- ☆19Nov 19, 2025Updated 6 months ago
- Reinforcement Learning with Latent Flow☆44Mar 25, 2021Updated 5 years ago
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- An experiment to compare the performance of Rust and Cython☆16Aug 7, 2021Updated 4 years ago
- Experiments and content for the "Accelerating hyperbolic t-SNE" paper.☆19Apr 30, 2026Updated 3 weeks ago