☆17 Aug 3, 2022 Updated 3 years ago
Alternatives and similar repositories for Stabilizing-Off-Policy-RL
Users that are interested in Stabilizing-Off-Policy-RL are comparing it to the libraries listed below.
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni… ☆14 Feb 27, 2023 Updated 3 years ago
- Large language models to diffusion finetuning code ☆25 Jun 2, 2025 Updated 9 months ago
- Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP ☆11 Jan 29, 2024 Updated 2 years ago
- Adding Dreamer-v3's implementation tricks to CleanRL's PPO ☆14 May 19, 2023 Updated 2 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code ☆18 Mar 1, 2021 Updated 5 years ago
- S. M. Ali Eslami et al. Attend, Infer, Repeat: Fast Scene Understanding with Generative Models ICML16 ☆14 Sep 27, 2018 Updated 7 years ago
- A collection of meta-learning algorithms in Jax ☆24 Sep 3, 2022 Updated 3 years ago
- Original tensorflow implementation of SILOT (Spatially Invariant, Label-free Object Tracking). ☆13 Mar 24, 2023 Updated 3 years ago
- Self-Supervised Attention-Aware Reinforcement Learning ☆18 May 20, 2022 Updated 3 years ago
- krazy grid world ☆25 Mar 2, 2020 Updated 6 years ago
- LED : Light Enhanced Depth Estimation at Night ☆14 Dec 9, 2025 Updated 3 months ago
- The homework of robos learning base. ☆11 May 23, 2023 Updated 2 years ago
- ☆23 Mar 10, 2026 Updated 2 weeks ago
- Implementation of MuZero with PyTorch, based on the pseudocode from DeepMind (https://arxiv.org/src/1911.08265v2/anc/pseudocode.py). ☆33 Aug 14, 2022 Updated 3 years ago
- Unlock smooth and continuous data generation for robotics with Flow Matching! Transform simple noise into precise, fluid robot actions an… ☆19 Jan 17, 2025 Updated last year
- DrQ-v2: Improved Data-Augmented Reinforcement Learning ☆432 May 31, 2022 Updated 3 years ago
- The official Python library for Formulaic ☆18 Apr 25, 2024 Updated last year
- Truncated Normal Distribution in PyTorch ☆87 Dec 19, 2023 Updated 2 years ago
- COMP760 Lecture Notes ☆31 Jan 13, 2023 Updated 3 years ago
- Official Release of NeurIPS 2020 Spotlight paper "Generative Neurosymbolic Machines" ☆36 Mar 9, 2024 Updated 2 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan … ☆72 Feb 2, 2023 Updated 3 years ago
- ☆53 Jan 20, 2023 Updated 3 years ago
- A raytracer written with webgpu (wgpu-rs). ☆13 Nov 19, 2020 Updated 5 years ago
- This repository provides a framework to serve LLM (Large Language Model) based applications such as Chatbot. ☆18 Apr 20, 2023 Updated 2 years ago
- Codebase of NeurIPS 2022 paper "Planning for Sample Efficient Imitation Learning" ☆41 Oct 25, 2022 Updated 3 years ago
- DrQ: Data regularized Q ☆420 Jan 13, 2023 Updated 3 years ago
- Official python implementation of ASGRL in ICML 2022 paper: Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill D… ☆20 Oct 5, 2022 Updated 3 years ago
- A model that generates a recipe from a dish name and ingredient names, built with reference to the ratsnlp, KOGPT2, and recipegpt GitHub repositories. ☆11 Dec 28, 2021 Updated 4 years ago
- Improving upon state of the art cooperative deep reinforcement learning in StarCraft II ☆13 May 16, 2019 Updated 6 years ago
- Experiments and content for the "Accelerating hyperbolic t-SNE" paper. ☆15 Aug 29, 2024 Updated last year
- ☆58 Jun 6, 2023 Updated 2 years ago
- ☆47 Sep 24, 2024 Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆31 Jan 31, 2023 Updated 3 years ago
- Annotated implementation of vanilla Transformers to guide through all the ambiguities. ☆10 Jun 20, 2025 Updated 9 months ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7 ☆20 Feb 14, 2019 Updated 7 years ago
- PlaNet: Learning Latent Dynamics for Planning from Pixels ☆10 Feb 13, 2020 Updated 6 years ago
- Reinforcement Learning with Latent Flow ☆44 Mar 25, 2021 Updated 5 years ago
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction]. ☆16 Oct 21, 2023 Updated 2 years ago
- An experiment to compare the performance of Rust and Cython ☆16 Aug 7, 2021 Updated 4 years ago