Explorations into the proposed Streaming Deep Reinforcement Learning, from University of Alberta
☆30May 18, 2026Updated last week
Alternatives and similar repositories for streaming-deep-rl
Users that are interested in streaming-deep-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆20Apr 17, 2026Updated last month
- UM1 test programs and sample code☆11Jul 25, 2022Updated 3 years ago
- Exploration into the Firefly algorithm in Pytorch☆41Feb 14, 2025Updated last year
- Suite of Quantum Characterization, Verification, and Validation (QCVV) tools for quantum computing☆20Apr 13, 2026Updated last month
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Sep 27, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk☆47Jul 16, 2023Updated 2 years ago
- [ICLR 2025] UniCO: On Unified Combinatorial Optimization via Problem Reduction to Matrix-Encoded General TSP☆16Jun 20, 2025Updated 11 months ago
- ☆18Oct 6, 2025Updated 7 months ago
- [ICML 2026] InnoEval: On Research Idea Evaluation as a Knowledge-Grounded, Multi-Perspective Reasoning Problem☆22Apr 7, 2026Updated last month
- Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"☆182Jun 20, 2024Updated last year
- ☆12Oct 7, 2020Updated 5 years ago
- Implementation of a multimodal diffusion transformer in Pytorch☆108Jun 24, 2024Updated last year
- ☆46Jul 12, 2024Updated last year
- Hazardous Materials Sign Dataset☆14Jul 12, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆30Oct 24, 2025Updated 7 months ago
- ☆14Aug 22, 2025Updated 9 months ago
- RLMM is a reinforcement learning env for molecular modeling (currently only protein-ligand docking).☆11Nov 14, 2022Updated 3 years ago
- Implementation of Diffusion Policy, Toyota Research's supposed breakthrough in leveraging DDPMs for learning policies for real-world Robo…☆135Jul 6, 2024Updated last year
- Implementation of Metaformer, but in an autoregressive manner☆26Jun 21, 2022Updated 3 years ago
- Source code for Pathfinding in Stochastic Environments paper.☆15Oct 27, 2022Updated 3 years ago
- Local Attention - Flax module for Jax☆22May 26, 2021Updated 5 years ago
- ☆14Jun 21, 2023Updated 2 years ago
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆32Mar 17, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Differentiable and GPU accelerated scattering covariance statistics on the sphere☆16May 27, 2025Updated last year
- Autonomous robot exploration in unknown outdoor environment☆10Nov 13, 2018Updated 7 years ago
- A simple python script that, given a location and a date, uses the Nasa Earth API to show a photo taken by the Landsat 8 satellite. The s…☆44Apr 13, 2022Updated 4 years ago
- Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch☆154May 2, 2025Updated last year
- Concept Relevance Propagation for Localization Models, accepted at SAIAD workshop at CVPR 2023.☆15Jan 16, 2024Updated 2 years ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆126Jul 26, 2024Updated last year
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30May 31, 2022Updated 3 years ago
- The code for the paper "Pre-trained Vision-Language Models Learn Discoverable Concepts"☆21Jun 5, 2024Updated last year
- ☆21Aug 8, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Navigation_DRL_UAV 是一个基于深度强化学习(DRL)的无人机导航平台,用于在复杂未知环境中训练无人机导航策略。该平台基于 AirSim 和 Stable-Baselines3,包含多旋翼和固定翼无人机的运动学模型,并提供多种 UE4 环境用于训练和测试。☆25Apr 25, 2025Updated last year
- embedding playground☆15Mar 6, 2025Updated last year
- REOBench: Benchmarking Robustness of Earth Observation Foundation Models☆24May 22, 2026Updated last week
- ☆10Mar 24, 2025Updated last year
- Parrot robot Jumping Sumo☆10Jan 8, 2016Updated 10 years ago
- Reliable, minimal and scalable library for pretraining foundation and world models☆234May 17, 2026Updated last week
- [ICLR 2021] Group Equivariant Generative Adversarial Networks.☆14May 6, 2021Updated 5 years ago