lucidrains / streaming-deep-rl
Explorations into the proposed Streaming Deep Reinforcement Learning, from University of Alberta
☆16 · Updated 2 months ago
Alternatives and similar repositories for streaming-deep-rl:
Users who are interested in streaming-deep-rl are comparing it to the libraries listed below:
- Exploration into the Firefly algorithm in Pytorch ☆33 · Updated 4 months ago
- Implementation of a holodeck, written in Pytorch ☆17 · Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling. ☆23 · Updated last year
- Multivariate Learned Adaptive Noise for Diffusion Models ☆15 · Updated last month
- RS-IMLE ☆36 · Updated last month
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement … ☆19 · Updated 2 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group ☆36 · Updated 3 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆15 · Updated 10 months ago
- Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise ☆32 · Updated 4 months ago
- ☆12 · Updated 4 months ago
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk ☆46 · Updated last year
- Model-Based Transfer Learning for Contextual Reinforcement Learning (NeurIPS 2024) ☆18 · Updated last month
- Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry" ☆13 · Updated 10 months ago
- Source code for the paper "Positional Attention: Out-of-Distribution Generalization and Expressivity for Neural Algorithmic Reasoning" ☆14 · Updated this week
- ☆24 · Updated 6 months ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf… ☆24 · Updated 3 months ago
- Implementation of the proposed Spline-Based Transformer from Disney Research ☆85 · Updated 2 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens) ☆44 · Updated 3 months ago
- ☆44 · Updated 8 months ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 · Updated 9 months ago
- Diffusing States and Matching Scores: A New Framework for Imitation Learning ☆12 · Updated 2 months ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient" ☆26 · Updated 4 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta ☆13 · Updated 2 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 · Updated this week