lucidrains / streaming-deep-rl
Explorations into the proposed Streaming Deep Reinforcement Learning, from University of Alberta
☆17Updated 3 months ago
Alternatives and similar repositories for streaming-deep-rl:
Users that are interested in streaming-deep-rl are comparing it to the libraries listed below
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 4 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated 11 months ago
- [NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models☆15Updated 2 months ago
- Implementation of Spectral State Space Models☆16Updated 11 months ago
- Exploration into the Firefly algorithm in Pytorch☆35Updated last week
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Implementation of the model "Hedgehog" from the paper: "The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry"☆13Updated 11 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆20Updated 3 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆44Updated this week
- Minimum Description Length probing for neural network representations☆18Updated 3 weeks ago
- ☆17Updated 3 months ago
- ☆12Updated 5 months ago
- Official code for the paper: "Metadata Archaeology"☆19Updated last year
- RS-IMLE☆38Updated 2 months ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆23Updated last month
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models☆21Updated 7 months ago
- Code for paper Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training☆22Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 8 months ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 2 months ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Updated last year
- Implementation of the proposed Spline-Based Transformer from Disney Research☆86Updated 3 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆17Updated 11 months ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Updated last year
- Generative cellular automaton-like learning environments for RL.☆19Updated 3 weeks ago