erwanplantec / LNDP
☆50 · Updated last month
Related projects
Alternatives and complementary repositories for LNDP
- ☆50 · Updated 5 months ago
- Implementation of Soft Actor Critic and some of its improvements in Pytorch ☆41 · Updated this week
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆84 · Updated 2 months ago
- ☆36 · Updated 2 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ… ☆27 · Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆36 · Updated last year
- Meta-Learning for Compositionality (MLC) for modeling human behavior ☆138 · Updated 10 months ago
- Triton Implementation of HyperAttention Algorithm ☆46 · Updated 11 months ago
- A State-Space Model with Rational Transfer Function Representation. ☆70 · Updated 6 months ago
- ☆29 · Updated 2 months ago
- ☆17 · Updated 5 months ago
- tinybig for deep function learning ☆36 · Updated this week
- GoldFinch and other hybrid transformer components ☆39 · Updated 4 months ago
- Codes for the paper "A mathematical perspective on Transformers". ☆32 · Updated 4 months ago
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group ☆36 · Updated last month
- ☆25 · Updated last month
- Evaluating the Mamba architecture on the Othello game ☆43 · Updated 6 months ago
- ☆22 · Updated 5 months ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆119 · Updated last year
- Griffin MQA + Hawk Linear RNN Hybrid ☆85 · Updated 6 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆59 · Updated 3 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆39 · Updated 11 months ago
- Code for the "Cultural evolution in populations of Large Language Models" paper ☆28 · Updated 3 weeks ago
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction". ☆32 · Updated 9 months ago
- Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf) ☆42 · Updated 3 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆37 · Updated 5 months ago
- Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models ☆12 · Updated 8 months ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)] ☆87 · Updated 11 months ago
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode… ☆78 · Updated 2 months ago