tinkoff-ai / probabilistic-embeddings
"Probabilistic Embeddings Revisited" paper official repository
☆25Updated last year
Related projects ⓘ
Alternatives and complementary repositories for probabilistic-embeddings
- Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022☆28Updated 2 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆37Updated last year
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Updated last year
- ☆20Updated 11 months ago
- Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)☆47Updated last year
- Repository for "Generative Flow Networks as Entropy-Regularized RL" (AISTATS-2024, Oral)☆25Updated 7 months ago
- ☆25Updated last year
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆67Updated last year
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆27Updated 2 months ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆95Updated last year
- ☆29Updated 2 years ago
- ☆33Updated 10 months ago
- PyTorch implementation of the mixture distribution family with implicit reparametrisation gradients.☆19Updated 10 months ago
- Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]☆31Updated 2 months ago
- AdaCat☆49Updated 2 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Updated last year
- Official code for Long Expressive Memory (ICLR 2022, Spotlight)☆69Updated 2 years ago
- Implementation of Hierarchical Transformer Memory (HTM) for Pytorch☆72Updated 3 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆101Updated 2 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆26Updated 2 years ago
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution☆36Updated 3 years ago
- ☆56Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 2 months ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆46Updated last year
- Model Agnostic Meta Learning (MAML) implemented in Flax, the neural network library for JAX.☆19Updated 4 years ago
- Deep learning models for contextual multi-armed bandit setting☆12Updated 3 years ago
- Meta-Album meta-dataset for few-shot image classification☆24Updated last year
- ☆35Updated last year