SpirinEgor / gulag
GULAG: GUessing LAnGuages with neural networks
☆13Updated 2 years ago
Alternatives and similar repositories for gulag:
Users that are interested in gulag are comparing it to the libraries listed below
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Updated 2 years ago
- Примеры пропозалов для подачи заявки в Open.TLab☆27Updated 2 years ago
- Vintix: Action Model via In-Context Reinforcement Learning - - —☆34Updated last month
- ☆20Updated 9 months ago
- FusionBrain Challenge 2.0: creating multimodal multitask model☆16Updated 2 years ago
- Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"☆28Updated 2 months ago
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning - - —☆66Updated 2 months ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆39Updated last year
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"☆29Updated 7 months ago
- ☆18Updated 2 weeks ago
- Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022☆28Updated 2 years ago
- Russian dialog datasets parsers and crawlers.☆16Updated 3 years ago
- Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023☆52Updated 2 years ago
- Framework for probing tasks☆26Updated last year
- RuTransform: python framework for adversarial attacks and text data augmentation for Russian☆19Updated last year
- ☆21Updated 5 months ago
- ☆13Updated 2 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Updated 2 years ago
- The official implementation of the ChordMixer architecture.☆61Updated last year
- ☆22Updated last year
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆71Updated last year
- eco4cast library aims to reduce carbon footprint of machine learning models with predictive cloud computing scheduling☆17Updated 7 months ago
- Russian Drug Reaction Corpus (RuDReC)☆10Updated 4 years ago
- Compression schema for gradients of activations in backward pass☆44Updated last year
- The source code for the paper "Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization" (ICLR 2025)☆26Updated last month
- Skoltech NLA 2024 course.☆25Updated 4 months ago
- Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"☆84Updated last year
- Active learning☆78Updated 2 years ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆62Updated 6 months ago
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆52Updated last year