tinkoff-ai/palbert

tinkoff-ai / palbert

Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight

☆37

Alternatives and similar repositories for palbert

Users who are interested in palbert are comparing it to the libraries listed below.

tinkoff-ai / cnf
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12 · Jan 31, 2023 · Updated 3 years ago
tinkoff-ai / lb-sac
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…
☆21 · Feb 27, 2023 · Updated 3 years ago
tinkoff-ai / open-tlab
Example proposals for applying to Open.TLab
☆27 · Dec 15, 2022 · Updated 3 years ago
corl-team / katakomba
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆43 · Aug 22, 2023 · Updated 2 years ago
tinkoff-ai / probabilistic-embeddings
"Probabilistic Embeddings Revisited" paper official repository
☆31 · Dec 30, 2022 · Updated 3 years ago
tinkoff-ai / katakomba
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆79 · Jun 23, 2023 · Updated 3 years ago
tinkoff-ai / ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆63 · Aug 3, 2023 · Updated 2 years ago
tinkoff-ai / exact
The original PyTorch implementation of "EXACT: How to Train Your Accuracy"
☆10 · Sep 22, 2022 · Updated 3 years ago
dunnolab / NinA
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17 · Sep 22, 2025 · Updated 10 months ago
vklabmipt / implicit-unlikelihood-training
Improving Neural Text Generation with Reinforcement Learning
☆23 · Jan 13, 2021 · Updated 5 years ago
amirassov / data-science-bowl
This repository contains my solution to the Data-Science-Bowl-2018 competition
☆16 · Apr 13, 2018 · Updated 8 years ago
corl-team / counting_manifolds
Code for the reproduction of counting manifolds
☆16 · Feb 26, 2026 · Updated 4 months ago
tinkoff-ai / sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58 · Feb 3, 2023 · Updated 3 years ago
corl-team / lime
Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"
☆32 · May 28, 2025 · Updated last year
corl-team / ad-eps
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
☆35 · Sep 18, 2024 · Updated last year
dunnolab / phi-module
[ICML 2025 GenBio Workshop] Official Implementation for "Electrostatics from Laplacian Eigenbasis for Neural Network Interatomic Potentia…
☆18 · Jun 12, 2025 · Updated last year
ENOT-AutoDL / gpt-j-6B-tensorrt-int8
GPT-J 6B inference on TensorRT with INT-8 precision
☆11 · Apr 5, 2023 · Updated 3 years ago
dunnolab / vintix
Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)
☆51 · May 23, 2025 · Updated last year
corl-team / CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆652 · Feb 10, 2024 · Updated 2 years ago
intsystems / hippotrainer
[BMM 24-25] HippoTrainer: Gradient-Based Hyperparameter Optimization
☆11 · May 6, 2025 · Updated last year
FLAIROx / cultural-accumulation
☆16 · Jul 16, 2024 · Updated 2 years ago
alxmamaev / ultimate_tts
☆13 · Aug 7, 2021 · Updated 4 years ago
corl-team / steering-reasoning
Official implementation of "Steering LLM Reasoning Through Bias-Only Adaptation" and "Small Vectors, Big Effects: A Mechanistic Study of …
☆54 · Oct 7, 2025 · Updated 9 months ago
radarFudan / mamba-minimal-jax
☆36 · Nov 22, 2024 · Updated last year
Howuhh / sac-n-jax
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆56 · May 21, 2023 · Updated 3 years ago
glassroom / heinsen_sequence
Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)
☆98 · Dec 5, 2024 · Updated last year
corl-team / rebased
Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"
☆169 · Jan 16, 2025 · Updated last year
Bradley-Butcher / Conformers
Unofficial implementation of Conformal Language Modeling by Quach et al
☆29 · Jul 15, 2023 · Updated 3 years ago
IgorKhramtsov / DeEsser
Experimental university project. Audio processing program to get rid of excessive prominence of sibilant consonants
☆11 · Jul 27, 2021 · Updated 4 years ago
fgvbrt / retro_contest
☆15 · Mar 31, 2023 · Updated 3 years ago
alexeykarnachev / dialogs_data_parsers
Parsers and crawlers for Russian dialog datasets.
☆15 · Sep 6, 2021 · Updated 4 years ago
uoe-agents / CMID
☆13 · Apr 25, 2024 · Updated 2 years ago
HCIILAB / Water-Meter-Number-DataSet
Water-meter images captured by camera and labeled with the water-meter number, for research on water-meter image recognition.
☆20 · Jan 12, 2019 · Updated 7 years ago
pilot7747 / sldl
Single-line inference of SOTA deep learning models
☆28 · Jan 22, 2023 · Updated 3 years ago
corl-team / flexsae
Official Triton kernels for TopK and HierarchicalTopK Sparse Autoencoder decoders.
☆29 · Sep 29, 2025 · Updated 9 months ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
zombie-einstein / jaxpr-viz
Jaxpr Visualisation Tool
☆37 · Dec 22, 2024 · Updated last year
DT6A / ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆19 · Oct 22, 2023 · Updated 2 years ago
yandex-research / gan_vs_diff_sr
Does Diffusion Beat GAN in Image Super Resolution?
☆12 · May 27, 2024 · Updated 2 years ago