☆53Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for barrel-rec-pytorch
Users that are interested in barrel-rec-pytorch are comparing it to the libraries listed below.
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)☆98Dec 5, 2024Updated last year
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- ☆19Dec 4, 2025Updated 5 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆74Apr 22, 2025Updated last year
- ☆13May 4, 2026Updated 3 weeks ago
- Simplex Random Feature attention, in PyTorch☆76Oct 10, 2023Updated 2 years ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- ☆54May 20, 2024Updated 2 years ago
- Pytorch Implementation of Residual Multiplicative Filter Networks, NeurIPS 2022☆22Nov 17, 2022Updated 3 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Solve puzzles. Learn CUDA.☆62Dec 13, 2023Updated 2 years ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- A collection of various custom nodes for ComfyUI (Work in progress)☆14Jun 9, 2025Updated 11 months ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆169Jan 16, 2025Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Mar 18, 2024Updated 2 years ago
- ☆68Aug 16, 2024Updated last year
- utilities to facilitate working with codebases that don't ascribe to normal package management paradigms, e.g. ML research code that can …☆13Nov 26, 2022Updated 3 years ago
- ☆34May 14, 2025Updated last year
- ☆40Jan 5, 2024Updated 2 years ago
- Annotated version of the Mamba paper☆501Feb 27, 2024Updated 2 years ago
- minGPT in JAX☆49Jan 10, 2022Updated 4 years ago
- ☆50Mar 14, 2024Updated 2 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆57Sep 18, 2022Updated 3 years ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated 2 years ago
- Fluid Language Model Benchmarking☆30Sep 16, 2025Updated 8 months ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- A discrete sequential VAE☆41Apr 22, 2020Updated 6 years ago
- Example proposals for applying to Open.TLab☆27Dec 15, 2022Updated 3 years ago
- Collection of autoregressive model implementations☆85Feb 23, 2026Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 7 months ago
- Accelerated First Order Parallel Associative Scan☆197Jan 7, 2026Updated 4 months ago
- A minimal home grid world environment to evaluate language understanding in interactive agents.☆24Sep 6, 2023Updated 2 years ago
- ARC Community Project☆22Aug 2, 2024Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)☆34Mar 7, 2025Updated last year
- Code for the paper "PALBERT: Teaching ALBERT to Ponder", NeurIPS 2022 Spotlight☆37Apr 8, 2023Updated 3 years ago