☆53Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for barrel-rec-pytorch
Users that are interested in barrel-rec-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆98Dec 5, 2024Updated last year
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- ☆19Dec 4, 2025Updated 4 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Apr 22, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Simplex Random Feature attention, in PyTorch☆76Oct 10, 2023Updated 2 years ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- Pytorch Implementation of Residual Multiplicative Filter Networks, NeurIPS 2022☆22Nov 17, 2022Updated 3 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Solve puzzles. Learn CUDA.☆62Dec 13, 2023Updated 2 years ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆15Oct 24, 2024Updated last year
- ☆13Aug 7, 2021Updated 4 years ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆169Jan 16, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Mar 18, 2024Updated 2 years ago
- ☆68Aug 16, 2024Updated last year
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- utilities to facilitate working with codebases that don't ascribe to normal package management paradigms, e.g. ML research code that can …☆13Nov 26, 2022Updated 3 years ago
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- ☆34May 14, 2025Updated 11 months ago
- ☆40Jan 5, 2024Updated 2 years ago
- Annotated version of the Mamba paper☆500Feb 27, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- minGPT in JAX☆48Jan 10, 2022Updated 4 years ago
- ☆15May 17, 2024Updated last year
- Fluid Language Model Benchmarking☆27Sep 16, 2025Updated 7 months ago
- ☆50Mar 14, 2024Updated 2 years ago
- A scalable Dreamer implementation in JAX☆10May 22, 2022Updated 3 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆57Sep 18, 2022Updated 3 years ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated 2 years ago
- A discrete sequential VAE☆41Apr 22, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022☆28Jul 10, 2022Updated 3 years ago
- video prediction and world model research☆14Jun 10, 2022Updated 3 years ago
- Collection of autoregressive model implementation☆85Feb 23, 2026Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 6 months ago
- Adaptive Subgoal Search☆20Apr 3, 2023Updated 3 years ago
- Accelerated First Order Parallel Associative Scan☆197Jan 7, 2026Updated 3 months ago
- ☆28Sep 22, 2025Updated 6 months ago