☆53Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for barrel-rec-pytorch
Users that are interested in barrel-rec-pytorch are comparing it to the libraries listed below.
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)☆98Dec 5, 2024Updated last year
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- ☆19Dec 4, 2025Updated 3 months ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Apr 22, 2025Updated 11 months ago
- ☆13Jan 15, 2025Updated last year
- ☆23Jun 18, 2024Updated last year
- Improving Neural Text Generation with Reinforcement Learning☆23Jan 13, 2021Updated 5 years ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- ☆54May 20, 2024Updated last year
- PyTorch implementation of Residual Multiplicative Filter Networks, NeurIPS 2022☆22Nov 17, 2022Updated 3 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Solve puzzles. Learn CUDA.☆62Dec 13, 2023Updated 2 years ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆169Jan 16, 2025Updated last year
- E-MAML and RL-MAML baselines implemented in TensorFlow v1☆17Dec 7, 2019Updated 6 years ago
- ☆68Aug 16, 2024Updated last year
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- utilities to facilitate working with codebases that don't ascribe to normal package management paradigms, e.g. ML research code that can …☆13Nov 26, 2022Updated 3 years ago
- Simple GRPO scripts and configurations.☆59Feb 6, 2025Updated last year
- ☆34May 14, 2025Updated 10 months ago
- ☆40Jan 5, 2024Updated 2 years ago
- Annotated version of the Mamba paper☆499Feb 27, 2024Updated 2 years ago
- ☆15May 17, 2024Updated last year
- Fluid Language Model Benchmarking☆27Sep 16, 2025Updated 6 months ago
- A scalable Dreamer implementation in JAX☆10May 22, 2022Updated 3 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Sep 18, 2022Updated 3 years ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- A custom PyTorch layer that is capable of implementing extremely wide and sparse linear layers efficiently☆51Dec 14, 2023Updated 2 years ago
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated last year
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- A discrete sequential VAE☆41Apr 22, 2020Updated 5 years ago
- Code for the paper "Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters", ICML 2022☆28Jul 10, 2022Updated 3 years ago
- Example proposals for submitting an application to Open.TLab☆27Dec 15, 2022Updated 3 years ago
- Diffusion on syntax trees for program synthesis☆484Jun 27, 2024Updated last year
- video prediction and world model research☆14Jun 10, 2022Updated 3 years ago
- Collection of autoregressive model implementations☆85Feb 23, 2026Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 5 months ago