dvruette / barrel-rec-pytorchView external linksLinks
☆53Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for barrel-rec-pytorch
Users that are interested in barrel-rec-pytorch are comparing it to the libraries listed below
Sorting:
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- ☆23Jun 18, 2024Updated last year
- Simplex Random Feature attention, in PyTorch☆75Oct 10, 2023Updated 2 years ago
- ☆68Aug 16, 2024Updated last year
- Solve puzzles. Learn CUDA.☆63Dec 13, 2023Updated 2 years ago
- utilities to facilitate working with codebases that don't ascribe to normal package management paradigms, e.g. ML research code that can …☆13Nov 26, 2022Updated 3 years ago
- ☆13Jan 15, 2025Updated last year
- A collection of various custom nodes for ComfyUI (Work in progress)☆14Jun 9, 2025Updated 8 months ago
- A scalable Dreamer implementation in JAX☆10May 22, 2022Updated 3 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Sep 18, 2022Updated 3 years ago
- Adaptive Subgoal Search☆20Apr 3, 2023Updated 2 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- Fluid Language Model Benchmarking☆26Sep 16, 2025Updated 5 months ago
- video prediction and world model research☆14Jun 10, 2022Updated 3 years ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆167Jan 16, 2025Updated last year
- ☆40Jan 5, 2024Updated 2 years ago
- A wrapper for TensorBoard SummaryWriter with real-time terminal visualization using the Rich library.☆18Nov 5, 2023Updated 2 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Annotated version of the Mamba paper☆496Feb 27, 2024Updated last year
- Automatically remove watermarks from illustrations using AI (Stable Diffusion).☆20Dec 17, 2024Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Mar 18, 2024Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated last year
- A minimal home grid world environment to evaluate language understanding in interactive agents.☆24Sep 6, 2023Updated 2 years ago
- minimal C implementation of speculative decoding based on llama2.c☆25Jul 15, 2024Updated last year
- Generate a cute welcome message for yourself each day☆22Mar 30, 2023Updated 2 years ago
- ☆21Apr 14, 2025Updated 10 months ago
- ☆135Nov 24, 2023Updated 2 years ago
- Collection of autoregressive model implementation☆85Updated this week
- QLoRA for Masked Language Modeling☆22Sep 11, 2023Updated 2 years ago
- ☆22Dec 11, 2024Updated last year
- ☆21Mar 15, 2023Updated 2 years ago
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆24Nov 4, 2024Updated last year
- ☆20Jun 10, 2024Updated last year
- ☆26Sep 22, 2025Updated 4 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 3 months ago
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year