☆53Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for barrel-rec-pytorch
Users that are interested in barrel-rec-pytorch are comparing it to the libraries listed below
- Parallel Associative Scan for Language Models☆18Jan 8, 2024Updated 2 years ago
- Code implementing "Efficient Parallelization of a Ubiquitous Sequential Computation" (Heinsen, 2023)☆98Dec 5, 2024Updated last year
- Efficient PScan implementation in PyTorch☆17Jan 2, 2024Updated 2 years ago
- ☆19Dec 4, 2025Updated 3 months ago
- ☆23Jun 18, 2024Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Apr 22, 2025Updated 10 months ago
- ☆68Aug 16, 2024Updated last year
- Simplex Random Feature attention, in PyTorch☆76Oct 10, 2023Updated 2 years ago
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Solve puzzles. Learn CUDA.☆62Dec 13, 2023Updated 2 years ago
- A collection of various custom nodes for ComfyUI (Work in progress)☆14Jun 9, 2025Updated 9 months ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- ☆15Mar 31, 2022Updated 3 years ago
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Feb 18, 2025Updated last year
- A scalable Dreamer implementation in JAX☆10May 22, 2022Updated 3 years ago
- Adaptive Subgoal Search☆20Apr 3, 2023Updated 2 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆55Sep 18, 2022Updated 3 years ago
- video prediction and world model research☆14Jun 10, 2022Updated 3 years ago
- Grokking on modular arithmetic in less than 150 epochs in MLX☆14Oct 24, 2024Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Jan 4, 2024Updated 2 years ago
- Official implementation of the paper "Linear Transformers with Learnable Kernel Functions are Better In-Context Models"☆169Jan 16, 2025Updated last year
- Transformer experiments☆16May 8, 2023Updated 2 years ago
- Fluid Language Model Benchmarking☆26Sep 16, 2025Updated 5 months ago
- ☆40Jan 5, 2024Updated 2 years ago
- ☆22May 3, 2022Updated 3 years ago
- Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…☆21Feb 27, 2023Updated 3 years ago
- A wrapper for TensorBoard SummaryWriter with real-time terminal visualization using the Rich library.☆18Nov 5, 2023Updated 2 years ago
- E-MAML and RL-MAML baselines implemented in TensorFlow v1☆17Dec 7, 2019Updated 6 years ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Annotated version of the Mamba paper☆497Feb 27, 2024Updated 2 years ago
- Fine-tune your Florence-2 model easily☆21Jul 27, 2024Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆82Mar 18, 2024Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated last year
- A minimal home grid world environment to evaluate language understanding in interactive agents.☆24Sep 6, 2023Updated 2 years ago
- ☆22Apr 14, 2025Updated 10 months ago
- Accelerated First Order Parallel Associative Scan☆195Jan 7, 2026Updated 2 months ago
- Generate a cute welcome message for yourself each day☆22Mar 30, 2023Updated 2 years ago
- ☆135Nov 24, 2023Updated 2 years ago
- ☆20Jun 10, 2024Updated last year