charlesfrye / cuda-substrings

Because it's there.

☆16

Alternatives and similar repositories for cuda-substrings

Users that are interested in cuda-substrings are comparing it to the libraries listed below

xjdr-alt / muzero_sketch
☆40 Updated last year
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58 Updated last year
leloykun / modded-nanogpt
NanoGPT (124M) quality in 2.67B tokens
☆28 Updated 2 months ago
kanpuriyanawab / picograd
Rust Implementation of micrograd
☆53 Updated last year
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆22 Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58 Updated last month
okarthikb / state-space-models
☆28 Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated last year
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆107 Updated 8 months ago
RiddleHe / llm-interp
A collection of lightweight interpretability scripts to understand how LLMs think
☆66 Updated last week
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆67 Updated last week
dorjeduck / momograd
A Learning Journey: Micrograd in Mojo 🔥
☆63 Updated last year
facebookresearch / fastgen
Simple high-throughput inference library
☆149 Updated 6 months ago
Narsil / hf-chat
☆25 Updated 11 months ago
areu01or00 / Tensor-Slayer
Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…
☆26 Updated 5 months ago
JD-P / RetroInstruct
Synthetic data derived by templating, few-shot prompting, transformations on public domain corpora, and Monte Carlo tree search.
☆32 Updated last month
catid / lllm
Latent Large Language Models
☆19 Updated last year
HazyResearch / train-tk
train with kittens!
☆63 Updated last year
sfcompute / tinynarrations
A synthetic story narration dataset to study small audio LMs.
☆31 Updated last year
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆18 Updated 3 months ago
joey00072 / Attention-as-graph
An alternative way to calculate self-attention
☆18 Updated last year
PrimeIntellect-ai / pccl
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆138 Updated 2 months ago
NousResearch / StripedHyenaTrainer
☆62 Updated last year
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆74 Updated 2 years ago
enjalot / latent-sae
Training code for Sparse Autoencoders on Embedding models
☆38 Updated 8 months ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13 Updated last year
joey00072 / ohara
Collection of autoregressive model implementations
☆86 Updated 6 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆30 Updated 5 months ago
charlesfrye / minimodal
A miniature version of Modal
☆21 Updated last year
Zyphra / zcookbook
Training hybrid models for dummies.
☆29 Updated 3 weeks ago