charlesfrye / cuda-substrings
Because it's there.
☆16 Updated 6 months ago
Alternatives and similar repositories for cuda-substrings:
Users who are interested in cuda-substrings compare it to the libraries listed below.
- ☆38 Updated 8 months ago
- Using modal.com to process FineWeb-edu data ☆20 Updated last week
- Rust Implementation of micrograd ☆51 Updated 9 months ago
- Latent Large Language Models ☆17 Updated 7 months ago
- alternative way to calculating self attention ☆18 Updated 10 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆39 Updated 2 months ago
- ANE accelerated embedding models! ☆16 Updated 4 months ago
- A synthetic story narration dataset to study small audio LMs. ☆32 Updated last year
- [WIP] Transformer to embed Danbooru labelsets ☆13 Updated last year
- The official evaluation suite and dynamic data release for MixEval. ☆11 Updated 6 months ago
- ☆20 Updated 5 months ago
- new optimizer ☆19 Updated 8 months ago
- ☆26 Updated 4 months ago
- NanoGPT (124M) quality in 2.67B tokens ☆28 Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆95 Updated last month
- ☆18 Updated 7 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆17 Updated last month
- Training hybrid models for dummies. ☆20 Updated 3 months ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data ☆30 Updated 6 months ago
- look how they massacred my boy ☆63 Updated 6 months ago
- train with kittens! ☆56 Updated 5 months ago
- Training code for Sparse Autoencoders on Embedding models ☆38 Updated last month
- ☆27 Updated 9 months ago
- NLP with Rust for Python 🦀🐍 ☆61 Updated 10 months ago
- ☆60 Updated last year
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping. ☆25 Updated last year
- A sample pattern for running CI tests on Modal ☆17 Updated this week
- A miniature version of Modal ☆20 Updated 10 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆49 Updated this week
- Simple GRPO scripts and configurations. ☆58 Updated 2 months ago