AnswerDotAI / fasttransform
Transform is the main building block of data pipelines in fastai. And elsewhere if you want.
☆16Updated this week
Alternatives and similar repositories for fasttransform:
Users that are interested in fasttransform are comparing it to the libraries listed below
- ☆9Updated 5 months ago
- Tools to make language models a bit easier to use☆39Updated this week
- ☆38Updated last month
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆42Updated last year
- Knowledge Graph Generator app☆30Updated 11 months ago
- Have UV deal with all your Jupyter deps.☆24Updated 6 months ago
- You should use PySR to find scaling laws. Here's an example.☆33Updated last year
- ☆28Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated last month
- ☆38Updated 8 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 2 weeks ago
- Mixtral finetuning☆19Updated last year
- Simple GRPO scripts and configurations.☆59Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- NLP with Rust for Python 🦀🐍☆61Updated 10 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆49Updated this week
- ☆20Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆48Updated last week
- ☆22Updated last year
- Training code for Sparse Autoencoders on Embedding models☆38Updated last month
- Chat Markup Language conversation library☆55Updated last year
- BH hackathon☆14Updated 11 months ago
- A library to use `modal` as a backend for `joblib`.☆28Updated 2 months ago
- A sample pattern for running CI tests on Modal☆16Updated 6 months ago
- LLM training in simple, raw C/CUDA☆14Updated 3 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated last month
- A fork of sqlite-utils with CLI etc removed☆14Updated 4 months ago
- An introduction to LLM Sampling☆77Updated 3 months ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆27Updated 5 months ago
- ☆23Updated this week