stas00 / ml-ways
ML/DL Math and Method notes
☆57 Updated 11 months ago
Related projects
Alternatives and complementary repositories for ml-ways
- This repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po… ☆85 Updated last year
- Experiment in using Tangent to autodiff Triton ☆71 Updated 9 months ago
- Collection of autoregressive model implementations ☆66 Updated this week
- Make triton easier ☆41 Updated 4 months ago
- Supercharge huggingface transformers with model parallelism. ☆74 Updated last month
- ☆76 Updated 5 months ago
- Utilities for Training Very Large Models ☆56 Updated last month
- Just some miscellaneous utility functions / decorators / modules related to PyTorch and Accelerate to help speed up implementation of new… ☆117 Updated 3 months ago
- LLM training in simple, raw C/CUDA ☆12 Updated last month
- TorchFix - a linter for PyTorch-using code with autofix support ☆98 Updated last month
- ☆72 Updated 4 months ago
- Explorations into the proposal from the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients" ☆84 Updated 2 months ago
- ☆76 Updated 6 months ago
- Genalog is an open-source, cross-platform Python package allowing generation of synthetic document images with custom degradations and te… ☆42 Updated 9 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆83 Updated last week
- Proof-of-concept of global switching between numpy/jax/pytorch in a library. ☆18 Updated 4 months ago
- Large-scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still a work in progress)* ☆80 Updated 10 months ago
- Automatically take good care of your preemptible TPUs ☆31 Updated last year
- Utilities for PyTorch distributed ☆23 Updated last year
- ☆133 Updated 9 months ago
- ☆35 Updated 7 months ago
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of… ☆86 Updated 3 months ago
- ☆20 Updated last year
- 🤝 Trade any tensors over the network ☆30 Updated last year
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 Updated 6 months ago
- ☆20 Updated last year
- CUDA and Triton implementations of Flash Attention with SoftmaxN. ☆66 Updated 5 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆35 Updated 3 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆43 Updated this week