OptimalFoundation / nadir
Nadir: Cutting-edge PyTorch optimizers for simplicity & composability!
★ 14 · Updated last year
Alternatives and similar repositories for nadir
Users interested in nadir are comparing it to the libraries listed below.
- Our open-source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ★ 61 · Updated 2 years ago
- Amos optimizer with JEstimator lib. ★ 82 · Updated last year
- Large-scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still a work in progress)* ★ 86 · Updated 2 years ago
- Minimal PyTorch implementation of BM25 (with sparse tensors) ★ 104 · Updated 3 months ago
- ★ 34 · Updated 2 years ago
- A library to create and manage configuration files, especially for machine learning projects. ★ 79 · Updated 3 years ago
- Experiments with generating open-source language model assistants ★ 97 · Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM (Scaling Language Modeling with Pathways) in JAX (Equinox framework) ★ 190 · Updated 3 years ago
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022) ★ 134 · Updated this week
- Exploring finetuning public checkpoints on filtered 8K sequences from the Pile ★ 116 · Updated 2 years ago
- Deep learning library implemented from scratch in NumPy: Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments. ★ 53 · Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data; it should work with any Hugging Face text dataset. ★ 96 · Updated 3 years ago
- Experiments in training a new and improved T5 ★ 76 · Updated last year
- ★ 66 · Updated 3 years ago
- Like picoGPT, but for BERT. ★ 51 · Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware. ★ 68 · Updated 3 years ago
- ML/DL math and method notes ★ 66 · Updated 2 years ago
- An instruction-based benchmark for text improvements. ★ 142 · Updated 3 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models into tiny and efficient models for AI at scale ★ 157 · Updated 2 years ago
- The code from our practical dive into using Mamba for information extraction ★ 57 · Updated 2 years ago
- A diff tool for language models ★ 44 · Updated 2 years ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1 GPU + 1 Day ★ 260 · Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets. ★ 49 · Updated 2 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023 ★ 137 · Updated last year
- Code repository for the c-BTM paper ★ 108 · Updated 2 years ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022) ★ 105 · Updated 2 years ago
- My explorations into editing the knowledge and memories of an attention network ★ 35 · Updated 3 years ago
- ★ 167 · Updated 2 years ago
- Some common Hugging Face transformers in maximal update parametrization (µP) ★ 87 · Updated 3 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q… ★ 89 · Updated last year