erfanzar / EasyDeLLinks

Accelerate, Optimize performance with streamlined training and serving options with JAX.

☆323

Alternatives and similar repositories for EasyDeL

Users that are interested in EasyDeL are comparing it to the libraries listed below

Sorting:

marin-community / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆685Updated this week
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆216Updated last year
google-deepmind / nanodo
☆285Updated last year
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆174Updated 5 months ago
Sea-Snell / JAX_llama
Inference code for LLaMA models in JAX
☆120Updated last year
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆240Updated 2 months ago
MatX-inc / seqax
seqax = sequence modeling + JAX
☆168Updated 4 months ago
modula-systems / modula
🧱 Modula software package
☆307Updated 3 months ago
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆87Updated last year
huggingface / picotron_tutorial
☆225Updated last month
jax-ml / jax-llm-examples
Minimal yet performant LLM examples in pure JAX
☆202Updated 2 months ago
yixiaoer / tpux
A set of Python scripts that makes your experience on TPU better
☆54Updated 2 months ago
imoneoi / multipack
Multipack distributed sampler for fast padding-free training of LLMs
☆202Updated last year
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆171Updated 5 months ago
mlfoundations / open_lm
A repository for research on medium sized language models.
☆520Updated 5 months ago
imbue-ai / carbs
Cost aware hyperparameter tuning algorithm
☆175Updated last year
HenryNdubuaku / nanodl
A Jax-based library for building transformers, includes implementations of GPT, Gemma, LlaMa, Mixtral, Whisper, SWin, ViT and more.
☆297Updated last year
HazyResearch / based
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
☆243Updated 5 months ago
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆210Updated 2 years ago
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆135Updated 11 months ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆277Updated last year
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆248Updated last year
erfanzar / eformer
(EasyDel Former) is a utility library designed to simplify and enhance the development in JAX
☆28Updated this week
huggingface / kernels
Load compute kernels from the Hub
☆335Updated this week
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆257Updated 2 years ago
cloneofsimo / min-fsdp
☆91Updated last year
LeonGuertler / UnstableBaselines
☆106Updated last month
ethansmith2000 / fsdp_optimizers
supporting pytorch FSDP for optimizers
☆84Updated 11 months ago
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆132Updated last year
erfanzar / jax-flash-attn2
A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…
☆30Updated 8 months ago