erfanzar/eformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/erfanzar/eformer)

erfanzar / eformer

(EasyDel Former) is a utility library designed to simplify and enhance the development in JAX

☆33

Alternatives and similar repositories for eformer

Users that are interested in eformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

erfanzar / InstinctiveDiffuse
View on GitHub
A cutting-edge text-to-image generator model that leverages state-of-the-art Stable Diffusion Model Type to produce high-quality, realist…
☆13Mar 4, 2024Updated 2 years ago
erfanzar / OST-OpenSourceTransformers
View on GitHub
OST Collection: An AI-powered suite of models that predict the next word matches with remarkable accuracy (Text Generative Models). OST C…
☆16Nov 16, 2023Updated 2 years ago
erfanzar / Xerxes-Agents
View on GitHub
Agents for intelligence and coordination
☆26Updated this week
erfanzar / EasyDeL
View on GitHub
Accelerate, Optimize performance with streamlined training and serving options with JAX.
☆368Updated this week
erfanzar / jax-flash-attn2
View on GitHub
A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…
☆34Mar 4, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ehsanAghahoseini / slider
View on GitHub
☆19Oct 3, 2023Updated 2 years ago
JesseFarebro / flax-mup
View on GitHub
Maximal Update Parametrization (μP) with Flax & Optax.
☆16Dec 27, 2023Updated 2 years ago
ehsanAghahoseini / components
View on GitHub
☆36Jul 14, 2026Updated last week
ronghanghu / vit_10b_fsdp_example
View on GitHub
See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md
☆25Dec 22, 2022Updated 3 years ago
opooladz / Preconditioned-Stochastic-Gradient-Descent
View on GitHub
A repo based on XiLin Li's PSGD repo that extends some of the experiments.
☆14Oct 7, 2024Updated last year
lucaslingle / mu_transformer
View on GitHub
Official implementation of 'A Large-Scale Exploration of mu-Transfer' (CoRR 2024)
☆31Jun 5, 2025Updated last year
evanatyourservice / llm-jax
View on GitHub
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19Jul 24, 2025Updated last year
louis925 / uclathes
View on GitHub
UCLA LaTeX Thesis Template
☆17Jun 13, 2017Updated 9 years ago
young-geng / scalax
View on GitHub
A simple library for scaling up JAX programs
☆148Nov 4, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
young-geng / mlxu
View on GitHub
Machine Learning eXperiment Utilities
☆48Jul 29, 2025Updated 11 months ago
aniquetahir / JORA
View on GitHub
JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)
☆36Apr 25, 2024Updated 2 years ago
rdyro / tune-jax
View on GitHub
Microbenchmarking hyperparameter tuning for JAX functions.
☆22Jul 7, 2026Updated 2 weeks ago
marin-community / haliax
View on GitHub
Named Tensors for Legible Deep Learning in JAX
☆226Nov 8, 2025Updated 8 months ago
HeegyuKim / torch-xla-SPMD
View on GitHub
Pytorch/XLA SPMD Test code in Google TPU
☆23Apr 3, 2024Updated 2 years ago
yandexdataschool / gumbel_dpg
View on GitHub
Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.
☆12Jun 20, 2017Updated 9 years ago
ClashLuke / tpucare
View on GitHub
Automatically take good care of your preemptible TPUs
☆37May 15, 2023Updated 3 years ago
pytorch-tpu / transformers
View on GitHub
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
☆17Jun 5, 2025Updated last year
fmeirinhos / pytorch-hessianfree
View on GitHub
PyTorch implementation of Hessian Free optimisation
☆44Dec 19, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
garzaa / vapor-trails
View on GitHub
❤⚡⏏
☆16Apr 7, 2023Updated 3 years ago
evanatyourservice / psgd_jax
View on GitHub
Implementation of PSGD optimizer in JAX
☆36Dec 31, 2024Updated last year
SonicCodes / subcloning
View on GitHub
implementation of https://arxiv.org/pdf/2312.09299
☆21Jul 3, 2024Updated 2 years ago
ayaka14732 / llama-2-jax
View on GitHub
JAX implementation of the Llama 2 model
☆217Feb 2, 2024Updated 2 years ago
eniompw / nanoGPTshakespeare
View on GitHub
finetuning shakespeare on karpathy/nanoGPT
☆23Feb 2, 2023Updated 3 years ago
google-deepmind / jmp
View on GitHub
JMP is a Mixed Precision library for JAX.
☆213Jul 8, 2026Updated 2 weeks ago
ellieyhcheng / dice-vs-code
View on GitHub
Dice Language Support for VS Code
☆10Sep 29, 2020Updated 5 years ago
young-geng / mintext
View on GitHub
Minimal but scalable implementation of large language models in JAX
☆34Nov 28, 2025Updated 7 months ago
human-rights-corpus / HRC
View on GitHub
#인권코퍼스
☆31Oct 6, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
thomasahle / kanmlps
View on GitHub
KANs and MLPs
☆12Jun 7, 2024Updated 2 years ago
AI-Hypercomputer / maxdiffusion
View on GitHub
☆377Updated this week
lixilinx / Fully-Trainable-SSM
View on GitHub
A fully trainable state space model (SSM)
☆16Mar 18, 2025Updated last year
ucsb-seclab / BullseyePoison
View on GitHub
Bullseye Polytope Clean-Label Poisoning Attack
☆18Nov 5, 2020Updated 5 years ago
xalaida / laravel-translatable
View on GitHub
Translate your Eloquent models into different languages.
☆21Apr 23, 2025Updated last year
KellerJordan / hlb-CIFAR10
View on GitHub
Train to 94% on CIFAR-10 in 4.4 seconds on a single A100
☆12Dec 30, 2023Updated 2 years ago
vasilisp / inez
View on GitHub
A Constraint Solver
☆12Dec 4, 2015Updated 10 years ago