erfanzar / eformer
(EasyDel Former) is a utility library designed to simplify and enhance the development in JAX
☆24Updated this week
Alternatives and similar repositories for eformer:
Users that are interested in eformer are comparing it to the libraries listed below
- OST Collection: An AI-powered suite of models that predict the next word matches with remarkable accuracy (Text Generative Models). OST C…☆16Updated last year
- Flash Attention Implementation with Multiple Backend Support and Sharding This module provides a flexible implementation of Flash Attenti…☆23Updated 2 months ago
- Xerxes, a highly advanced Persian AI assistant developed by InstinctAI, a cutting-edge AI startup. primary function is to assist users wi…☆11Updated 9 months ago
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆226Updated this week
- AgentX is an Open-source library that help people use LLMs on their own computers or help them to serve LLMs as easy as possible that sup…☆15Updated 8 months ago
- A cutting-edge text-to-image generator model that leverages state-of-the-art Stable Diffusion Model Type to produce high-quality, realist…☆14Updated 11 months ago
- Minimal but scalable implementation of large language models in JAX☆32Updated 3 months ago
- A set of Python scripts that makes your experience on TPU better☆48Updated 7 months ago
- Machine Learning eXperiment Utilities☆46Updated 8 months ago
- If it quacks like a tensor...☆56Updated 3 months ago
- Pytorch/XLA SPMD Test code in Google TPU☆23Updated 10 months ago
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆23Updated 2 years ago
- JAX implementation of the Mistral 7b v0.2 model☆35Updated 7 months ago
- LoRA for arbitrary JAX models and functions☆135Updated 11 months ago
- Tensor Parallelism with JAX + Shard Map☆11Updated last year
- Experimenting with how best to do multi-host dataloading☆10Updated 2 years ago
- Inference code for LLaMA models in JAX☆114Updated 9 months ago
- ☆75Updated 7 months ago
- ☆20Updated last year
- ☆42Updated last year
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024)☆32Updated 9 months ago
- JAX bindings for Flash Attention v2☆85Updated 7 months ago
- Learn CUDA with PyTorch☆16Updated 3 weeks ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆24Updated 5 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆70Updated 3 months ago
- A simple library for scaling up JAX programs☆129Updated 3 months ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆81Updated last year
- Automatically take good care of your preemptible TPUs☆36Updated last year
- Simple and efficient pytorch-native transformer training and inference (batched)☆68Updated 10 months ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆18Updated last year