hkproj / pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
β54Updated 10 months ago
Alternatives and similar repositories for pytorch-transformer-distributed:
Users that are interested in pytorch-transformer-distributed are comparing it to the libraries listed below
- β128Updated last month
- β135Updated last week
- Complete implementation of Llama2 with/without KV cache & inference πβ47Updated 8 months ago
- Prune transformer layersβ67Updated 8 months ago
- β142Updated last year
- Notes on Direct Preference Optimizationβ16Updated 10 months ago
- LoRA and DoRA from Scratch Implementationsβ196Updated 11 months ago
- LLaMA 2 implemented from scratch in PyTorchβ294Updated last year
- Notes and commented code for RLHF (PPO)β69Updated 11 months ago
- A pipeline for LLM knowledge distillationβ91Updated 3 weeks ago
- Collection of autoregressive model implementationβ81Updated last week
- Notes about LLaMA 2 modelβ53Updated last year
- Unofficial implementation of https://arxiv.org/pdf/2407.14679β42Updated 5 months ago
- ring-attention experimentsβ123Updated 4 months ago
- Building GPT ...β17Updated 2 months ago
- β40Updated 9 months ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTOβ¦β53Updated last week
- From scratch implementation of a vision language model in pure PyTorchβ194Updated 9 months ago
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)β149Updated 2 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorchβ94Updated last year
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024β272Updated last week
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIMβ51Updated 10 months ago
- Set of scripts to finetune LLMsβ36Updated 10 months ago
- β47Updated 5 months ago
- β70Updated 7 months ago
- Training and Fine-tuning an llm in Python and PyTorch.β41Updated last year
- β167Updated 2 months ago