TheSeriousProgrammer / SimpleBitNet
Simple Adaptation of BitNet
☆30Updated 9 months ago
Alternatives and similar repositories for SimpleBitNet:
Users that are interested in SimpleBitNet are comparing it to the libraries listed below
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆225Updated 2 months ago
- Prune transformer layers☆67Updated 8 months ago
- Collection of autoregressive model implementation☆77Updated 3 weeks ago
- Set of scripts to finetune LLMs☆36Updated 10 months ago
- ☆117Updated 3 months ago
- LoRA and DoRA from Scratch Implementations☆195Updated 10 months ago
- Notebooks for fine tuning pali gemma☆90Updated last month
- ☆122Updated 5 months ago
- ☆110Updated 4 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆260Updated last week
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆123Updated last year
- ☆154Updated last month
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆190Updated 6 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated 8 months ago
- Let's build better datasets, together!☆250Updated last month
- Pretraining and finetuning for visual instruction following with Mixture of Experts☆12Updated last year
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆262Updated 3 weeks ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆178Updated 4 months ago
- ☆121Updated this week
- Pytorch/XLA SPMD Test code in Google TPU☆23Updated 9 months ago
- An introduction to LLM Sampling☆75Updated last month
- End-to-End LLM Guide☆99Updated 6 months ago
- Google TPU optimizations for transformers models☆90Updated last week
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated 3 months ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆78Updated 6 months ago
- ☆92Updated last year
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆37Updated 3 months ago
- Code for NeurIPS LLM Efficiency Challenge☆54Updated 9 months ago
- Notes on quantization in neural networks☆66Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆101Updated 4 months ago