Cerebras / gigaGPT
a small code base for training large models
☆288Updated 2 months ago
Alternatives and similar repositories for gigaGPT:
Users that are interested in gigaGPT are comparing it to the libraries listed below
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆343Updated 7 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆623Updated 2 weeks ago
- Visualize the intermediate output of Mistral 7B☆343Updated last month
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".☆270Updated last year
- The repository for the code of the UltraFastBERT paper☆517Updated 11 months ago
- run paligemma in real time☆131Updated 9 months ago
- Fast parallel LLM inference for MLX☆173Updated 8 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆230Updated 4 months ago
- An implementation of bucketMul LLM inference☆215Updated 8 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆201Updated 3 months ago
- Inference code for Persimmon-8B☆416Updated last year
- ☆511Updated 6 months ago
- ☆412Updated last year
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"☆849Updated 3 weeks ago
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"☆362Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆603Updated 3 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆196Updated 7 months ago
- Official implementation of Half-Quadratic Quantization (HQQ)☆763Updated 2 weeks ago
- Fine-tune mistral-7B on 3090s, a100s, h100s☆706Updated last year
- A repository for research on medium sized language models.☆492Updated 2 months ago
- Long context evaluation for large language models☆201Updated last week
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆296Updated 4 months ago
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript☆571Updated 8 months ago
- A comprehensive deep dive into the world of tokens☆222Updated 8 months ago
- A bagel, with everything.☆317Updated 11 months ago
- Inference of Mamba models in pure C☆186Updated last year
- A pure NumPy implementation of Mamba.☆219Updated 8 months ago
- ☆500Updated 3 months ago
- [ICML 2024] CLLMs: Consistency Large Language Models☆385Updated 3 months ago
- PyTorch implementation of models from the Zamba2 series.☆177Updated last month