google / gemma_pytorch
The official PyTorch implementation of Google's Gemma models
☆5,585 Updated 7 months ago
Alternatives and similar repositories for gemma_pytorch
Users who are interested in gemma_pytorch are comparing it to the libraries listed below.
- lightweight, standalone C++ inference engine for Google's Gemma models. ☆6,650 Updated last week
- Gemma open-weight LLM library, from Google DeepMind ☆3,908 Updated last month
- Modeling, training, eval, and inference code for OLMo ☆6,263 Updated last month
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆6,172 Updated 4 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆3,353 Updated 10 months ago
- ☆4,109 Updated last year
- PyTorch native post-training library ☆5,639 Updated this week
- A PyTorch native platform for training generative AI models ☆4,892 Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,225 Updated last year
- Large World Model -- Modeling Text and Video with Millions Context ☆7,389 Updated last year
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models" ☆3,330 Updated last year
- ☆2,552 Updated last year
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,750 Updated 5 months ago
- An Extensible Deep Learning Library ☆2,308 Updated 2 weeks ago
- Video+code lecture on building nanoGPT from scratch ☆4,637 Updated last year
- Efficient Triton Kernels for LLM Training ☆5,991 Updated this week
- DataComp for Language Models ☆1,402 Updated 3 months ago
- High-speed Large Language Model Serving for Local Deployment ☆8,503 Updated 4 months ago
- CoreNet: A library for training deep neural networks ☆7,025 Updated 2 months ago
- tiny vision language model ☆9,130 Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,623 Updated 3 months ago
- llama3 implementation one matrix multiplication at a time ☆15,203 Updated last year
- Training LLMs with QLoRA + FSDP ☆1,534 Updated last year
- Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR. ☆2,073 Updated last year
- Run Mixtral-8x7B models in Colab or consumer desktops ☆2,327 Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,852 Updated last year
- NanoGPT (124M) in 3 minutes ☆4,035 Updated this week
- Examples in the MLX framework ☆8,085 Updated 2 weeks ago
- 4M: Massively Multimodal Masked Modeling ☆1,781 Updated 6 months ago
- Official inference library for Mistral models ☆10,606 Updated last month