google-deepmind / gemmaLinks
Gemma open-weight LLM library, from Google DeepMind
☆3,372Updated this week
Alternatives and similar repositories for gemma
Users that are interested in gemma are comparing it to the libraries listed below
Sorting:
- The official PyTorch implementation of Google's Gemma models☆5,472Updated last week
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,450Updated this week
- PyTorch native post-training library☆5,233Updated this week
- ☆4,082Updated last year
- A PyTorch native platform for training generative AI models☆3,891Updated this week
- Modeling, training, eval, and inference code for OLMo☆5,648Updated this week
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆3,851Updated last year
- Set of tools to assess and improve LLM security.☆3,436Updated last week
- Tools for merging pretrained large language models.☆5,774Updated this week
- CoreNet: A library for training deep neural networks☆7,016Updated 3 weeks ago
- A simple, performant and scalable Jax LLM!☆1,746Updated this week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,900Updated 8 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,013Updated 3 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆14,814Updated this week
- Simple, safe way to store and distribute tensors☆3,282Updated this week
- On-device AI across mobile, embedded and edge for PyTorch☆2,909Updated this week
- Agentic components of the Llama Stack APIs☆4,248Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,588Updated 2 weeks ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models☆1,711Updated last year
- DataComp for Language Models☆1,305Updated 2 months ago
- ☆2,952Updated 8 months ago
- Training LLMs with QLoRA + FSDP☆1,483Updated 6 months ago
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi…☆3,340Updated this week
- AIOS: AI Agent Operating System☆4,205Updated 2 weeks ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,282Updated 7 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,283Updated last year
- The official Meta Llama 3 GitHub site☆28,755Updated 4 months ago
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆1,565Updated 7 months ago
- AllenAI's post-training codebase☆2,993Updated this week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,975Updated last month