google-deepmind / gemmaLinks
Gemma open-weight LLM library, from Google DeepMind
☆3,739Updated last week
Alternatives and similar repositories for gemma
Users that are interested in gemma are comparing it to the libraries listed below
Sorting:
- The official PyTorch implementation of Google's Gemma models☆5,554Updated 4 months ago
- lightweight, standalone C++ inference engine for Google's Gemma models.☆6,587Updated this week
- Modeling, training, eval, and inference code for OLMo☆6,019Updated last month
- PyTorch native post-training library☆5,523Updated this week
- A series of large language models trained from scratch by developers @01-ai☆7,842Updated 10 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,348Updated 11 months ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"☆3,322Updated last year
- DeepSeek-VL: Towards Real-World Vision-Language Understanding☆3,964Updated last year
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,214Updated 7 months ago
- Home of StarCoder2!☆1,975Updated last year
- CoreNet: A library for training deep neural networks☆7,017Updated last month
- Official inference library for Mistral models☆10,497Updated 6 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆18,662Updated this week
- A simple, performant and scalable Jax LLM!☆1,923Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,611Updated last month
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,763Updated last year
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi…☆3,544Updated 2 weeks ago
- DataComp for Language Models☆1,367Updated last month
- official repository of aiXcoder-7B Code Large Language Model☆2,280Updated 3 months ago
- ☆4,096Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,987Updated last year
- An Extensible Deep Learning Library☆2,262Updated this week
- ☆8,653Updated last year
- ☆2,539Updated last year
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆24,917Updated last week
- ☆3,028Updated last year
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati…☆11,773Updated this week
- Reaching LLaMA2 Performance with 0.1M Dollars☆985Updated last year
- Training LLMs with QLoRA + FSDP☆1,527Updated 11 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆11,838Updated 2 weeks ago