google-deepmind / gemma_penzaiLinks
A JAX Research Toolkit for Visualizing, Manipulating, and Understanding Gemma Models with Multi-modal Support based on Penzai.
☆87Updated 2 weeks ago
Alternatives and similar repositories for gemma_penzai
Users that are interested in gemma_penzai are comparing it to the libraries listed below
Sorting:
- aesthetic tensor visualiser☆28Updated 9 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆61Updated last year
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated last month
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆27Updated 11 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆277Updated 2 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆288Updated 2 months ago
- Lego for GRPO☆30Updated 8 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated last week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆127Updated 3 months ago
- ☆148Updated last year
- Curated collection of community environments☆208Updated this week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆109Updated 10 months ago
- Open-source release accompanying Gao et al. 2025☆498Updated last month
- Hub for researchers exploring VLMs and Multimodal Learning:)☆61Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆358Updated 7 months ago
- minimal GRPO implementation from scratch☆102Updated 10 months ago
- open source alpha evolve☆68Updated 8 months ago
- An extension of the nanoGPT repository for training small MOE models.☆231Updated 10 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆260Updated last week
- [ICLR'26] The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models"☆324Updated this week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆109Updated 8 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆88Updated 10 months ago
- ☆52Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆175Updated last year
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆38Updated 2 months ago
- Exploring Applications of GRPO☆251Updated 5 months ago
- Open source interpretability artefacts for R1.☆169Updated 9 months ago
- Train, tune, and infer Bamba model☆138Updated 7 months ago
- Verification of Google DeepMind's AlphaEvolve 48-multiplication matrix algorithm, a breakthrough in matrix multiplication after 56 years.☆131Updated 7 months ago
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆227Updated 2 months ago