google-deepmind/gemma_penzai

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-deepmind/gemma_penzai)

google-deepmind / gemma_penzai

A JAX Research Toolkit for Visualizing, Manipulating, and Understanding Gemma Models with Multi-modal Support based on Penzai.

☆95

Alternatives and similar repositories for gemma_penzai

Users that are interested in gemma_penzai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenRewardAI / openreward-cookbook
View on GitHub
Training and evaluating with OpenReward
☆33Apr 28, 2026Updated 3 months ago
yucenli / unify-diffusion
View on GitHub
☆28Dec 19, 2025Updated 7 months ago
archiki / ASR-Accent-Analysis
View on GitHub
Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.
☆15Jun 27, 2020Updated 6 years ago
PrimeIntellect-ai / renderers
View on GitHub
Programmable chat templates for LLM training and inference.
☆135Updated this week
jiangycTarheel / SQ-Transformer
View on GitHub
☆10Feb 12, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sail-sg / feedback-conditional-policy
View on GitHub
Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
☆65Jan 5, 2026Updated 6 months ago
alisawuffles / tokenizer-attack
View on GitHub
Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"
☆23May 15, 2025Updated last year
milarobotlearningcourse / robot_learning_2025
View on GitHub
☆13Aug 13, 2025Updated 11 months ago
allenai / fluid-benchmarking
View on GitHub
Fluid Language Model Benchmarking
☆29Sep 16, 2025Updated 10 months ago
aisa-group / PostTrainBench
View on GitHub
Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours
☆472Jul 22, 2026Updated last week
goedelcodeprover / Goedel-Code-Prover
View on GitHub
☆50Apr 12, 2026Updated 3 months ago
arubique / OCCAM
View on GitHub
This is an implementation of the paper "Are We Done with Object-Centric Learning?"
☆14Jun 21, 2026Updated last month
deep-spin / sparse-communication
View on GitHub
☆12Mar 7, 2022Updated 4 years ago
harbor-framework / harbor-index
View on GitHub
A compact high-signal benchmark for evaluating frontier agents
☆21Updated this week
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
vsahil / MIMETIC-2
View on GitHub
Official Code for MIMETIC^2
☆13Nov 19, 2024Updated last year
Sharut / canonical-multimodal-rep
View on GitHub
☆15Feb 25, 2026Updated 5 months ago
Kripner / openprover
View on GitHub
Automated Theorem Prover inspired by Aletheia. Claude Code for mathematicians.
☆82Jul 13, 2026Updated 2 weeks ago
windows7lover / DTE-DynamicTrainingEngine
View on GitHub
Generic building-block toolbox for training neural networks with adaptive and recursive execution. It provides reusable components to con…
☆27Jun 29, 2026Updated last month
eligotts / legos
View on GitHub
☆24Jan 22, 2026Updated 6 months ago
SakanaAI / repo
View on GitHub
RePo: Language Models with Context Re-Positioning
☆83Mar 30, 2026Updated 3 months ago
ASTRAL-Group / LoRe
View on GitHub
When Reasoning Meets Its Laws
☆38Jan 2, 2026Updated 6 months ago
tobna / TaylorShift
View on GitHub
This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…
☆15Feb 25, 2026Updated 5 months ago
facebookresearch / PhysicsLM4
View on GitHub
Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality
☆356May 20, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
epfml / schedules-and-scaling
View on GitHub
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆93Oct 30, 2024Updated last year
JanTempus / tokenisation_lp
View on GitHub
☆15Updated this week
JinjieNi / dlms-are-super-data-learners
View on GitHub
The official github repo for "Diffusion Language Models are Super Data Learners".
☆227Nov 6, 2025Updated 8 months ago
OliverRichter / normalized-attention
View on GitHub
Code publication to the paper "Normalized Attention Without Probability Cage"
☆17Nov 9, 2021Updated 4 years ago
dsrahul08 / PythonProgramming
View on GitHub
☆15Dec 3, 2022Updated 3 years ago
ChinmayK0607 / heiretsu
View on GitHub
Educational WIP
☆73Feb 16, 2026Updated 5 months ago
matthias-wright / jax-fid
View on GitHub
FID computation in Jax/Flax.
☆29Jul 17, 2024Updated 2 years ago
cloneofsimo / repa-rf
View on GitHub
☆32Nov 4, 2024Updated last year
Becomebright / MTV
View on GitHub
Revisiting Multi-Task Visual Representation Learning
☆22Jan 21, 2026Updated 6 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
karthikncode / MorphoChain
View on GitHub
A model for unsupervised morphological analysis that integrates orthographic and semantic views of words.
☆13Oct 10, 2023Updated 2 years ago
doyc-1 / Crux
View on GitHub
The State Of The Art, intelligence
☆162Aug 12, 2025Updated 11 months ago
technion-cs-nlp / vlm-circuits-analysis
View on GitHub
Code for the experiments and websites of the paper "Same Task, Different Circuits"
☆36Jul 21, 2026Updated last week
DAMO-NLP-SG / LLM-Multilingual-Knowledge-Boundaries
View on GitHub
[ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations
☆19Oct 18, 2025Updated 9 months ago
jasonmayes / Retraining-TensorFlow-Classifier-Using-Video
View on GitHub
Script to convert all MP4 videos in a zip archive to JPG frames at a desired FPS with unique names. It will then retrain the top layers o…
☆12Jul 6, 2016Updated 10 years ago
kadarakos / hieratt
View on GitHub
Experimenting with Hierarchical Attention Networks from https://arxiv.org/abs/1606.02393 in Keras
☆13Oct 12, 2016Updated 9 years ago
orrzohar / LOVM
View on GitHub
[NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection
☆21Feb 3, 2024Updated 2 years ago