Reverse Engineering Gemma 3n: Google's New Edge-Optimized Language Model
☆276 · May 27, 2025 · Updated 11 months ago
Alternatives and similar repositories for reverse-engineering-gemma-3n
Users interested in reverse-engineering-gemma-3n are comparing it to the libraries listed below.
- I saw this [Blog Post](https://www.morling.dev/blog/one-billion-row-challenge/) on a Billion Row challenge for Java so naturally I tried … ☆14 · Jan 10, 2024 · Updated 2 years ago
- ☆14 · Dec 21, 2025 · Updated 4 months ago
- ☆16 · May 14, 2025 · Updated 11 months ago
- ☆27 · Jan 14, 2025 · Updated last year
- ☆12 · Apr 26, 2024 · Updated 2 years ago
- Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention ☆65 · Apr 7, 2026 · Updated 3 weeks ago
- Graph model execution API for Candle ☆17 · Jul 27, 2025 · Updated 9 months ago
- ☆26 · Feb 26, 2026 · Updated 2 months ago
- This repository contains multiple implementations of Flash Attention optimized with Triton kernels, showcasing progressive performance im… ☆11 · Mar 26, 2026 · Updated last month
- Various LLM Benchmarks ☆25 · Feb 20, 2026 · Updated 2 months ago
- [EMNLP 2022] Official implementation of Transnormer in our EMNLP 2022 paper - The Devil in Linear Transformer ☆64 · Jul 30, 2023 · Updated 2 years ago
- Fast, Lightweight, Unified Engine for Text2Image Diffusion Models ☆20 · Apr 13, 2025 · Updated last year
- Use the tokenizer in parallel to achieve superior acceleration ☆20 · Mar 21, 2024 · Updated 2 years ago
- recipe for training fully-featured self supervised image jepa models ☆13 · Jun 4, 2025 · Updated 11 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs. ☆26 · Feb 11, 2025 · Updated last year
- Demo page of TAVGBench: Benchmarking Text to Audible-Video Generation ☆15 · Apr 7, 2025 · Updated last year
- Deep neural models for core NLP tasks ☆13 · Nov 9, 2017 · Updated 8 years ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆18 · May 20, 2025 · Updated 11 months ago
- ☆20 · Apr 27, 2026 · Updated last week
- ☆14 · Jul 25, 2024 · Updated last year
- Lego for GRPO ☆30 · May 27, 2025 · Updated 11 months ago
- Viewer for text datasets in formats like HuggingFace, JSONL, etc. ☆15 · Feb 25, 2025 · Updated last year
- Nano repo for RL training of LLMs ☆70 · Apr 8, 2026 · Updated 3 weeks ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024) ☆25 · Jun 6, 2024 · Updated last year
- ☆20 · Mar 25, 2025 · Updated last year
- Fork of Flame repo for training of some new stuff in development ☆19 · Apr 24, 2026 · Updated last week
- Repository for DNN training, theory to practice, part of the Large Scale Machine Learning class at Mines ParisTech ☆11 · Mar 11, 2022 · Updated 4 years ago
- Code repository supporting the paper "Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segment… ☆12 · Apr 29, 2024 · Updated 2 years ago
- ☆16 · Nov 23, 2023 · Updated 2 years ago
- Rust bindings for CTranslate2 ☆14 · Jun 21, 2023 · Updated 2 years ago
- ☆10 · Nov 29, 2022 · Updated 3 years ago
- fast trainer for educational purposes ☆26 · Apr 24, 2026 · Updated last week
- FlexAttention w/ FlashAttention3 Support ☆27 · Oct 5, 2024 · Updated last year
- ☆19 · Aug 10, 2024 · Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e… ☆11 · Dec 27, 2024 · Updated last year
- Bamboo-7B Large Language Model ☆93 · Mar 28, 2024 · Updated 2 years ago
- Channels between coroutines in Python ☆15 · Jan 4, 2021 · Updated 5 years ago
- ☆25 · May 23, 2025 · Updated 11 months ago
- High-Level Rust API for egglog ☆29 · Updated this week