a simplified version of Google's Gemma model to be used for learning
☆26Mar 2, 2024Updated 2 years ago
Alternatives and similar repositories for minGemma
Users that are interested in minGemma are comparing it to the libraries listed below.
- A GPT with self-similar nested properties☆20Mar 19, 2024Updated 2 years ago
- A curated collection of prompts for Grok Imagine by xAI☆28Oct 19, 2025Updated 6 months ago
- The official Languini Kitchen repository☆14May 6, 2024Updated 2 years ago
- [ICML 2025] SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models☆59Aug 9, 2024Updated last year
- ☆10Nov 16, 2024Updated last year
- Train toy models using multi-token prediction objective☆14Apr 18, 2026Updated 3 weeks ago
- Full-stack web application using Python, Django, SQL, and Bootstrap. OpenSea clone with the ability for users to post Non-Fungible Tokens…☆15Dec 6, 2021Updated 4 years ago
- Python console application designed to provide an engaging and visually appealing LLM chat experience on Unix-like consoles or Terminals.☆25Mar 26, 2026Updated last month
- Remove generated stories with stray unicode characters☆12Jan 3, 2024Updated 2 years ago
- [NeurIPS 2024] "AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment" by Yonggan Fu, Zhongzhi Yu,…☆19Dec 13, 2024Updated last year
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated last year
- PubMed Healthcare Chatbot. LLM Augmented Q&A over PubMed Search Engine.☆27Jan 21, 2024Updated 2 years ago
- Summarize/analyze large amounts of text using local LLM models, langchain, ollama, and flask. No data leaves your computer.☆20May 7, 2024Updated 2 years ago
- Environment equipped with reinforcement learning algorithms to train agents to play tic-tac-toe.☆13Mar 4, 2023Updated 3 years ago
- Provides the source code for "AI App Development with GPT-4" (original title: Developing apps with GPT-4 and ChatGPT)☆18Feb 21, 2024Updated 2 years ago
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆31Nov 14, 2023Updated 2 years ago
- ☆20Updated this week
- ☆31Jan 18, 2025Updated last year
- 4th AI × Bookathon, Excellence Award☆14Jan 20, 2023Updated 3 years ago
- A GitHub template for writing LaTeX documents collaboratively with automatic rendering using GitHub Actions.☆26Jan 17, 2023Updated 3 years ago
- ☆30Sep 3, 2025Updated 8 months ago
- A static deobfuscator for JavaScript Malware☆13May 6, 2020Updated 6 years ago
- PyTorch Implementation of FractalNet☆28Dec 15, 2018Updated 7 years ago
- rotating proxy server☆11Sep 17, 2024Updated last year
- [COLM 2025] DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation; Zhihu: https://zhuanlan.zhihu.c…☆30Mar 5, 2025Updated last year
- This repo implements and trains DallE-1 on a synthetically generated dataset which has colored mnist images on texture/solid background a…☆13Oct 30, 2024Updated last year
- Trained a 114 million Parameter LLM from Scratch.☆19Jul 21, 2024Updated last year
- Drag-and-drop to find text. A work in progress.☆14Oct 6, 2022Updated 3 years ago
- Real-time fraud transaction detection system☆25Aug 28, 2024Updated last year
- ☆16Feb 21, 2026Updated 2 months ago
- Rust widget toolkit built on Reclutch☆11Mar 25, 2020Updated 6 years ago
- An experimental playground starting point to build React Server applications on the Cloudflare platform using Vite.☆16Jan 10, 2025Updated last year
- Implement FlashAttention v2 with minimal code to learn.☆16Jun 12, 2024Updated last year
- Simple PyTorch Denoisers for Waveform Audio☆41Apr 4, 2026Updated last month
- Ready to use whisper.cpp models implementation for iOS and Android☆24Sep 4, 2023Updated 2 years ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆30Updated this week
- Multi-turn dataset management tool for LLM trainers☆12Mar 31, 2025Updated last year
- A collection of Jupyter notebooks using rawpy☆30May 9, 2023Updated 3 years ago
- ☆13Apr 15, 2024Updated 2 years ago