a simplified version of Google's Gemma model to be used for learning
☆26Mar 2, 2024Updated 2 years ago
Alternatives and similar repositories for minGemma
Users that are interested in minGemma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆29Updated this week
- A GPT with self-similar nested properties☆20Mar 19, 2024Updated 2 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- OpenAI GPT model to build your personal assistant in IoT devices. Just like Alexa, Google Assistant, Siri, etc. but with your own skills,…☆12Aug 7, 2023Updated 2 years ago
- Train toy models using multi-token prediction objective☆14May 8, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Minimal implementation of multiple PEFT methods for LLaMA fine-tuning☆13May 7, 2023Updated 2 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 10 months ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 11 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆18Sep 13, 2024Updated last year
- Implementation of a simple BPE tokenizer, but in Nim☆22Jul 2, 2023Updated 2 years ago
- [ 공모전 ] 다각적 모델을 활용한 대출 신청 여부 예측과 고객 군집 별 서비스 메시지 제안 : 이상치 탐지, 머신러닝, 딥러닝 모델☆14Jan 28, 2023Updated 3 years ago
- Summarize/analyze large amounts of text using local LLM models, langchain, ollama, and flask. No data leaves your computer.☆20May 7, 2024Updated last year
- ☆21Feb 5, 2024Updated 2 years ago
- GPT-4를 활용한 인공지능 앱 개발(원제: Developing apps with GPT-4 and ChatGPT)의 소스 코드를 제공합니다☆18Feb 21, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆20Updated this week
- Naive Bayesian Classifier from scratch using PyTorch and analysis of alcohol consumption☆17Sep 29, 2020Updated 5 years ago
- Based on "long-form-factuality" a python based processor to easily fact check anything.☆20Apr 1, 2024Updated 2 years ago
- My custom "3D Gaussian" splatting☆35Apr 12, 2026Updated last week
- A Github template for writing LaTeX documents collaboratively with automatic rendering using Github actions.☆25Jan 17, 2023Updated 3 years ago
- Non-official implementation of "Attention as an RNN" from https://arxiv.org/pdf/2405.13956, efficient associative parallel prefix scan an…☆27Jul 27, 2024Updated last year
- A static deobfuscator for JavaScript Malware☆13May 6, 2020Updated 5 years ago
- Transformers components but in Triton☆34May 9, 2025Updated 11 months ago
- [COLM 2025] DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation; 知乎:https://zhuanlan.zhihu.c…☆30Mar 5, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- rotating proxy server☆11Sep 17, 2024Updated last year
- customizable template GPT code designed for easy novel architecture experimentation☆26Mar 19, 2025Updated last year
- Simulator for LLM inference on an abstract 3D AIMC-based accelerator☆28Sep 18, 2025Updated 7 months ago
- Discrete Bayesian optimization with LLMs, PEFT finetuning methods, and the Laplace approximation.☆23Jul 30, 2024Updated last year
- Drag-and-drop to find text. A work in progress.☆14Oct 6, 2022Updated 3 years ago
- 《차근차근 실습하며 배우는 파이토치 딥러닝 프로그래밍》 예제 코드☆23Aug 17, 2022Updated 3 years ago
- Rust widget toolkit built on Reclutch☆11Mar 25, 2020Updated 6 years ago
- An experimental playground starting point to build React Server applications on the Cloudflare platform using Vite.☆16Jan 10, 2025Updated last year
- Implement FlashAttention v2 with minimal code to learn.☆16Jun 12, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition☆18Apr 25, 2021Updated 4 years ago
- Multiplicative Normalizing Flows in PyTorch.☆25Nov 3, 2025Updated 5 months ago
- Simple PyTorch Denoisers for Waveform Audio☆41Apr 4, 2026Updated 2 weeks ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- Multi-turn dataset management tool for LLM trainers☆12Mar 31, 2025Updated last year
- Simple proxy app made with HTML, Css, Javascript. Get random free Http/Https proxies.☆12Aug 25, 2024Updated last year
- An example that shows how to use the Nightmare headless browser to capture web-based visualizations under Node.js.☆15Mar 25, 2024Updated 2 years ago