A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, and efficient training infrastructure.
☆48Sep 7, 2025Updated 8 months ago
Alternatives and similar repositories for gemma3-270M-tinystories-pytorch
Users that are interested in gemma3-270M-tinystories-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This projects aims to show how whisper model can be fine-tuned on language it was not trained but is trained on similar language to it.☆11May 10, 2024Updated 2 years ago
- This is a simple demonstration to show how to keep an LLM loaded for prolonged time in the memory or unloading the model immediately afte…☆13May 4, 2024Updated 2 years ago
- ChineseCLIP using online learning☆14Nov 7, 2022Updated 3 years ago
- Personal blog post set up using jekyll☆16May 4, 2026Updated 3 weeks ago
- ☆13May 30, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated last month
- ☆17Feb 24, 2026Updated 3 months ago
- A web application demonstrating translations and summarization with Google Gemini Nano (on-device model)☆19Dec 4, 2024Updated last year
- 基于Funasr的[实时]AI语音助手☆24Dec 18, 2025Updated 5 months ago
- 动手训练一个简单的CLIP模型,加深对CLIP的理解。☆26May 20, 2025Updated last year
- ProxylessNAS-Pytorch☆24Aug 9, 2019Updated 6 years ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆24Oct 10, 2025Updated 7 months ago
- The repo for SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass☆74May 23, 2026Updated last week
- The official implementation of the paper "MLP Memory: A Retriever-Pretrained Memory for Large Language Models". (ICLR 2026)☆66Jan 28, 2026Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ROSA-Tuning☆74Feb 4, 2026Updated 3 months ago
- Download manager for Ollama☆31Dec 3, 2024Updated last year
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆37Jan 7, 2024Updated 2 years ago
- A Python-based security assessment tool for continuous automated security scanning and monitoring of domains.☆13Apr 4, 2025Updated last year
- An introduction to global assessment techniques using Python☆12Apr 24, 2023Updated 3 years ago
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 8 years ago
- ☆21May 17, 2026Updated last week
- Two-stage financial analysis workflow — executive briefing first, detailed deep dive on request☆40Feb 21, 2026Updated 3 months ago
- 30天实现一个claude code☆64Jan 13, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A multi-messaging-sevice aggregator into an all-in-one application (android's app beeper-like)☆21Sep 8, 2025Updated 8 months ago
- ☆17May 15, 2025Updated last year
- State tuning tunes the state☆35Feb 12, 2025Updated last year
- This Module Helps to Scan a Commit History of a Repo for Leakage of Secrets☆15Apr 26, 2025Updated last year
- ☆29Aug 30, 2024Updated last year
- brewpkg☆18Sep 30, 2025Updated 7 months ago
- Repository for "GIST: Distributed training for large-scale graph convolutional networks"☆15Jan 14, 2023Updated 3 years ago
- ☆10Oct 11, 2021Updated 4 years ago
- Winning Hackathon entry for Streamlit LLM Hackathon October 2023☆16Oct 19, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A short script showing how to build simple real-time video analytics apps using YOLOv8 and Supervision. Try it out, and most importantly …☆88Aug 17, 2023Updated 2 years ago
- Tool to create and update Munki manifests for devices managed in Intune☆12Sep 13, 2024Updated last year
- ☆44Oct 16, 2025Updated 7 months ago
- Contains Colab Notebooks show cool use-cases of different GCP ML APIs.☆10Nov 5, 2020Updated 5 years ago
- Gemini Live API + function calling for patient intake☆24Nov 8, 2025Updated 6 months ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/grafeas☆13Oct 31, 2023Updated 2 years ago
- everything i know about cuda and triton☆13Jan 28, 2025Updated last year