A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, and efficient training infrastructure.
☆48Sep 7, 2025Updated 6 months ago
Alternatives and similar repositories for gemma3-270M-tinystories-pytorch
Users that are interested in gemma3-270M-tinystories-pytorch are comparing it to the libraries listed below.
- A reconstruction of RL Introduction and its course materials for a more efficient entry☆21Mar 4, 2026Updated 3 weeks ago
- The repo for SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass☆29Mar 21, 2026Updated last week
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Dec 23, 2025Updated 3 months ago
- pure go for rwkv☆19Dec 31, 2023Updated 2 years ago
- [Real-time] AI voice assistant based on FunASR☆24Dec 18, 2025Updated 3 months ago
- Low Quality Image Detection using Machine Learning☆21Jan 24, 2026Updated 2 months ago
- [ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Feb 16, 2023Updated 3 years ago
- ☆26Nov 27, 2021Updated 4 years ago
- continuous batching and parallel acceleration for RWKV6☆22Jun 28, 2024Updated last year
- An introduction to global assessment techniques using Python☆12Apr 24, 2023Updated 2 years ago
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 7 years ago
- llama4_trip_planning_agent☆12Apr 5, 2025Updated 11 months ago
- Two-stage financial analysis workflow — executive briefing first, detailed deep dive on request☆40Feb 21, 2026Updated last month
- ☆96Jul 4, 2025Updated 8 months ago
- Remote Model Context Protocol (MCP) server for Linear.☆17Apr 12, 2025Updated 11 months ago
- Project to save innocent infant lives using Cardiotocography and CNN image classification☆21May 22, 2019Updated 6 years ago
- ☆11Mar 20, 2026Updated last week
- State tuning tunes the state☆35Feb 12, 2025Updated last year
- A multi-messaging-service aggregator in an all-in-one application (similar to the Android app Beeper)☆20Sep 8, 2025Updated 6 months ago
- ☆18May 15, 2025Updated 10 months ago
- This Module Helps to Scan a Commit History of a Repo for Leakage of Secrets☆15Apr 26, 2025Updated 11 months ago
- brewpkg☆17Sep 30, 2025Updated 6 months ago
- ☆10Oct 11, 2021Updated 4 years ago
- Using ResNet3D to train on Kinetics from scratch or fine-tune on UCF-101 (or others) with a Kinetics-pretrained model.☆30Aug 10, 2020Updated 5 years ago
- Winning Hackathon entry for Streamlit LLM Hackathon October 2023☆16Oct 19, 2023Updated 2 years ago
- Batch random generation of ID card images for OCR model training☆37Jul 24, 2019Updated 6 years ago
- Power BI dashboard templates for Microsoft Intune endpoint analytics and device management reporting☆18Jul 9, 2022Updated 3 years ago
- ☆12Nov 18, 2025Updated 4 months ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Oct 2, 2024Updated last year
- Sample Angular application for dog breed detection, where the target Keras model is hosted by TensorFlow Serving☆12Oct 25, 2018Updated 7 years ago
- Write Datasette canned queries as plain SQL files☆14Jul 2, 2022Updated 3 years ago
- Hidden Markov Model for .NET☆11Jul 13, 2015Updated 10 years ago
- PowerShell Functions to query the Microsoft Graph API☆10Jun 26, 2020Updated 5 years ago
- AI Browser☆17Jul 23, 2025Updated 8 months ago
- ☆35Apr 28, 2025Updated 11 months ago
- This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/grafeas☆13Oct 31, 2023Updated 2 years ago
- everything i know about cuda and triton☆13Jan 28, 2025Updated last year
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆40Feb 29, 2024Updated 2 years ago
- How to save a model for tfserving☆11Jan 13, 2018Updated 8 years ago