A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, and efficient training infrastructure.
☆48 · Sep 7, 2025 · Updated 7 months ago
Alternatives and similar repositories for gemma3-270M-tinystories-pytorch
Users that are interested in gemma3-270M-tinystories-pytorch are comparing it to the libraries listed below.
- Clean Architecture for Laravel using the Use Case (Application Service) pattern to keep business logic isolated, testable, and reusable. ☆22 · Dec 23, 2025 · Updated 3 months ago
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth. ☆16 · Sep 13, 2025 · Updated 7 months ago
- Chatbot implementation using ChatGPT API and Gradio. ☆14 · Mar 2, 2023 · Updated 3 years ago
- Question Answering System API based on all of the Harry Potter books that answers questions about all the events that took place in the Har… ☆13 · Feb 26, 2023 · Updated 3 years ago
- ChineseCLIP using online learning ☆14 · Nov 7, 2022 · Updated 3 years ago
- Solo Podcast Creation from Web Page content ☆19 · Sep 23, 2024 · Updated last year
- Softened ROSA QKV Operators for Training Next-Generation LLM Models ☆36 · Apr 7, 2026 · Updated last week
- ☆13 · May 30, 2024 · Updated last year
- ☆17 · Feb 24, 2026 · Updated last month
- This repository contains a project that focuses on evaluating the performance of different Language Models (LLMs) for multi-class news cl… ☆18 · May 25, 2024 · Updated last year
- The repo for SHINE: A Scalable In-Context Hypernetwork for Mapping Context to LoRA in a Single Pass ☆60 · Mar 21, 2026 · Updated 3 weeks ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor ☆34 · Oct 13, 2025 · Updated 6 months ago
- [Real-time] AI voice assistant based on Funasr ☆24 · Dec 18, 2025 · Updated 4 months ago
- Train a simple CLIP model hands-on to deepen your understanding of CLIP. ☆25 · May 20, 2025 · Updated 10 months ago
- [ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian… ☆24 · Feb 16, 2023 · Updated 3 years ago
- ROSA-Tuning ☆71 · Feb 4, 2026 · Updated 2 months ago
- NeurIPS 2023, Recaptured Raw Screen Image and Video Demoiréing via Channel and Spatial Modulations ☆20 · Aug 17, 2024 · Updated last year
- [WIP] Transformer to embed Danbooru labelsets ☆13 · Mar 31, 2024 · Updated 2 years ago
- ☆27 · Nov 27, 2021 · Updated 4 years ago
- A Python-based security assessment tool for continuous automated security scanning and monitoring of domains. ☆13 · Apr 4, 2025 · Updated last year
- Continuous batching and parallel acceleration for RWKV6 ☆22 · Jun 28, 2024 · Updated last year
- Runtime protection for AI agents ☆111 · Updated this week
- llama4_trip_planning_agent ☆12 · Apr 5, 2025 · Updated last year
- ☆85 · Mar 28, 2026 · Updated 3 weeks ago
- ☆23 · Jun 26, 2024 · Updated last year
- Two-stage financial analysis workflow: executive briefing first, detailed deep dive on request ☆40 · Feb 21, 2026 · Updated last month
- A collaborative hub for AI enthusiasts and experts in the UAE to contribute and refine ideas under the Coders(HQ) initiative. Fork, innov… ☆30 · Dec 20, 2024 · Updated last year
- Remote Model Context Protocol (MCP) server for Linear. ☆17 · Apr 12, 2025 · Updated last year
- ☆14 · Mar 20, 2026 · Updated 3 weeks ago
- ☆18 · May 15, 2025 · Updated 11 months ago
- This module helps scan the commit history of a repo for leaked secrets ☆15 · Apr 26, 2025 · Updated 11 months ago
- Repository for "GIST: Distributed training for large-scale graph convolutional networks" ☆15 · Jan 14, 2023 · Updated 3 years ago
- A terminal-based, cross-platform Text User Interface (TUI) for exploring and managing devices, apps, and users in both Microsoft Intune a… ☆11 · May 1, 2025 · Updated 11 months ago
- Using ResNet3D to train on Kinetics from scratch or fine-tune on UCF-101 (or others) with a Kinetics-pretrained model. ☆30 · Aug 10, 2020 · Updated 5 years ago
- A short script showing how to build simple real-time video analytics apps using YOLOv8 and Supervision. Try it out, and most importantly … ☆88 · Aug 17, 2023 · Updated 2 years ago
- ☆22 · Apr 17, 2024 · Updated 2 years ago
- ☆16 · Jun 13, 2023 · Updated 2 years ago
- IntuneFirewallMigration is an updated version of the tool originally provided by Microsoft to capture firewall rules from a target machine a… ☆22 · Nov 6, 2025 · Updated 5 months ago
- ☆41 · Oct 16, 2025 · Updated 6 months ago