di37 / gemma3-270M-tinystories-pytorch
A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, and efficient training infrastructure.
☆45Sep 7, 2025Updated 5 months ago
Alternatives and similar repositories for gemma3-270M-tinystories-pytorch
Users who are interested in gemma3-270M-tinystories-pytorch are comparing it to the libraries listed below.
- ☆17Aug 19, 2025Updated 5 months ago
- [ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Feb 16, 2023Updated 3 years ago
- An introduction to global assessment techniques using Python☆12Apr 24, 2023Updated 2 years ago
- Code repository corresponding to the paper "Prompt Tuned Embedding Classification for Multi-Label Industry Sector Allocation" (NAACL 2024…☆10May 31, 2024Updated last year
- ☆15Oct 24, 2023Updated 2 years ago
- HyFormer: Hybrid Transformer and CNN For Pixel-level Multispectral Image Classification☆15Feb 15, 2023Updated 3 years ago
- Machine learning regression problem with easily understandable solutions☆36Jul 23, 2018Updated 7 years ago
- TensorRT In Docker☆11Dec 7, 2024Updated last year
- ☆10Oct 11, 2021Updated 4 years ago
- [CVPR 2025] Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding☆15Jun 16, 2025Updated 8 months ago
- Whiteboard animation generator☆44Feb 6, 2026Updated last week
- ☆19Sep 19, 2025Updated 4 months ago
- prosEO – A Processing System for Earth Observation Data☆19Updated this week
- Observe the dataset of images and targets in few shots☆11Sep 27, 2022Updated 3 years ago
- brewpkg☆17Sep 30, 2025Updated 4 months ago
- Full End-to-End examples showing how to use First-gen Gaudi and Gaudi2 in common use cases☆13Dec 2, 2024Updated last year
- Official repository accompanying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- [WACV2023] This is the official PyTorch implementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Feb 24, 2023Updated 2 years ago
- [ICPR-2024] S-MultiMAE - A Multi-Ground Truth approach for RGB-D Saliency Detection☆12Dec 13, 2024Updated last year
- openbharatocr is an open-source Python library that facilitates extracting data from official Indian government documents☆14Sep 4, 2025Updated 5 months ago
- Quantization of LLMs and benchmarking.☆10Apr 3, 2024Updated last year
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year
- Sample code for the Pluto-V3R drone.☆11Mar 13, 2018Updated 7 years ago
- Contains Colab notebooks showing cool use cases of different GCP ML APIs.☆10Nov 5, 2020Updated 5 years ago
- [WACV2025] source code of StrDA: https://arxiv.org/abs/2410.09913☆12Apr 15, 2025Updated 10 months ago
- A streamlit web app visualizing global surface water datasets.☆13Aug 11, 2022Updated 3 years ago
- Auto-differentiation library for JavaScript☆12Mar 4, 2021Updated 4 years ago
- NLP Workshops☆11Apr 24, 2025Updated 9 months ago
- ☆11Nov 18, 2025Updated 2 months ago
- Google Sheets to SQLite CLI tool.☆12Aug 15, 2023Updated 2 years ago
- ☆17May 15, 2025Updated 9 months ago
- ☆11Nov 15, 2020Updated 5 years ago
- ☆12Dec 15, 2022Updated 3 years ago
- Benchmarks for AutoAlbument - AutoML for Image Augmentation☆10Nov 5, 2023Updated 2 years ago
- A curated list of resources on Document Layout Analysis☆11Aug 7, 2025Updated 6 months ago
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition☆15Jan 21, 2025Updated last year
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated 10 months ago
- llama4_trip_planning_agent☆12Apr 5, 2025Updated 10 months ago
- Earth observations, especially satellite data, have produced a wealth of methods and results in meeting global challenges, often presente…☆12Sep 22, 2022Updated 3 years ago