di37 / gemma3-270M-tinystories-pytorchLinks
A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, and efficient training infrastructure.
☆29Updated this week
Alternatives and similar repositories for gemma3-270M-tinystories-pytorch
Users that are interested in gemma3-270M-tinystories-pytorch are comparing it to the libraries listed below
Sorting:
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆44Updated last week
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆11Updated last year
- Multimodal AI workloads: batch inference, model training and online serving.☆59Updated 3 weeks ago
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- ☆20Updated last year
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.☆21Updated 3 years ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 2 months ago
- Framework for building and maintaining self-updating prompts for LLMs☆64Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 4 months ago
- ☆80Updated last year
- Building LLMs from scratch following the book from S. Raschka☆31Updated 5 months ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Updated 2 years ago
- Tools for merging pretrained large language models.☆19Updated last year
- A collection of hand on notebook for LLMs practitioner☆50Updated 7 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 8 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Build Agentic workflows with function calling using open LLMs☆28Updated last week
- Chunk your text using gpt4o-mini more accurately☆44Updated last year
- Fine tune Gemma 3 on an object detection task☆82Updated last month
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆29Updated 8 months ago
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Updated last year
- ML/DL Math and Method notes☆63Updated last year
- This repository shows various ways of deploying a vision model (TensorFlow) from 🤗 Transformers.☆30Updated 3 years ago
- zero-to-lightning☆31Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- Material for the series of seminars on Large Language Models☆34Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆127Updated 7 months ago
- MLFlow End to End Workshop at Chandigarh University☆11Updated 2 years ago