di37 / gemma3-270M-tinystories-pytorchLinks
A complete PyTorch implementation of Google's Gemma3 270M language model, featuring sliding window attention, RoPE positional encoding, and efficient training infrastructure.
☆43Updated 3 months ago
Alternatives and similar repositories for gemma3-270M-tinystories-pytorch
Users that are interested in gemma3-270M-tinystories-pytorch are comparing it to the libraries listed below
Sorting:
- Fine tune Gemma 3 on an object detection task☆91Updated 4 months ago
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆57Updated 2 months ago
- A collection of hand on notebook for LLMs practitioner☆51Updated 10 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated last week
- Fine-tune an LLM to perform batch inference and online serving.☆114Updated 6 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 2 months ago
- Notebooks for fine tuning pali gemma☆117Updated 7 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆140Updated 10 months ago
- Tools for merging pretrained large language models.☆19Updated last year
- Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️☆47Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆84Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year
- ML/DL Math and Method notes☆64Updated 2 years ago
- Multimodal AI workloads: batch inference, model training and online serving.☆103Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- Andrej Kapathy's micrograd implemented in c☆30Updated last year
- zero-to-lightning☆31Updated last year
- Building LLMs from scratch following the book from S. Raschka☆32Updated 8 months ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- ☆46Updated 8 months ago
- Material for the series of seminars on Large Language Models☆34Updated last year
- 🤖 AI Assistant fine-tuned to provide support for coding and design questions based on the latest trends in the industry.☆17Updated last year
- ☆20Updated last year
- ☆80Updated last year
- Train LLM on Hugging Face infra☆67Updated 3 weeks ago
- A curated list of materials on AI guardrails☆43Updated 6 months ago
- ☆59Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆45Updated last year
- Learn the building blocks of how to build gpt-oss from scratch☆105Updated 2 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆12Updated 11 months ago