atullchaurasia / transformers
Transformers from scratch using PyTorch & NumPy.
☆24 Updated 3 months ago
Alternatives and similar repositories for transformers
Users who are interested in transformers are comparing it to the repositories listed below.
- ☆89 Updated last month
- ☆46 Updated last month
- I trained a 130M-parameter Llama architecture that I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset… ☆14 Updated last month
- A simple CLI command that creates a template for a generic ML project ☆80 Updated 7 months ago
- Fine tune Gemma 3 on an object detection task ☆20 Updated this week
- ☆30 Updated last week
- ☆80 Updated 3 weeks ago
- A repository of paper and architecture replications of classic/SOTA AI/ML papers in PyTorch ☆194 Updated 2 weeks ago
- ☆74 Updated 7 months ago
- ☆61 Updated 2 months ago
- Personal project, Generative AI, Streamlit, Python ☆52 Updated 2 weeks ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa… ☆216 Updated 4 months ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc… ☆10 Updated last month
- The purpose of this repo is to implement LLMOps as taught in the DeepLearning.AI course ☆22 Updated this week
- Question papers of courses taught at IISc as part of the MTech AI curriculum ☆63 Updated 5 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆31 Updated 2 months ago
- Train an LLM to generate cracked Manim animations for mathematical concepts. ☆15 Updated 2 months ago
- AI agent with RAG+ReAct on Indian Constitution & BNS ☆64 Updated 6 months ago
- Image captioning 🐳 ☆12 Updated 8 months ago
- Coding an LLM and its building blocks from scratch. ☆35 Updated last month
- Collection of impressive LLM apps with a focus on the financial sector ☆40 Updated last month
- 📚 Tutorial on building a modern search app for Amazon e-commerce products leveraging tabular semantic search and natural language querie… ☆64 Updated 2 weeks ago
- Building a Large Language Model (LLM) from scratch. ☆31 Updated 3 months ago
- ☆29 Updated last year
- Code examples showing how to use Gemini, Gemma, Imagen, and more. ☆39 Updated last month
- Find your Twin Celebrity in Vector Space ☆38 Updated 4 months ago
- Quick Notebook Tutorials ☆32 Updated 3 months ago
- Building a large language foundation model ☆9 Updated 2 months ago
- ☆75 Updated 5 months ago
- ☆19 Updated 9 months ago