atullchaurasia / transformers
Transformers from scratch using PyTorch & NumPy.
☆24Updated 3 months ago
Alternatives and similar repositories for transformers
Users who are interested in transformers are comparing it to the repositories listed below
- So, I trained a 130M Llama architecture I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated 2 months ago
- ☆46Updated 2 months ago
- ☆89Updated 2 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch☆196Updated last month
- a simple CLI command that will create a template of a generic ML Project☆80Updated 7 months ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa…☆218Updated 5 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆60Updated 2 months ago
- Fine tune Gemma 3 on an object detection task☆43Updated this week
- ☆160Updated 2 weeks ago
- ☆39Updated last month
- AI agent with RAG+ReAct on Indian Constitution & BNS☆65Updated 7 months ago
- small auto-grad engine inspired by Karpathy's micrograd and PyTorch☆268Updated 6 months ago
- Collection of impressive LLM apps with a focus on the financial sector☆41Updated 2 months ago
- rl from zero pretrain, can it be done? we'll see.☆24Updated this week
- Personal project, Generative AI, Streamlit, Python☆52Updated last month
- ☆75Updated 5 months ago
- ☆74Updated 8 months ago
- Question paper of courses taught at IISC as part of MTech AI curriculum☆65Updated 6 months ago
- 📚 Tutorial on building a modern search app for Amazon e-commerce products leveraging tabular semantic search and natural language querie…☆68Updated last month
- Assignments of courses taught at IISC as part of MTech AI curriculum☆116Updated 3 months ago
- Code examples showing how to use Gemini, Gemma, Imagen, and more.☆40Updated 2 months ago
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Updated last month
- ☆38Updated 3 months ago
- Agentic RAG to help you build a startup🚀☆43Updated 2 months ago
- Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️☆48Updated last year
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆32Updated 2 weeks ago
- ☆83Updated 3 weeks ago
- Train an LLM to generate cracked Manim animations for mathematical concepts.☆16Updated 2 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 7 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆76Updated last month