silvaxxx1 / MyLLM101
"LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"
☆27 Updated 2 weeks ago
Alternatives and similar repositories for MyLLM101:
Users who are interested in MyLLM101 are comparing it to the repositories listed below.
- repo of paper implementations ☆18 Updated last month
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch ☆158 Updated last week
- GPU Kernels ☆160 Updated this week
- ☆232 Updated this week
- ☆84 Updated last week
- a simple CLI command that will create a template of a generic ML Project ☆77 Updated 6 months ago
- ☆77 Updated last week
- ☆45 Updated 2 weeks ago
- This repo has all the basic things you'll need in order to understand complete vision transformer architecture and its various implementa… ☆214 Updated 3 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆173 Updated this week
- Question papers from courses taught at IISc as part of the MTech AI curriculum ☆61 Updated 4 months ago
- everything i know about CUDA and Triton ☆13 Updated 2 months ago
- Assignments from courses taught at IISc as part of the MTech AI curriculum ☆91 Updated 2 months ago
- A category-wise collection of 200+ LLM survey papers. ☆91 Updated last week
- 100 days of building GPU kernels! ☆336 Updated this week
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code. ☆332 Updated last month
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆47 Updated 10 months ago
- So, I trained a 130M Llama architecture I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset… ☆14 Updated 3 weeks ago
- Fine-tune an LLM to perform batch inference and online serving. ☆107 Updated this week
- ☆153 Updated 3 months ago
- Building GPT ... ☆17 Updated 4 months ago
- small auto-grad engine inspired by Karpathy's micrograd and PyTorch ☆251 Updated 4 months ago
- Notebooks for fine-tuning PaliGemma ☆99 Updated 3 months ago
- 100 days of learning & making kernels in CUDA / Triton ☆21 Updated 3 weeks ago
- When Philosophy meets AI Agents ☆128 Updated this week
- Reference implementation of Mistral AI 7B v0.1 model. ☆28 Updated last year
- Transformers from scratch using PyTorch & NumPy. ☆22 Updated 2 months ago
- Coding an LLM and its building blocks from scratch. ☆33 Updated 2 weeks ago
- Distributed training (multi-node) of a Transformer model ☆64 Updated last year
- ☆22 Updated 6 months ago