silvaxxx1 / MyLLM101
"LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"
☆27Updated last week
Alternatives and similar repositories for MyLLM101:
Users that are interested in MyLLM101 are comparing it to the libraries listed below
- repo of paper implementations☆19Updated 2 months ago
- GPU Kernels☆172Updated last week
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆189Updated last week
- ☆87Updated last month
- ☆296Updated 3 weeks ago
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆216Updated 4 months ago
- ☆45Updated last month
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 11 months ago
- Question paper of courses taught at IISC as part of MTech AI curriculum☆62Updated 5 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆180Updated last week
- ☆80Updated 2 weeks ago
- Distributed training (multi-node) of a Transformer model☆65Updated last year
- 100 days of building GPU kernels!☆399Updated last week
- everything i know about cuda and triton☆13Updated 3 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated last month
- 100 days of learning & making kernels in cuda / triton☆22Updated last month
- building a Large Language Model (LLM) from scratch.☆31Updated 3 months ago
- making the official triton tutorials actually comprehensible☆27Updated last month
- Assignments of courses taught at IISC as part of MTech AI curriculum☆93Updated 2 months ago
- ☆159Updated 4 months ago
- Building GPT ...☆17Updated 5 months ago
- just me trying to implement deep learning concepts in code☆155Updated 2 weeks ago
- a simple CLI command that will create a template of a generic ML Project☆79Updated 7 months ago
- Coding an LLM and its building blocks from scratch.☆34Updated last month
- ☆23Updated 6 months ago
- An independent AI research program created by Harshit.☆94Updated 9 months ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and…☆118Updated 2 weeks ago
- A category wise collection of 200+ LLM survey papers.☆129Updated last month
- Here's all my Python/Numba (CUDA) code for the encoder block I made :)☆60Updated last week
- Apply GPU in ML and DL☆52Updated 2 months ago