silvaxxx1 / MyLLM101

"LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"

☆27

Alternatives and similar repositories for MyLLM101:

Users that are interested in MyLLM101 are comparing it to the libraries listed below

SwekeR-463 / Papers-Implemented
repo of paper implementations
☆19Updated 2 months ago
1y33 / 100Days
GPU Kernels
☆172Updated last week
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆189Updated last week
kmohan321 / LLMs
☆87Updated last month
hkproj / 100-days-of-gpu
☆296Updated 3 weeks ago
0xD4rky / Vision-Transformers
This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…
☆216Updated 4 months ago
kmohan321 / Research_Papers
☆45Updated last month
ThinamXx / Meta-llama
Complete implementation of Llama2 with/without KV cache & inference 🚀
☆47Updated 11 months ago
victor-explore / AI-Q-Papers-IISC-Banglore
Question paper of courses taught at IISC as part of MTech AI curriculum
☆62Updated 5 months ago
MekkCyber / TritonAcademy
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
☆180Updated last week
AniruddhaChattopadhyay / Books
☆80Updated 2 weeks ago
hkproj / pytorch-transformer-distributed
Distributed training (multi-node) of a Transformer model
☆65Updated last year
a-hamdi / GPU
100 days of building GPU kernels!
☆399Updated last week
aniket-mish / cuda
everything i know about cuda and triton
☆13Updated 3 months ago
YuvrajSingh-mist / SmolLlama
So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…
☆14Updated last month
SwekeR-463 / 100kernels
100 days of learning & making kernels in cuda / triton
☆22Updated last month
toheedakhtar / llm-scratch
building a Large Language Model (LLM) from scratch.
☆31Updated 3 months ago
evintunador / triton_docs_tutorials
making the official triton tutorials actually comprehensible
☆27Updated last month
victor-explore / AI-Assignments-IISC-Banglore
Assignments of courses taught at IISC as part of MTech AI curriculum
☆93Updated 2 months ago
hkproj / triton-flash-attention
☆159Updated 4 months ago
ThinamXx / build-GPT
Building GPT ...
☆17Updated 5 months ago
cneuralnetwork / solving-ml-papers
just me trying to implement deep learning concepts in code
☆155Updated 2 weeks ago
cneuralnetwork / ML-Project-CLI
a simple CLI command that will create a template of a generic ML Project
☆79Updated 7 months ago
hesamsheikh / llm-mechanics
Coding an LLM and its building blocks from scratch.
☆34Updated last month
Cohere-Labs-Community / AI-Alignment-Cohort
☆23Updated 6 months ago
dswh / ai-research-program
An independent AI research program created by Harshit.
☆94Updated 9 months ago
Sakil786 / LLM-PlayLab
This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and…
☆118Updated 2 weeks ago
KalyanKS-NLP / LLM-Survey-Papers-Collection
A category wise collection of 200+ LLM survey papers.
☆129Updated last month
krupadav3 / Encoder-Block-in-CUDA
Here's all my Python/Numba (CUDA) code for the encoder block I made :)
☆60Updated last week
CisMine / GPU-in-ML-DL
Apply GPU in ML and DL
☆52Updated 2 months ago