silvaxxx1 / MyLLMLinks
"LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"
☆30Updated this week
Alternatives and similar repositories for MyLLM
Users that are interested in MyLLM are comparing it to the libraries listed below
Sorting:
- repo of paper implementations☆20Updated 3 months ago
- GPU Kernels☆181Updated last month
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆264Updated this week
- ☆89Updated 2 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆184Updated 3 weeks ago
- ☆38Updated 3 weeks ago
- ☆341Updated 2 months ago
- ☆46Updated 2 months ago
- coding CUDA everyday!☆33Updated 2 months ago
- PyTorch implementations of algorithms from "Reinforcement Learning: An Introduction by Sutton and Barto", along with various RL research …☆145Updated this week
- everything i know about cuda and triton☆13Updated 4 months ago
- learning & making kernels in cuda / triton☆22Updated last week
- ☆162Updated last week
- 100 days of building GPU kernels!☆442Updated last month
- Fine-tune an LLM to perform batch inference and online serving.☆112Updated 3 weeks ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆63Updated 2 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆46Updated last year
- Fine tune Gemma 3 on an object detection task☆55Updated this week
- Question paper of courses taught at IISC as part of MTech AI curriculum☆66Updated 6 months ago
- making the official triton tutorials actually comprehensible☆37Updated 3 months ago
- a simple CLI command that will create a template of a generic ML Project☆81Updated 8 months ago
- building a Large Language Model (LLM) from scratch.☆31Updated 4 months ago
- ☆54Updated last month
- This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…☆218Updated 5 months ago
- Building GPT ...☆17Updated 6 months ago
- ☆41Updated last month
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆15Updated 2 months ago
- ☆172Updated 5 months ago
- Distributed training (multi-node) of a Transformer model☆71Updated last year
- Assignments of courses taught at IISC as part of MTech AI curriculum☆117Updated 4 months ago