kmohan321 / LLMs
☆89Updated 4 months ago
Alternatives and similar repositories for LLMs
Users that are interested in LLMs are comparing it to the libraries listed below
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch☆324Updated 2 weeks ago
- Repository of implementations of classic and SOTA RL algorithms from scratch in PyTorch☆124Updated this week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆31Updated this week
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa…☆226Updated 7 months ago
- GPU Kernels☆193Updated 3 months ago
- repo of paper implementations☆20Updated 5 months ago
- Transformers from scratch using PyTorch & NumPy.☆42Updated 6 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆69Updated 4 months ago
- A category wise collection of 200+ LLM survey papers.☆171Updated 4 months ago
- building a Large Language Model (LLM) from scratch.☆33Updated 6 months ago
- Implementations of Papers that I read, you can read my breakdown in my blog☆80Updated last month
- a simple CLI command that will create a template of a generic ML Project☆82Updated 10 months ago
- Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques☆309Updated 2 months ago
- Question papers of courses taught at IISc as part of MTech AI curriculum☆71Updated 8 months ago
- ☆46Updated 4 months ago
- ☆362Updated 4 months ago
- Assignments of courses taught at IISc as part of MTech AI curriculum☆120Updated 6 months ago
- learningggggggg 🐳☆544Updated 4 months ago
- ☆64Updated this week
- ☆238Updated last week
- small autograd engine inspired by Karpathy's micrograd and PyTorch☆278Updated 9 months ago
- Basically a repo containing architectures/algorithms/papers from scratch in pytorch☆30Updated 2 months ago
- Fine tune Gemma 3 on an object detection task☆77Updated last month
- just me trying to implement deep learning concepts in code☆186Updated 4 months ago
- ☆43Updated 3 months ago
- ☆55Updated 3 months ago
- everything i know about cuda and triton☆13Updated 6 months ago
- So, I trained a 130M Llama architecture I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆15Updated 4 months ago
- agent-from-scratch is a Python-based repository designed for developers and researchers interested in understanding the inner workings of…☆91Updated 8 months ago
- 100 days of building GPU kernels!☆489Updated 3 months ago