kmohan321 / LLMs
☆89 Updated last week
Alternatives and similar repositories for LLMs
Users who are interested in LLMs are comparing it to the repositories listed below.
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch ☆402 Updated 2 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆141 Updated last month
- Repository of implementations of classic and SOTA RL algorithms from scratch in PyTorch ☆221 Updated last month
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa… ☆229 Updated last year
- repo of paper implementations ☆20 Updated 11 months ago
- Basically a repo containing architectures/algorithms/papers from scratch in PyTorch ☆30 Updated last week
- Assignments from courses taught at IISc as part of the MTech AI curriculum ☆140 Updated 11 months ago
- Question papers from courses taught at IISc as part of the MTech AI curriculum ☆105 Updated last year
- Building a Large Language Model (LLM) from scratch. ☆35 Updated 11 months ago
- Implementation of a GPT-4o-like multimodal model from scratch using Python ☆77 Updated 9 months ago
- GPU Kernels ☆218 Updated 9 months ago
- Transformers from scratch using PyTorch & NumPy. ☆49 Updated 11 months ago
- ☆118 Updated last month
- Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques ☆358 Updated 7 months ago
- ☆46 Updated 8 months ago
- ☆46 Updated 10 months ago
- a simple CLI command that will create a template of a generic ML Project ☆81 Updated last month
- learningggggggg 🐳 ☆573 Updated 10 months ago
- A category wise collection of 200+ LLM survey papers. ☆271 Updated 9 months ago
- Implementations of Papers that I read, you can read my breakdown in my blog ☆89 Updated 3 months ago
- small auto-grad engine inspired by Karpathy's micrograd and PyTorch ☆276 Updated last year
- just me trying to implement deep learning concepts in code ☆209 Updated 2 months ago
- So, I trained a 130M Llama architecture I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset… ☆16 Updated 10 months ago
- ☆412 Updated 9 months ago
- Building GPT ... ☆18 Updated last year
- Repository for ACM India Summer School on Generative AI for Text ☆13 Updated last year
- agent-from-scratch is a Python-based repository designed for developers and researchers interested in understanding the inner workings of… ☆96 Updated last year
- ☆121 Updated 3 weeks ago
- everything I know about CUDA and Triton ☆13 Updated last year
- Fine-tune an LLM to perform batch inference and online serving. ☆117 Updated 8 months ago