kmohan321 / LLMs
☆89Updated 3 months ago
Alternatives and similar repositories for LLMs
Users who are interested in LLMs are comparing it to the repositories listed below
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆315Updated last week
- ☆79Updated last week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆30Updated 2 weeks ago
- building a Large Language Model (LLM) from scratch.☆32Updated 5 months ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa…☆228Updated 7 months ago
- repo of paper implementations☆20Updated 5 months ago
- Transformers from scratch using PyTorch & NumPy.☆42Updated 5 months ago
- A category wise collection of 200+ LLM survey papers.☆165Updated 3 months ago
- GPU Kernels☆191Updated 3 months ago
- ☆162Updated last month
- Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques☆266Updated last month
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆68Updated 3 months ago
- Basically a repo containing architectures/algorithms/papers from scratch in pytorch☆28Updated last month
- ☆43Updated 2 months ago
- learningggggggg 🐳☆541Updated 4 months ago
- Implementations of Papers that I read, you can read my breakdown in my blog☆76Updated 2 weeks ago
- Question paper of courses taught at IISC as part of MTech AI curriculum☆68Updated 8 months ago
- ☆57Updated this week
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆274Updated 8 months ago
- ☆46Updated 4 months ago
- So, I trained a 130M Llama architecture I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆15Updated 4 months ago
- Assignments of courses taught at IISC as part of MTech AI curriculum☆120Updated 5 months ago
- ☆355Updated 3 months ago
- a simple CLI command that will create a template of a generic ML Project☆81Updated 9 months ago
- just me trying to implement deep learning concepts in code☆178Updated 3 months ago
- 100 days of building GPU kernels!☆470Updated 3 months ago
- an LLM cookbook, for building your own from scratch, all the way from gathering data to training a model☆147Updated last year
- ☆54Updated 2 months ago
- Resources needed to start deep learning research. ML/DL/CV/NLP/ML-SYS/RL/Graphs/Maths/Med image lecture videos from professors at esteeme…☆84Updated last year
- Fine tune Gemma 3 on an object detection task☆73Updated 2 weeks ago