kmohan321 / LLMs
☆89Updated 9 months ago
Alternatives and similar repositories for LLMs
Users who are interested in LLMs are comparing it to the repositories listed below
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch☆399Updated 2 months ago
- Repository of implementations of classic and SOTA RL algorithms from scratch in PyTorch☆216Updated last week
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated 2 weeks ago
- Transformers from scratch using PyTorch & NumPy.☆48Updated 11 months ago
- Building a Large Language Model (LLM) from scratch.☆35Updated 11 months ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa…☆229Updated last year
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆75Updated 9 months ago
- repo of paper implementations☆20Updated 10 months ago
- GPU Kernels☆218Updated 8 months ago
- ☆116Updated last month
- Implementations of Papers that I read, you can read my breakdown in my blog☆89Updated 2 months ago
- ☆59Updated 3 months ago
- A category wise collection of 200+ LLM survey papers.☆264Updated 9 months ago
- Basically a repo containing architectures/algorithms/papers from scratch in PyTorch☆30Updated 2 months ago
- Assignments of courses taught at IISC as part of MTech AI curriculum☆140Updated 10 months ago
- a simple CLI command that will create a template of a generic ML Project☆81Updated last month
- Question papers of courses taught at IISC as part of MTech AI curriculum☆105Updated last year
- ☆409Updated 9 months ago
- ☆46Updated 9 months ago
- ☆45Updated 7 months ago
- Just enough Kubernetes for you to fly☆575Updated 9 months ago
- Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques☆356Updated 6 months ago
- learningggggggg 🐳☆573Updated 9 months ago
- small auto-grad engine inspired from Karpathy's micrograd and PyTorch☆276Updated last year
- So, I trained a 130M Llama architecture I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆16Updated 9 months ago
- Fine tune Gemma 3 on an object detection task☆95Updated 6 months ago
- 100 days of building GPU kernels!☆561Updated 8 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 7 months ago
- Fine-tune an LLM to perform batch inference and online serving.☆115Updated 7 months ago
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆141Updated 11 months ago