JohnMachado11 / Build-a-Large-Language-Model-from-Scratch
Building a GPT-like LLM from scratch with PyTorch.
☆259 Updated 6 months ago
Alternatives and similar repositories for Build-a-Large-Language-Model-from-Scratch
Users who are interested in Build-a-Large-Language-Model-from-Scratch are comparing it to the repositories listed below.
- ☆161 Updated 2 weeks ago
- Introduction to PyTorch, covering tensor initialization, operations, indexing, and reshaping. ☆324 Updated last week
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face" ☆225 Updated 3 months ago
- A straightforward method for training your LLM, from downloading data to generating text. ☆393 Updated last month
- A roadmap for "generative AI" learning resources ☆256 Updated 9 months ago
- Building Large Language Model Applications, Published by Packt ☆334 Updated 8 months ago
- Implementation of a GPT-4o-like multimodal model from scratch using Python ☆65 Updated 3 months ago
- ☆89 Updated 3 months ago
- This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pi… ☆462 Updated 3 months ago
- Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques ☆239 Updated 3 weeks ago
- Just enough Kubernetes for you to fly ☆391 Updated 3 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture. ☆178 Updated last year
- A repository of paper/architecture replications of classic/SOTA AI/ML papers in PyTorch ☆309 Updated 3 weeks ago
- Repository for the "Building LLMs for Production" book by Towards AI. ☆480 Updated 9 months ago
- Generative AI with Python and PyTorch, Second Edition - Published by Packt ☆73 Updated 3 months ago
- ☆124 Updated 9 months ago
- Question papers for courses taught at IISc as part of the MTech AI curriculum ☆66 Updated 7 months ago
- Machine Learning Q and AI book ☆569 Updated 9 months ago
- Repo of paper implementations ☆20 Updated 4 months ago
- ☆487 Updated 4 months ago
- ☆349 Updated 3 months ago
- Building a Large Language Model (LLM) from scratch. ☆31 Updated 5 months ago
- A category-wise collection of 200+ LLM survey papers. ☆160 Updated 3 months ago
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner. ☆171 Updated 10 months ago
- Here's all my Python/Numba (CUDA) code for the encoder block I made :) ☆65 Updated 2 months ago
- ☆600 Updated this week
- Contains the public resources of Hands on GenAI book ☆168 Updated 6 months ago
- Transformers from scratch using PyTorch & NumPy. ☆35 Updated 5 months ago
- ☆85 Updated 2 months ago
- ☆198 Updated this week