JohnMachado11 / Build-a-Large-Language-Model-from-Scratch
Building a GPT-like LLM from scratch with PyTorch.
☆228Updated 4 months ago
Alternatives and similar repositories for Build-a-Large-Language-Model-from-Scratch
Users that are interested in Build-a-Large-Language-Model-from-Scratch are comparing it to the libraries listed below
Sorting:
- Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques☆221Updated 3 weeks ago
- A category wise collection of 200+ LLM survey papers.☆140Updated last month
- Building Large Language Model Applications, Published by Packt☆322Updated 7 months ago
- ☆119Updated 7 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆60Updated last month
- ☆315Updated last month
- Unlocking Data with Generative AI and RAG, published by Packt☆92Updated 7 months ago
- Repository for the "Building LLMs for Production" book by Towards AI.☆460Updated 7 months ago
- This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pi…☆436Updated 2 months ago
- Repository for the book GPT-Agents, published by Manning Publications☆98Updated 6 months ago
- ☆80Updated last month
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆194Updated this week
- Maximizing the Performance of a Simple RAG using RL☆57Updated last month
- A straightforward method for training your LLM, from downloading data to generating text.☆316Updated last month
- 📚 Tutorial on building a modern search app for Amazon e-commerce products leveraging tabular semantic search and natural language querie…☆65Updated 3 weeks ago
- The repository explores various RAG techniques, including implementation guides, use cases, and best practices. Each article is designed …☆46Updated this week
- Here's all my Python/Numba (CUDA) code for the encoder block I made :)☆62Updated 3 weeks ago
- Let's build a real time fraud detection system using TurboML☆69Updated 2 months ago
- Unsloth Fine-tuning Notebooks for Google Colab, Kaggle, Hugging Face and more.☆311Updated this week
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆162Updated 8 months ago
- ☆52Updated last week
- A roadmap for "generative AI" learning resources☆243Updated 7 months ago
- LLMs in Finance - Generative AI - AI Agents☆482Updated last week
- This is the official companion repository for the book The Complete LangGraph Blueprint: Build 50+ AI Agents for Business Success. The re…☆41Updated last month
- ☆89Updated last month
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆164Updated last year
- ☆35Updated 2 weeks ago
- Model Activity Visualiser☆477Updated last month
- Question paper of courses taught at IISC as part of MTech AI curriculum☆63Updated 5 months ago
- Code repository for AI Builders Bootcamp #1☆66Updated 3 months ago