toheedakhtar / llm-scratchLinks
building a Large Language Model (LLM) from scratch.
☆31Updated 3 months ago
Alternatives and similar repositories for llm-scratch
Users that are interested in llm-scratch are comparing it to the libraries listed below
Sorting:
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility and…☆122Updated last month
- ☆89Updated last month
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch☆196Updated last month
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆29Updated last month
- A category wise collection of 200+ LLM survey papers.☆147Updated last month
- Fine tune Gemma 3 on an object detection task☆43Updated this week
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆60Updated last month
- ☆15Updated 9 months ago
- Transformers from scratch using PyTorch & NumPy.☆24Updated 3 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆14Updated 2 months ago
- An independent AI research program created by Harshit.☆94Updated 10 months ago
- repo of paper implementations☆19Updated 3 months ago
- Just like the beloved character Doraemon who pulls out gadgets from his pocket, this agent can dynamically create, save, and utilize its …☆16Updated 4 months ago
- agent-from-scratch is a Python-based repository designed for developers and researchers interested in understanding the inner workings of…☆84Updated 5 months ago
- This repository will contain the presentation and python jupyter notebooks for the DataHack Summit 2024 conference talk, Improving Real-w…☆113Updated 8 months ago
- ☆43Updated 7 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆202Updated last week
- GenAI Experimentation☆57Updated last month
- An example showing how A2A and MCP can be used together☆158Updated 2 weeks ago
- Various installation guides for Large Language Models☆68Updated last month
- A Hands on series on developing LLM applications☆64Updated 8 months ago
- ☆48Updated last month
- ☆53Updated 3 weeks ago
- everything i know about cuda and triton☆13Updated 4 months ago
- ☆28Updated last year
- A repository containing general tutorials I'd like to share with the world.☆45Updated last month
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆46Updated last year
- Fine-tune an LLM to perform batch inference and online serving.☆111Updated 3 weeks ago
- Just enough Kubernetes for you to fly☆287Updated 2 months ago
- Gen AI Large Language Model Projects☆61Updated last year