FareedKhan-dev / train-llm-from-scratchLinks
A straightforward method for training your LLM, from downloading data to generating text.
☆426Updated last month
Alternatives and similar repositories for train-llm-from-scratch
Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below
Sorting:
- Building a GPT-like LLM from scratch with PyTorch.☆288Updated 8 months ago
- ☆643Updated last week
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆182Updated last year
- Building DeepSeek R1 from Scratch☆698Updated 5 months ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆185Updated last year
- Maximizing the Performance of a Simple RAG using RL☆79Updated 5 months ago
- Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 ,A…☆388Updated last year
- Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…☆468Updated 9 months ago
- A Deep Research agent from scratch☆207Updated 4 months ago
- Model Activity Visualiser☆520Updated 5 months ago
- Implement a reasoning LLM in PyTorch from scratch, step by step☆1,293Updated this week
- 😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.☆637Updated 2 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆71Updated 5 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆490Updated 7 months ago
- Educational implementation of a small GPT model from scratch in a single Jupyter Notebook☆108Updated 7 months ago
- A fully custom chatbot built with Agentic RAG (Retrieval-Augmented Generation), combining Gemini models with a local knowledge base for a…☆155Updated 7 months ago
- Converting Unstructured Data to a Knowledge Graph: An End-to-End Pipeline☆243Updated 5 months ago
- Build datasets using natural language☆528Updated 4 months ago
- Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques☆323Updated 3 months ago
- A step by step implementation of a complex RAG pipeline to solve real world situations☆280Updated 2 months ago
- Ollama's Interactive Prompt Engineering Tutorial☆255Updated 9 months ago
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.☆1,052Updated last month
- Turn topics into essays in seconds!☆187Updated 2 months ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆392Updated 2 months ago
- A category wise collection of 200+ LLM survey papers.☆178Updated 5 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆480Updated last month
- CPU inference for the DeepSeek family of large language models in C++☆313Updated 3 months ago
- Building LLaMA 4 MoE from Scratch☆64Updated 5 months ago
- ☆536Updated 6 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆231Updated 4 months ago