A straightforward method for training your LLM, from downloading data to generating text.
☆549 · Aug 3, 2025 · Updated 8 months ago
Alternatives and similar repositories for train-llm-from-scratch
Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below.
- Understanding Large Language Transformer Architecture like a child ☆29 · Apr 3, 2024 · Updated 2 years ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture. ☆204 · May 12, 2024 · Updated last year
- A feedforward neural network from scratch ☆10 · Aug 5, 2024 · Updated last year
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner. ☆203 · Aug 23, 2024 · Updated last year
- An automated Python tool that uses LLMs and the internet to automatically fix your code until it runs perfectly. ☆31 · Jan 22, 2025 · Updated last year
- Maximizing the Performance of a Simple RAG using RL ☆90 · Mar 20, 2025 · Updated last year
- Implementation of 12 AI agents evaluation techniques ☆39 · Jul 31, 2025 · Updated 8 months ago
- Encountering 14 different Naive RAG fails and using KG to solve it ☆24 · Dec 4, 2025 · Updated 4 months ago
- A zero-dependency ML framework in C with a modern Python API for full control over execution and memory. ☆683 · Apr 10, 2026 · Updated last week
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 8 months ago
- Implemented a stable diffusion architecture using PyTorch. ☆84 · Jan 3, 2024 · Updated 2 years ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python ☆77 · Apr 4, 2025 · Updated last year
- DSA and Visualizations for various sorting algorithms ☆15 · Apr 9, 2025 · Updated last year
- A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch ☆83 · Jun 16, 2025 · Updated 10 months ago
- ☆15 · Nov 4, 2024 · Updated last year
- ☆11 · Jan 12, 2017 · Updated 9 years ago
- 100% FREE, Private (No Internet) DeepSeek’s Advanced RAG: Boost Your RAG Chatbot: Hybrid Retrieval (BM25 + FAISS) + Neural Reranking + H… ☆1,720 · Sep 1, 2025 · Updated 7 months ago
- An implementation of GraphRAG using open source small LLMs ☆14 · Nov 9, 2024 · Updated last year
- AI memory system combining vector search with temporal knowledge graph. Built-in cognitive engine for agents. Supports memory decay, cont… ☆70 · Updated this week
- Parameter-Efficient Fine-Tuning for Foundation Models ☆113 · Mar 31, 2025 · Updated last year
- ☆11 · Aug 3, 2024 · Updated last year
- High Performance FP8 GEMM Kernels for SM89 and later GPUs. ☆21 · Jan 24, 2025 · Updated last year
- Converting Unstructured Data to a Knowledge Graph: An End-to-End Pipeline ☆291 · Apr 14, 2025 · Updated last year
- Awesome AI Benchmarks ☆29 · Jan 16, 2026 · Updated 3 months ago
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step ☆90,803 · Apr 11, 2026 · Updated last week
- An end-to-end pipeline to optimize and host LLM for 100K parallel queries ☆36 · Jul 6, 2025 · Updated 9 months ago
- ☆21 · Apr 6, 2024 · Updated 2 years ago
- world's stupidest moe llm in 103M parameters ☆20 · Jul 18, 2025 · Updated 9 months ago
- Handling Big Data with Knowledge Graph: A Detailed Guide ☆30 · May 11, 2025 · Updated 11 months ago
- An example showcasing how to create an agent with persistent long-term memory using Atomic Agents ☆26 · Dec 15, 2024 · Updated last year
- An MCP server that provides persistent memory capabilities through a local knowledge graph, enabling AI assistants to maintain context ac… ☆21 · Dec 20, 2025 · Updated 4 months ago
- Run GEPA on your favorite non-python libraries. ☆34 · Jan 22, 2026 · Updated 2 months ago
- Web Crawler built using asynchronous Python and distributed task management that extracts and saves web data for analysis. ☆33 · Nov 6, 2025 · Updated 5 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management. ☆22 · Jan 5, 2026 · Updated 3 months ago
- Give your local LLM a real memory with a lightweight, fully local memory system. 100% offline and under your control. ☆71 · Sep 16, 2025 · Updated 7 months ago
- ☆24 · Feb 2, 2026 · Updated 2 months ago
- ☆240 · Mar 9, 2025 · Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆40 · Jan 4, 2024 · Updated 2 years ago
- Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi… ☆25 · Jun 7, 2025 · Updated 10 months ago