BabyGPT: Build Your Own GPT Large Language Model from Scratch Pre-Training Generative Transformer Models: Building GPT from Scratch with a Step-by-Step Guide to Generative AI in PyTorch and Python
☆119Dec 5, 2023Updated 2 years ago
Alternatives and similar repositories for BabyGPT-Build_GPT_From_Scratch
Users that are interested in BabyGPT-Build_GPT_From_Scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SAM application for creating Billing Conductor custom line items to distribute SP/RI benefits purchased outside of billing groups☆17Apr 13, 2026Updated last month
- Recent papers on Graph Neural Networks-based Recommender System.☆12Aug 21, 2023Updated 2 years ago
- Build chatbots with GPT3. Write a text file, get a chat bot.☆15Nov 19, 2022Updated 3 years ago
- Simple implementation of a GPT (training and inference) in PyTorch.☆13Dec 11, 2023Updated 2 years ago
- DevOps with Docker course by the University of Helsinki, Course material☆12Sep 9, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Welcome to the k8s-mongo-app repository! This project aims to provide a seamless integration of MongoDB with Kubernetes, enabling users t…☆12Feb 12, 2025Updated last year
- This repository is a blueprint and starter kit for building high-performance, production-ready Java backend systems that can be implement…☆15Jan 9, 2026Updated 4 months ago
- Created with Stability AIʼs Stable Video Diffusion XT 1.1 Image-to-Video latent diffusion model (SVD XT 1.1)☆17Apr 12, 2024Updated 2 years ago
- White Cats define Pure functions☆17Nov 4, 2025Updated 6 months ago
- Distribute and install Go binaries via NPM☆12Mar 27, 2024Updated 2 years ago
- This is a modern RESTful API built with Node.js and Express, designed to interact with a PostgreSQL database.☆18Jan 29, 2026Updated 4 months ago
- Sample code for several design patterns in PHP 8.x☆17Sep 9, 2025Updated 8 months ago
- ☆22Oct 24, 2025Updated 7 months ago
- Transformers指导手册中文翻译项目☆13Dec 2, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Data Structures in Python☆10Updated this week
- Researching Physics in LJPW Space☆21Updated this week
- AI-powered FAQ Bot is a backend solution built with NodeJS (Express) that integrates with OpenAI to provide AI-generated answers to user …☆16Feb 19, 2025Updated last year
- The NumFOCUS DISCOVER Cookbook (Diverse & Inclusive Spaces and Conferences: Overall Vision and Essential Resources). A guide for organizi…☆17Mar 27, 2025Updated last year
- secure header report and best practices config for Apache, Nginx, lighttpd, Cloudflare, netlify☆14Dec 27, 2018Updated 7 years ago
- Edge Detection of Biological Cells (Python Image Processing Script)☆22Updated this week
- In this project I used ML modeling and data analysis to predict ad clicks and significantly improve ad campaign performance, resulting in…☆13Nov 6, 2023Updated 2 years ago
- People ask me about data science resources so I've curated some here: this is <<20% of the size of an 'awesome' list but has 80% of the v…☆11Jan 14, 2023Updated 3 years ago
- A GraphQL-based e-commerce API built with Node.js for efficient product, order, and user management.☆19Jan 17, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- C++ Encrypted SSL/TLS REVERSE SHELL, designed to provide secure, encrypted communication between a compromised client and an attacker, wh…☆122Oct 9, 2025Updated 7 months ago
- YesMan☆19Nov 8, 2023Updated 2 years ago
- Official source code for AAAI 2025 paper: CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendatio…☆18Dec 11, 2024Updated last year
- In this project, we have to create a predictive model which allows the company to maximize the profit of the next marketing campaign☆15Oct 18, 2025Updated 7 months ago
- CIM J2EE application with resource adapter access to Spark.☆15Mar 31, 2023Updated 3 years ago
- Forecasting Netflix Customer Retention based on Gaussian Process Regression☆14Jul 22, 2023Updated 2 years ago
- ☆13Nov 29, 2024Updated last year
- Single-user Matrix.org Application Service (AS) to bridge SMSes to the Matrix network!☆12Jul 10, 2018Updated 7 years ago
- Just a better dirbuster☆13Dec 8, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A lightning-fast workflow builder, it supports multimodal interaction, highly customizable extensions, and is intuitive to use even witho…☆20Jan 22, 2026Updated 4 months ago
- ☆27Nov 22, 2024Updated last year
- Mixture of Experts from scratch☆14Apr 12, 2024Updated 2 years ago
- Virtual host bruteforcer against given network range or single ip☆11Mar 21, 2019Updated 7 years ago
- ✨ Hello! | Bonjour! | Hallo! | Ciao! | مرحبًا! | ¡Hola! | Привет! | 你好!✨ This is my README repository will appear on my public profile.☆20Jan 26, 2026Updated 4 months ago
- Front-Commerce is an agnostic frontend for ecommerce based on headless commerce & modern technologies: NodeJS, React, GraphQL. Our fronte…☆14Dec 16, 2024Updated last year
- Random apps and utilities☆16Mar 1, 2024Updated 2 years ago