Building LLMs from scratch, following the book by S. Raschka
☆34Mar 27, 2025Updated last year
Alternatives and similar repositories for LLM_from_scratch
Users who are interested in LLM_from_scratch are comparing it to the libraries listed below.
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- ONNX-compatible StyleTTS2 code☆17Apr 4, 2026Updated last month
- OcSort-Pip: Packaged version of the OcSort repository☆18Jan 6, 2023Updated 3 years ago
- The GraphBench package.☆32Apr 30, 2026Updated last week
- Built a 124M-parameter GPT☆23Jan 28, 2025Updated last year
- Explorations into the proposed Streaming Deep Reinforcement Learning, from University of Alberta☆29Apr 27, 2026Updated last week
- Reinforcement Learning based on Stock Trading with multiple backends.☆11Mar 2, 2024Updated 2 years ago
- A Fast, Simplified Model for Molecular Generation with Improved Physical Quality☆28Oct 1, 2025Updated 7 months ago
- Artificial Intelligence for Robotics: Udacity CS373 homework. This repository contains all the homework…☆12Nov 12, 2017Updated 8 years ago
- This repository is the official implementation of "DG-Mamba: Robust and Efficient Dynamic Graph Structure Learning with Selective State S…☆22Apr 17, 2025Updated last year
- ☆11Jun 15, 2019Updated 6 years ago
- ☆10Mar 28, 2022Updated 4 years ago
- AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)☆40Sep 30, 2025Updated 7 months ago
- Category classification model based on natural language processing of product names, using transformer blocks☆10Dec 5, 2022Updated 3 years ago
- ROS 2 New Features [Video], published by Packt☆10Oct 28, 2022Updated 3 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 9 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- ☆40Aug 2, 2024Updated last year
- IMDB API for Python☆16Mar 10, 2024Updated 2 years ago
- ☆16Jun 26, 2023Updated 2 years ago
- This study was published in 2022 in a scientific journal with SCI-Expanded index. The tooth numbering module uses the FDI notation, which…☆13Aug 9, 2022Updated 3 years ago
- Frame-agnostic XAI Library for Computer Vision, for understanding why models behave that way.☆11Feb 19, 2023Updated 3 years ago
- This repo lets you train GNN (MeshGraphNet, transformers, etc) to simulate physics on unstructured grids like meshes.☆40Apr 23, 2026Updated last week
- Converting Instance segmentation labels in COCO format to YOLOv5-seg☆13Feb 10, 2023Updated 3 years ago
- Deep generative modeling of protein structural ensembles☆37Apr 20, 2026Updated 2 weeks ago
- A Small diffusion model in PyTorch.☆16Apr 18, 2024Updated 2 years ago
- R package for tracking Covid19 cases in San Francisco☆12Apr 2, 2023Updated 3 years ago
- LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation☆28Oct 18, 2024Updated last year
- Text-to-image generation using Huggingface stable diffusion ControlNet conditioning and AWS Translate's prompt translation function☆14Aug 25, 2023Updated 2 years ago
- LaunchPad is a lightweight Slurm job launcher designed for hyper-parameter search.☆11Aug 2, 2024Updated last year
- Python implementation of the offboard example at https://dev.px4.io/en/ros/mavros_offboard.html☆11Mar 23, 2018Updated 8 years ago
- Simplest online-softmax notebook for explaining Flash Attention☆16Jan 27, 2026Updated 3 months ago
- 🌟 (FE1 FPE) Make the index look like it is shuffled according to the range so that it is not conflicted without the actual shuffle. (Sup…☆16Jun 12, 2023Updated 2 years ago
- A list of various eye- and head-tracking software, products, etc. ℹ️ This is just a push-mirror. We develop here: https://codeberg.org/ey…☆21Apr 24, 2026Updated last week
- This bunch of scripts will make your conky 'clicky' :)☆17Dec 1, 2015Updated 10 years ago
- Train I3D on NTU-RGB+D dataset in keras☆11Feb 5, 2019Updated 7 years ago
- Docutils (a.k.a. reStructuredText, reST, RST) support for django☆12Updated this week
- gcc+newlib and gcc+glibc toolchains☆17Apr 12, 2019Updated 7 years ago
- ☆27Nov 3, 2025Updated 6 months ago