Building LLMs from scratch following the book from S. Raschka
☆33Mar 27, 2025Updated 11 months ago
Alternatives and similar repositories for LLM_from_scratch
Users that are interested in LLM_from_scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 10 months ago
- ☆14Mar 9, 2023Updated 3 years ago
- LLM as World Models using Bayesian inference☆17May 27, 2025Updated 9 months ago
- OcSort-Pip: Packaged version of the OcSort repository☆17Jan 6, 2023Updated 3 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- ☆21Apr 6, 2025Updated 11 months ago
- built a 124M param GPT☆23Jan 28, 2025Updated last year
- ☆14Jan 17, 2023Updated 3 years ago
- Repository for the companion Colab notebook of the Domain-Specific Small Language Models book.☆29Sep 9, 2025Updated 6 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆67Updated this week
- This repository is the official implementation of "DG-Mamba: Robust and Efficient Dynamic Graph Structure Learning with Selective State S…☆22Apr 17, 2025Updated 11 months ago
- ☆11Mar 18, 2024Updated 2 years ago
- An up-to-date & curated list of awesome layout to image papers, methods & resources.☆13Jun 28, 2024Updated last year
- ☆15Feb 18, 2023Updated 3 years ago
- Reproduction of DeepSeek-R1☆241Apr 14, 2025Updated 11 months ago
- ☆23Jun 2, 2016Updated 9 years ago
- ☆10Mar 28, 2022Updated 3 years ago
- AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)☆35Sep 30, 2025Updated 5 months ago
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆113Jun 4, 2025Updated 9 months ago
- Stable Diffusion in TensorRT 8.5+☆14Mar 19, 2023Updated 3 years ago
- 트랜스포머 블록을 활용한 상품명 자연어처리 기반 카테고리 분류 모델☆10Dec 5, 2022Updated 3 years ago
- PhonePi MCP enables seamless integration between desktop AI tools and your smartphone, providing 23+ direct actions including SMS messagi…☆34Apr 15, 2025Updated 11 months ago
- ROS 2 New Features [Video], published by Packt☆10Oct 28, 2022Updated 3 years ago
- Tensorflow implementation of pix2pix for creating music from a voice. Vocals2Song.☆17Sep 26, 2022Updated 3 years ago
- ☆16Jun 27, 2025Updated 8 months ago
- audio cfeatures extraction tool from wav to h5features format☆19May 24, 2019Updated 6 years ago
- Benchmark scripts for comparing tutorials in PyTorch and JAX☆14Aug 25, 2022Updated 3 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated last year
- The real GPT-4 with image access (You probably don't have access)☆12Mar 17, 2023Updated 3 years ago
- KopikatAPI is Python library for interacting with the Kopikat API.☆17Mar 16, 2026Updated last week
- ☆16Jun 26, 2023Updated 2 years ago
- LossHub: Loss Functions Library for Image Classification and Detection☆14Oct 9, 2022Updated 3 years ago
- This study was published in 2022 in a scientific journal with SCI-Expanded index. The tooth numbering module uses the FDI notation, which…☆13Aug 9, 2022Updated 3 years ago
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆29Feb 4, 2025Updated last year
- This repo lets you train GNN (MeshGraphNet, transformers, etc) to simulate physics on unstructured grids like meshes.☆37Mar 16, 2026Updated last week
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Sep 17, 2018Updated 7 years ago
- Bu Course LLM(Large Language Model) Fine Tune işlemlerini Türkçe klavuz olarak☆11Mar 29, 2025Updated 11 months ago
- Frame-agnostic XAI Library for Computer Vision, for understanding why models behave that way.☆11Feb 19, 2023Updated 3 years ago
- Converting Instance segmentation labels in COCO format to YOLOv5-seg☆13Feb 10, 2023Updated 3 years ago