Building LLMs from scratch following the book from S. Raschka
☆34Mar 27, 2025Updated last year
Alternatives and similar repositories for LLM_from_scratch
Users that are interested in LLM_from_scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- A python package made to generate sequences (greedy and beam-search) from Pytorch (not necessarily HF transformers) models.☆18Dec 12, 2025Updated 6 months ago
- Onnx compatible styletts2 code☆16Apr 4, 2026Updated 2 months ago
- LLM as World Models using Bayesian inference☆20May 27, 2025Updated last year
- OcSort-Pip: Packaged version of the OcSort repository☆18Jan 6, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Stats 479 Project☆22May 9, 2019Updated 7 years ago
- Common CNN models defined for PyTorch Lightning☆10Jul 28, 2022Updated 3 years ago
- 8+ agents work together to build a game in pygame☆17Jul 27, 2024Updated last year
- Run Nx2 Cross Validation for multiple binary classifiers in parallel with optional downsampling☆13Jan 27, 2015Updated 11 years ago
- The GraphBench package.☆33May 28, 2026Updated 2 weeks ago
- built a 124M param GPT☆23Jan 28, 2025Updated last year
- Reinforcement Learning based on Stock Trading with multiple backends.☆11Mar 2, 2024Updated 2 years ago
- A Fast, Simplified Model for Molecular Generation with Improved Physical Quality☆28Oct 1, 2025Updated 8 months ago
- 机器人人工智能,优达学城cs373作业。 Artificial Intelligence for Robotics, this repository contains all the homework…☆12Nov 12, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆73May 29, 2026Updated 2 weeks ago
- Repository for the companion Colab notebook of the Domain-Specific Small Language Models book.☆54Jun 7, 2026Updated last week
- Learn Model Context Protocol with Python, published by Packt☆44Feb 20, 2026Updated 3 months ago
- An up-to-date & curated list of awesome layout to image papers, methods & resources.☆13Jun 28, 2024Updated last year
- ☆15Feb 18, 2023Updated 3 years ago
- This code is for the paper "e-TransUNet: TransUNet provides a strong spatial transformation for precise deforestation mapping" that is pu…☆15May 28, 2024Updated 2 years ago
- ☆10Mar 28, 2022Updated 4 years ago
- GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning☆30Mar 27, 2026Updated 2 months ago
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 트랜스포머 블록을 활용한 상품명 자연어처리 기반 카테고리 분류 모델☆10Dec 5, 2022Updated 3 years ago
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆118Jun 4, 2025Updated last year
- 🤗 MLOps with Hugging Face Spaces and Dagger☆47Jun 6, 2023Updated 3 years ago
- NASA SEES (2021): CNN Mosquito Detection Research☆11Mar 27, 2022Updated 4 years ago
- AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)☆45May 20, 2026Updated 3 weeks ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆38Aug 27, 2025Updated 9 months ago
- ☆10May 19, 2022Updated 4 years ago
- ☆15Jun 27, 2025Updated 11 months ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigat…☆62Jan 18, 2026Updated 4 months ago
- Master programming by recreating your favorite technologies from scratch.☆46Jul 5, 2025Updated 11 months ago
- Benchmark scripts for comparing tutorials in PyTorch and JAX☆14Aug 25, 2022Updated 3 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated 2 years ago
- ☆40Aug 2, 2024Updated last year
- Teaching AI to play the classic text adventure Zork using Large Language Models☆37Apr 5, 2026Updated 2 months ago
- ☆79Nov 30, 2025Updated 6 months ago