IParraMartin / An-Explanation-Is-All-You-NeedLinks

The original transformer implementation from scratch. It contains informative comments on each block

☆35

Alternatives and similar repositories for An-Explanation-Is-All-You-Need

Users that are interested in An-Explanation-Is-All-You-Need are comparing it to the libraries listed below

Sorting:

merveenoyan / awesome-osml-for-devs
List of resources, libraries and more for developers who would like to build with open-source machine learning off-the-shelf
☆199Updated last year
tcapelle / llm_recipes
A set of scripts and notebooks on LLM finetunning and dataset creation
☆110Updated 9 months ago
ayulockin / neurips-llm-efficiency-challenge
Starter pack for NeurIPS LLM Efficiency Challenge 2023.
☆125Updated last year
0ssamaak0 / Karpathy-Neural-Networks-Zero-to-Hero
☆122Updated 5 months ago
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232Updated 8 months ago
hkproj / multi-latent-attention
☆43Updated last month
Vaibhavs10 / notebooks
☆128Updated 3 months ago
rasbt / RAGs
RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems
☆123Updated 5 months ago
muellerzr / minimal-trainer-zoo
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆197Updated last year
rasbt / pycon2024
Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024
☆246Updated last year
Vaibhavs10 / gpu-poor-llm-notebooks
☆74Updated 9 months ago
rasbt / dora-from-scratch
LoRA and DoRA from Scratch Implementations
☆206Updated last year
rashmimarganiatgithub / LLMS_Library_2023
LLM_library is a comprehensive repository serves as a one-stop resource hands-on code, insightful summaries.
☆69Updated last year
TrelisResearch / install-guides
Various installation guides for Large Language Models
☆71Updated 2 months ago
YuvrajSingh-mist / Paper-Replications
A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorch
☆309Updated this week
kmohan321 / LLMs
☆89Updated 3 months ago
Laz4rz / GPT-2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
☆173Updated 11 months ago
ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆72Updated this week
deshwalmahesh / DataScience-StudyMaterial
Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Sc…
☆119Updated 2 months ago
Sentdex / neural-net-internals-visualized
Visualizing some of the internals of a neural network during training and inference.
☆76Updated last year
AayushSameerShah / Neural-Net-Zero-to-Hero-with-Andrej
This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried …
☆113Updated last year
AviSoori1x / seemore
From scratch implementation of a vision language model in pure PyTorch
☆227Updated last year
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆159Updated 3 weeks ago
NielsRogge / tutorials
A repository containing general tutorials I'd like to share with the world.
☆45Updated 2 weeks ago
1y33 / 100Days
GPU Kernels
☆190Updated 2 months ago
0xD4rky / Vision-Transformers
This repo has all the basic things you'll need in-order to understand complete vision transformer architecture and its various implementa…
☆226Updated 6 months ago
kmohan321 / Research_Papers
☆46Updated 3 months ago
broskicodes / slms
Experimenting with small language models
☆68Updated last year
center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆282Updated 4 months ago
AI4Bharat / IndicLLMSuite
A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languages
☆107Updated 9 months ago