springer-llms-deep-dive / llms-deep-dive-tutorials
☆29Updated last month
Related projects ⓘ
Alternatives and complementary repositories for llms-deep-dive-tutorials
- LoRA and DoRA from Scratch Implementations☆188Updated 8 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆33Updated 2 weeks ago
- A comprehensive deep dive into the world of tokens☆214Updated 4 months ago
- Tutorial Materials for "The Fundamentals of Modern Deep Learning with PyTorch" workshop at PyCon 2024☆233Updated 5 months ago
- Solutions provided to Chip Huyen's Machine Learning Interview Book with GPT☆34Updated 10 months ago
- Direct Preference Optimization Implementation☆14Updated 9 months ago
- Finetune a pre-trained GPT-2 model to generate personalized product recommendations for users, based on product reviews and metadata☆15Updated last month
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023!☆107Updated 10 months ago
- Machine Learning Q and AI book☆342Updated last month
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆83Updated 7 months ago
- ☆21Updated last week
- ☆75Updated 4 months ago
- A 4-hour coding workshop to understand how LLMs are implemented and used☆744Updated last month
- Training small GPT-2 style models using Kolmogorov-Arnold networks.☆108Updated 5 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆128Updated 2 weeks ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆82Updated last year
- Representation Learning MSc course Summer Semester 2023☆70Updated last year
- Accelerate Model Training with PyTorch 2.X, published by Packt☆29Updated 5 months ago
- From scratch implementation of a vision language model in pure PyTorch☆160Updated 6 months ago
- Building GPT ...☆17Updated 2 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆92Updated last month
- A HuggingFace compatible Small Language Model trainer.☆73Updated 3 weeks ago
- The best repository showing why transformers might not be the answer for time series forecasting and showcasing the best SOTA non transfo…☆519Updated this week
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆83Updated last month
- Fast bare-bones BPE for modern tokenizer training☆142Updated 3 weeks ago
- This repository contains the collection of explorative notebooks pure in python and in the language that we, humans can read. Have tried …☆99Updated 6 months ago
- Highly commented implementations of Transformers in PyTorch☆128Updated last year
- I will build Transformer from scratch☆50Updated 5 months ago
- End-to-End LLM Guide☆97Updated 4 months ago
- Prune transformer layers☆64Updated 5 months ago