therealoliver / Deepdive-llama3-from-scratch
Implement llama3 inference step by step: grasp the core concepts, master the derivations, and write the code.
☆617 Updated 10 months ago
Alternatives and similar repositories for Deepdive-llama3-from-scratch
Users who are interested in Deepdive-llama3-from-scratch are comparing it to the libraries listed below.
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch ☆690 Updated 7 months ago
- Animating R1's thoughts. ☆384 Updated 11 months ago
- ☆250 Updated last year
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆627 Updated 9 months ago
- Examples and guides for using the VLM Run API ☆304 Updated 2 weeks ago
- Docker-based inference engine for AMD GPUs ☆231 Updated last year
- Run and explore Llama models locally with minimal dependencies on CPU ☆190 Updated last year
- Multimodal RAG to search and interact locally with technical documents of any kind ☆284 Updated 2 months ago
- High-Performance Implementation of OpenAI's TikToken. ☆467 Updated 6 months ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks ☆230 Updated 6 months ago
- ☆1,456 Updated 11 months ago
- ☆199 Updated 8 months ago
- R.L. methods and techniques. ☆199 Updated this week
- LLM Analytics ☆704 Updated last year
- A hub for various industry-specific schemas to be used with VLMs. ☆538 Updated last month
- A BERT that you can train on a (gaming) laptop. ☆210 Updated 2 years ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆254 Updated 2 years ago
- See Through Your Models ☆400 Updated 6 months ago
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit … ☆362 Updated 7 months ago
- Proof of thought: LLM-based reasoning using Z3 theorem proving with multiple backend support (SMT2 and JSON DSL) ☆364 Updated 2 months ago
- Fully neural approach for text chunking ☆406 Updated 2 months ago
- OpenCV+YOLO+LLAVA powered video surveillance system ☆779 Updated 2 months ago
- Implement recursion using English as the programming language and an LLM as the runtime. ☆239 Updated 2 years ago
- Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm ☆175 Updated 10 months ago
- Felafax is building AI infra for non-NVIDIA GPUs ☆570 Updated 11 months ago
- Integrate LLM in any pipeline - fit/predict pattern, JSON-driven flows, and built-in concurrency support. ☆606 Updated 10 months ago
- A character-level language diffusion model trained on Tiny Shakespeare ☆830 Updated this week
- ☆280 Updated 7 months ago
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat… ☆447 Updated last month
- This repo contains a new way to use bloom filters to do lossless video compression ☆250 Updated 7 months ago