angry-kratos / Simple_Llama3_from_scratch
☆32Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for Simple_Llama3_from_scratch
- Collection of autoregressive model implementation☆67Updated this week
- ☆45Updated 2 months ago
- ☆118Updated 3 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆77Updated 5 months ago
- From scratch implementation of a vision language model in pure PyTorch☆162Updated 6 months ago
- End-to-End LLM Guide☆97Updated 4 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆50Updated 7 months ago
- Set of scripts to finetune LLMs☆36Updated 7 months ago
- My fork os allen AI's OLMo for educational purposes.☆28Updated this week
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆38Updated 5 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 5 months ago
- ☆40Updated 2 weeks ago
- ☆35Updated 3 weeks ago
- ☆87Updated 9 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 5 months ago
- ☆37Updated 5 months ago
- A minimal implementation of LLaVA-style VLM with interleaved image & text & video processing ability.☆84Updated 2 months ago
- An introduction to LLM Sampling☆64Updated last week
- ☆74Updated last month
- ☆81Updated last month
- This repository contains a better implementation of Kolmogorov-Arnold networks☆59Updated 6 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Genetics for Language Models☆12Updated 4 months ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆85Updated 2 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 2 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆53Updated 2 months ago
- ☆57Updated 11 months ago
- Prune transformer layers☆64Updated 5 months ago
- ☆49Updated 8 months ago
- Survey: A collection of AWESOME papers and resources on the latest research in Mixture of Experts.☆68Updated 3 months ago
- ☆93Updated last month