computerhistory / AlexNet-Source-Code
This package contains the original 2012 AlexNet code.
☆2,736 Updated 7 months ago
Alternatives and similar repositories for AlexNet-Source-Code
Users who are interested in AlexNet-Source-Code are comparing it to the repositories listed below.
- The simplest, fastest repository for training/finetuning small-sized VLMs. ☆4,136 Updated this week
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization ☆5,619 Updated 2 weeks ago
- s1: Simple test-time scaling ☆6,567 Updated 3 months ago
- Code for BLT research paper ☆1,995 Updated 4 months ago
- ☆1,195 Updated 3 months ago
- ☆5,429 Updated 8 months ago
- This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov ☆1,946 Updated 4 months ago
- Sky-T1: Train your own O1 preview model within $450 ☆3,341 Updated 3 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,609 Updated 6 months ago
- NanoGPT (124M) in 3 minutes ☆3,176 Updated 3 months ago
- Minimalistic 4D-parallelism distributed training framework for educational purposes ☆1,856 Updated last month
- Everything about the SmolLM and SmolVLM family of models ☆3,314 Updated last month
- PyTorch code and models for VJEPA2 self-supervised learning from video. ☆2,295 Updated last month
- Minimal reproduction of DeepSeek R1-Zero ☆12,258 Updated 5 months ago
- Muon is an optimizer for hidden layers in neural networks ☆1,888 Updated 3 months ago
- ☆1,536 Updated last week
- Official PyTorch implementation for "Large Language Diffusion Models" ☆3,030 Updated this week
- Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning ☆3,460 Updated 3 months ago
- Building DeepSeek R1 from Scratch ☆708 Updated 6 months ago
- Large Concept Models: Language modeling in a sentence representation space ☆2,291 Updated 8 months ago
- Textbook on reinforcement learning from human feedback ☆1,259 Updated 2 weeks ago
- llama3 implementation one matrix multiplication at a time ☆15,172 Updated last year
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe… ☆3,728 Updated 4 months ago
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024. ☆1,379 Updated 5 months ago
- Interactive PyTorch forward pass visualization in notebooks ☆594 Updated 2 weeks ago
- Code release for DynamicTanh (DyT) ☆1,019 Updated 6 months ago
- Awesome Reasoning LLM Tutorial/Survey/Guide ☆2,100 Updated 3 months ago
- The best ChatGPT that $100 can buy. ☆19,081 Updated this week
- DataComp for Language Models ☆1,375 Updated last month
- Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw ☆562 Updated 10 months ago