This package contains the original 2012 AlexNet code.
☆2,846 · Mar 12, 2025 · Updated last year
Alternatives and similar repositories for AlexNet-Source-Code
Users that are interested in AlexNet-Source-Code are comparing it to the libraries listed below.
- LLM training in simple, raw C/CUDA ☆29,216 · Jun 26, 2025 · Updated 8 months ago
- FlashMLA: Efficient Multi-head Latent Attention Kernels ☆12,521 · Feb 6, 2026 · Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆74,135 · Updated this week
- Fully open reproduction of DeepSeek-R1 ☆25,953 · Nov 24, 2025 · Updated 4 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling ☆6,268 · Feb 27, 2026 · Updated 3 weeks ago
- Fast and memory-efficient exact attention ☆22,938 · Updated this week
- ☆102,364 · Aug 28, 2025 · Updated 6 months ago
- LLM inference in C/C++ ☆98,911 · Updated this week
- ☆91,959 · Jun 27, 2025 · Updated 8 months ago
- Minimal reproduction of DeepSeek R1-Zero ☆12,963 · Feb 27, 2026 · Updated 3 weeks ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆26,969 · Jan 9, 2026 · Updated 2 months ago
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training. ☆2,934 · Jan 14, 2026 · Updated 2 months ago
- Development repository for the Triton language and compiler ☆18,708 · Updated this week
- A high-performance distributed file system designed to address the challenges of AI training and inference workloads. ☆9,770 · Mar 9, 2026 · Updated 2 weeks ago
- Inference code for Llama models ☆59,250 · Jan 26, 2025 · Updated last year
- Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation ☆7,972 · May 15, 2025 · Updated 10 months ago
- DeepEP: an efficient expert-parallel communication library ☆9,053 · Feb 9, 2026 · Updated last month
- Inference Llama 2 in one file of pure C ☆19,302 · Aug 6, 2024 · Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆55,432 · Nov 12, 2025 · Updated 4 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆41,869 · Mar 18, 2026 · Updated last week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆158,060 · Mar 18, 2026 · Updated last week
- SGLang is a high-performance serving framework for large language models and multimodal models. ☆24,829 · Updated this week
- The official Meta Llama 3 GitHub site ☆29,291 · Jan 26, 2025 · Updated last year
- A generative world for general-purpose robotics & embodied AI learning. ☆28,335 · Updated this week
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step ☆89,206 · Updated this week
- s1: Simple test-time scaling ☆6,646 · Jun 25, 2025 · Updated 9 months ago
- Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally. ☆57,673 · Updated this week
- Open-Sora: Democratizing Efficient Video Production for All ☆28,728 · Apr 30, 2025 · Updated 10 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,385 · Jul 1, 2024 · Updated last year
- Paragraph-by-paragraph close readings of classic and new deep learning papers ☆32,727 · Mar 22, 2025 · Updated last year
- verl: Volcano Engine Reinforcement Learning for LLMs ☆20,097 · Updated this week
- NanoGPT (124M) in 2 minutes ☆5,003 · Mar 17, 2026 · Updated last week
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi… ☆53,751 · Sep 18, 2024 · Updated last year
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ☆165,557 · Mar 19, 2026 · Updated last week
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆98,480 · Updated this week
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning ☆22,612 · Updated this week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image ☆32,946 · Feb 18, 2026 · Updated last month
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆68,728 · Mar 18, 2026 · Updated last week
- No fortress, purely open ground. OpenManus is Coming. ☆55,377 · Feb 11, 2026 · Updated last month