This package contains the original 2012 AlexNet code.
☆2,861 · Mar 12, 2025 · Updated last year
Alternatives and similar repositories for AlexNet-Source-Code
Users that are interested in AlexNet-Source-Code are comparing it to the libraries listed below.
- LLM training in simple, raw C/CUDA ☆29,511 · Jun 26, 2025 · Updated 9 months ago
- FlashMLA: Efficient Multi-head Latent Attention Kernels ☆12,558 · Apr 7, 2026 · Updated last week
- Fully open reproduction of DeepSeek-R1 ☆25,973 · Apr 2, 2026 · Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆76,536 · Updated this week
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling ☆6,319 · Mar 22, 2026 · Updated 3 weeks ago
- Fast and memory-efficient exact attention ☆23,344 · Updated this week
- ☆102,626 · Aug 28, 2025 · Updated 7 months ago
- AlphaFold 3 inference pipeline. ☆7,844 · Updated this week
- LLM inference in C/C++ ☆103,237 · Updated this week
- ☆91,961 · Jun 27, 2025 · Updated 9 months ago
- Minimal reproduction of DeepSeek R1-Zero ☆13,038 · Feb 27, 2026 · Updated last month
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆27,088 · Jan 9, 2026 · Updated 3 months ago
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training. ☆2,937 · Jan 14, 2026 · Updated 3 months ago
- Development repository for the Triton language and compiler ☆18,902 · Updated this week
- A high-performance distributed file system designed to address the challenges of AI training and inference workloads. ☆9,800 · Mar 30, 2026 · Updated 2 weeks ago
- Inference code for Llama models ☆59,324 · Jan 26, 2025 · Updated last year
- Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation ☆7,971 · May 15, 2025 · Updated 11 months ago
- DeepEP: an efficient expert-parallel communication library ☆9,105 · Updated this week
- Inference Llama 2 in one file of pure C ☆19,379 · Aug 6, 2024 · Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆56,599 · Nov 12, 2025 · Updated 5 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆42,029 · Updated this week
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆159,060 · Apr 9, 2026 · Updated last week
- The official Meta Llama 3 GitHub site ☆29,291 · Jan 26, 2025 · Updated last year
- SGLang is a high-performance serving framework for large language models and multimodal models. ☆25,643 · Updated this week
- A generative world for general-purpose robotics & embodied AI learning. ☆28,504 · Updated this week
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally. ☆61,312 · Updated this week
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step ☆90,803 · Updated this week
- s1: Simple test-time scaling ☆6,640 · Jun 25, 2025 · Updated 9 months ago
- Open-Sora: Democratizing Efficient Video Production for All ☆28,861 · Apr 9, 2026 · Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,417 · Jul 1, 2024 · Updated last year
- Paragraph-by-paragraph close readings of classic and new deep learning papers ☆32,838 · Mar 22, 2025 · Updated last year
- verl: Volcano Engine Reinforcement Learning for LLMs ☆20,603 · Updated this week
- NanoGPT (124M) in 2 minutes ☆5,070 · Mar 29, 2026 · Updated 2 weeks ago
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi… ☆53,928 · Sep 18, 2024 · Updated last year
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ☆168,287 · Apr 9, 2026 · Updated last week
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆99,047 · Updated this week
- CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image ☆33,184 · Mar 25, 2026 · Updated 3 weeks ago
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning ☆23,204 · Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆69,794 · Apr 6, 2026 · Updated last week