Building LLaMA 4 MoE from Scratch
☆74Apr 15, 2025Updated last year
Alternatives and similar repositories for train-llama4
Users that are interested in train-llama4 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train a 29M parameter GPT from Scratch☆39Mar 4, 2025Updated last year
- ☆11Feb 3, 2025Updated last year
- 动手训练一个简单的CLIP模型,加深对CLIP的理解。☆26May 20, 2025Updated last year
- eIDAS Italian node☆11May 24, 2022Updated 4 years ago
- Synthetic Data Generator for Machine Learning Pipelines☆33Sep 2, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- API for toxic text classification, utilized pre-trained Distilbert and trained on Kaggle datasets. It helps identify and handle toxic con…☆14Apr 30, 2024Updated 2 years ago
- The Multimodal Model for Vietnamese Visual Question Answering (ViVQA)☆21Jul 29, 2024Updated last year
- pubg_sdk☆11Jul 26, 2020Updated 5 years ago
- Parallel_Computer_Architecture经典书籍☆17May 13, 2022Updated 4 years ago
- Automation Chatbot☆21Jan 1, 2025Updated last year
- ☆46May 24, 2025Updated last year
- ☆12Dec 14, 2024Updated last year
- Open-source examples and guides for building with the Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆38Nov 20, 2024Updated last year
- ☆12Jun 2, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TensorRT depth-anything for anyone and anywhere☆15Jan 29, 2024Updated 2 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- MLOps for Image Caption Generator.☆25Nov 27, 2023Updated 2 years ago
- Simple and efficient memory pool is implemented with C++11.☆10Jun 2, 2022Updated 3 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year
- pytorch版基于gpt+nezha的中文多轮Cdial☆11Oct 22, 2022Updated 3 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Langchain_CrewAI_Gemini - An Gemini AI powered AI Agent (Multi-Agent) Project.☆14Mar 24, 2024Updated 2 years ago
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration☆61Jan 26, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆28Jun 12, 2025Updated 11 months ago
- A Beginner's Guide to Monetizing Your Python AI Chatbot☆16Apr 22, 2025Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 9 months ago
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 9 months ago
- ☆211Jun 4, 2025Updated 11 months ago
- ☆18Oct 21, 2024Updated last year
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- ☆20Aug 5, 2025Updated 9 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆88Updated this week
- ☆11May 2, 2023Updated 3 years ago
- Flash Attention in ~100 lines of CUDA (forward pass only)☆12Jun 10, 2024Updated last year
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 4 years ago
- DEFCON 30 Car Hacking Village Presentation☆11Sep 11, 2022Updated 3 years ago
- A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initializ…☆22Mar 26, 2026Updated 2 months ago
- smart chinese LLm☆19Jan 31, 2024Updated 2 years ago