Building LLaMA 4 MoE from Scratch
☆76Apr 15, 2025Updated last year
Alternatives and similar repositories for train-llama4
Users that are interested in train-llama4 are comparing it to the libraries listed below.
- ☆11Feb 3, 2026Updated 4 months ago
- Train a 29M parameter GPT from Scratch☆44Mar 4, 2025Updated last year
- ☆12Feb 3, 2025Updated last year
- Train a simple CLIP model hands-on to deepen your understanding of CLIP.☆27May 20, 2025Updated last year
- eIDAS Italian node☆11May 24, 2022Updated 4 years ago
- MICRO 2024 Evaluation Artifact for FuseMax☆17Aug 26, 2024Updated last year
- A curated collection of prompts for Grok Imagine by xAI☆30Jun 6, 2026Updated last week
- ☆13Feb 27, 2024Updated 2 years ago
- Synthetic Data Generator for Machine Learning Pipelines☆33Sep 2, 2025Updated 9 months ago
- ☆17Apr 29, 2025Updated last year
- ☆13Dec 6, 2024Updated last year
- ☆14Jun 16, 2020Updated 6 years ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- API for toxic text classification, using pre-trained DistilBERT trained on Kaggle datasets. It helps identify and handle toxic con…☆14Apr 30, 2024Updated 2 years ago
- Vietnamese Large Language Model (LLM) fine-tuned for the task of Question Answering within the medical and healthcare domain☆26Mar 1, 2024Updated 2 years ago
- Classify documents using Python based on SVM and TF-IDF.☆15Nov 19, 2019Updated 6 years ago
- Our solution to ML Talent Match hackathon☆11Mar 22, 2024Updated 2 years ago
- Notes and commented code for RLHF (PPO)☆135Feb 27, 2024Updated 2 years ago
- ☆46May 24, 2025Updated last year
- ☆12Dec 14, 2024Updated last year
- Automation Chatbot☆20Jan 1, 2025Updated last year
- Open-source examples and guides for building with Qwen. Browse a collection of snippets, advanced techniques and walkthroughs.☆38Nov 20, 2024Updated last year
- ☆12Jun 2, 2024Updated 2 years ago
- AI-powered tool to transform raw notes into polished, professional formats☆35Aug 16, 2025Updated 10 months ago
- TensorRT depth-anything for anyone and anywhere☆15Jan 29, 2024Updated 2 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- A simple and efficient memory pool implemented in C++11.☆10Jun 2, 2022Updated 4 years ago
- Code from Chris Valasek @nudehaberdasher and Charlie Miller @0xcharlie car hack: http://blog.ioactive.com/2013/08/car-hacking-content.ht…☆15Oct 1, 2020Updated 5 years ago
- A Step-by-Step Implementation of Google Veo 3 Architecture from Scratch☆84Jun 16, 2025Updated last year
- Database kernel notes☆14Aug 18, 2022Updated 3 years ago
- Langchain_CrewAI_Gemini - A Gemini AI-powered AI Agent (Multi-Agent) Project.☆14Mar 24, 2024Updated 2 years ago
- Learning Pytorch☆13Oct 31, 2023Updated 2 years ago
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration☆63Jan 26, 2026Updated 4 months ago
- ☆28Jun 12, 2025Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 10 months ago
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆209Aug 23, 2024Updated last year
- This is the work from my learnings from the Data Science with Python course offered by DataCamp. This was a really helpful course as it s…☆10Mar 12, 2021Updated 5 years ago
- Python version of JustAuth; contributions welcome☆10Jun 11, 2021Updated 5 years ago