Building LLaMA 4 MoE from Scratch
☆74Apr 15, 2025Updated last year
Alternatives and similar repositories for train-llama4
Users that are interested in train-llama4 are comparing it to the libraries listed below.
- Implementation of a GPT-4o-like multimodal model from scratch using Python☆77Apr 4, 2025Updated last year
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆82Aug 18, 2025Updated 8 months ago
- Train a 29M parameter GPT from Scratch☆36Mar 4, 2025Updated last year
- ☆11Feb 3, 2025Updated last year
- ☆27Jan 22, 2026Updated 3 months ago
- Rucio K8s tutorial☆11Sep 26, 2025Updated 7 months ago
- PyTorch implementation of GRPO.☆15Apr 21, 2025Updated last year
- A curated collection of prompts for Grok Imagine by xAI☆28Oct 19, 2025Updated 6 months ago
- ☆12Feb 27, 2024Updated 2 years ago
- Synthetic Data Generator for Machine Learning Pipelines☆33Sep 2, 2025Updated 8 months ago
- API for toxic text classification, using a pre-trained DistilBERT model trained on Kaggle datasets. It helps identify and handle toxic con…☆14Apr 30, 2024Updated 2 years ago
- Vietnamese Large Language Model (LLM) fine-tuned for the task of Question Answering within the medical and healthcare domain☆26Mar 1, 2024Updated 2 years ago
- The Multimodal Model for Vietnamese Visual Question Answering (ViVQA)☆21Jul 29, 2024Updated last year
- Our solution to ML Talent Match hackathon☆11Mar 22, 2024Updated 2 years ago
- Notes and commented code for RLHF (PPO)☆129Feb 27, 2024Updated 2 years ago
- Parallel_Computer_Architecture, a classic book☆17May 13, 2022Updated 3 years ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- ☆46May 24, 2025Updated 11 months ago
- ☆12Dec 14, 2024Updated last year
- ☆12Jun 2, 2024Updated last year
- A simple neural network written in C++17 with the Eigen library, supporting both forward and backward propagation.☆11Jul 27, 2024Updated last year
- Code from Chris Valasek @nudehaberdasher and Charlie Miller @0xcharlie car hack: http://blog.ioactive.com/2013/08/car-hacking-content.ht…☆15Oct 1, 2020Updated 5 years ago
- MLOps for Image Caption Generator.☆25Nov 27, 2023Updated 2 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated 11 months ago
- Hands-On Tutorial on Building Multimodal RAG Systems☆13Apr 10, 2025Updated last year
- A parser combinator in Ruby, with a pretty DSL☆11Jun 25, 2017Updated 8 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Langchain_CrewAI_Gemini - A Gemini AI-powered AI agent (multi-agent) project.☆14Mar 24, 2024Updated 2 years ago
- Minimal TPU implementation with 8x8 systolic array and PyTorch integration☆60Jan 26, 2026Updated 3 months ago
- A Beginner's Guide to Monetizing Your Python AI Chatbot☆16Apr 22, 2025Updated last year
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 9 months ago
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆207Aug 23, 2024Updated last year
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that …☆13Nov 20, 2024Updated last year
- The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Pos…☆71Sep 29, 2025Updated 7 months ago
- ☆211Jun 4, 2025Updated 11 months ago
- ☆24Dec 1, 2023Updated 2 years ago
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- Implementation of 12 AI agents evaluation techniques☆43Jul 31, 2025Updated 9 months ago
- Learn LangChain for Generative AI with OpenAI API using Python☆11Feb 15, 2024Updated 2 years ago