a simplified version of Meta's Llama 3 model to be used for learning
☆44May 21, 2024Updated last year
Alternatives and similar repositories for minLlama3
Users that are interested in minLlama3 are comparing it to the libraries listed below.
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆29Mar 22, 2026Updated 2 weeks ago
- ☆13May 9, 2025Updated 11 months ago
- An implementation of the base GPT-3 model architecture from the OpenAI paper "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- A curated collection of prompts for Grok Imagine by xAI☆26Oct 19, 2025Updated 5 months ago
- A server for simple-bar that allows on the fly refreshes☆19Feb 8, 2026Updated 2 months ago
- A logical, reasonably standardized, but flexible project structure for conducting ml research 🍪☆18Mar 31, 2026Updated last week
- Collect papers about Mamba (a selective state space model).☆14Aug 6, 2024Updated last year
- Implementation of xLSTM in PyTorch from the paper: "xLSTM: Extended Long Short-Term Memory"☆119Mar 20, 2026Updated 3 weeks ago
- Kanban board made with TailwindCSS☆11Jun 10, 2021Updated 4 years ago
- ☆12Dec 14, 2024Updated last year
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script☆14Mar 6, 2026Updated last month
- Template for writing reproducible machine learning papers☆11May 18, 2023Updated 2 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Mamba R1 represents a novel architecture that combines the efficiency of Mamba's state space models with the scalability of Mixture of Ex…☆25Oct 13, 2025Updated 5 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆18Sep 13, 2024Updated last year
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- This repo implements a video generation model using Latent Diffusion Transformers (Latte) in PyTorch and provides training and inference cod…☆18Jan 6, 2025Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- A replication of the paper "Adaptive Mixtures of Local Experts" applied to the CIFAR-10 image classification dataset.☆12Mar 19, 2021Updated 5 years ago
- Identity-based encryption based on the Weil pairing☆15Jul 22, 2017Updated 8 years ago
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆14May 6, 2024Updated last year
- Deep learning library implemented from scratch in numpy. Mixtral, Mamba, LLaMA, GPT, ResNet, and other experiments.☆54Apr 12, 2024Updated last year
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM☆11Jun 16, 2025Updated 9 months ago
- Template & code for the Continual Learning EValuation Assessment (CLEVA) Compass☆24Dec 13, 2023Updated 2 years ago
- Simulate and Render MuJoCo in the Browser with 3DGS.☆39Feb 4, 2026Updated 2 months ago
- A GPT with self-similar nested properties☆20Mar 19, 2024Updated 2 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference…☆31Nov 14, 2023Updated 2 years ago
- ☆20Feb 2, 2025Updated last year
- Minimized version of the Orchis server hosted at https://orchis.cherrymint.live☆10Nov 27, 2023Updated 2 years ago
- Low level implementation of the k-rpc network layer that the BitTorrent DHT uses☆25Feb 1, 2023Updated 3 years ago
- Simplistic implementation of Zipformer: A faster and better encoder for automatic speech recognition in PyTorch☆19Jun 3, 2024Updated last year
- ☆11Oct 12, 2021Updated 4 years ago
- This repo implements and trains DallE-1 on a synthetically generated dataset which has colored mnist images on texture/solid background a…☆13Oct 30, 2024Updated last year
- Trained a 114 million Parameter LLM from Scratch.☆19Jul 21, 2024Updated last year
- A simple P2P WebRTC chat using P2PT library leveraging WebTorrent trackers☆33Sep 7, 2025Updated 7 months ago
- golang implementation of ecash mint☆26Updated this week
- C implementation of the Kademlia-based Distributed Hash Table (DHT) used in the BitTorrent network (aka "mainline DHT")☆24Oct 12, 2020Updated 5 years ago
- LLaMA 2 implemented from scratch in PyTorch☆369Sep 25, 2023Updated 2 years ago