a simplified version of Meta's Llama 3 model to be used for learning
☆44 · May 21, 2024 · Updated last year
Alternatives and similar repositories for minLlama3
Users that are interested in minLlama3 are comparing it to the libraries listed below.
- a simplified version of Google's Gemma model to be used for learning ☆26 · Mar 2, 2024 · Updated 2 years ago
- Eval LLMs ☆11 · May 12, 2024 · Updated last year
- ☆13 · May 9, 2025 · Updated 11 months ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO ☆30 · Apr 13, 2026 · Updated 2 weeks ago
- An implementation of the base GPT-3 model architecture from the OpenAI paper "Language Models are Few-Shot Learners" ☆20 · Jun 29, 2024 · Updated last year
- Python code to monitor a route at your desired frequency, using the Google Directions API ☆14 · Apr 14, 2024 · Updated 2 years ago
- customizable template GPT code designed for easy novel architecture experimentation ☆26 · Mar 19, 2025 · Updated last year
- A server for simple-bar that allows on the fly refreshes ☆19 · Apr 10, 2026 · Updated 2 weeks ago
- ☆17 · Updated this week
- Train toy models using multi-token prediction objective ☆14 · Apr 18, 2026 · Updated last week
- A fast vector database written in C. ☆35 · Updated this week
- Collect papers about Mamba (a selective state space model). ☆15 · Aug 6, 2024 · Updated last year
- Implementation of xLSTM in PyTorch from the paper: "xLSTM: Extended Long Short-Term Memory" ☆119 · Apr 13, 2026 · Updated 2 weeks ago
- Fork of diux-dev/imagenet18 ☆16 · Oct 4, 2018 · Updated 7 years ago
- A web client for Linux from scratch in C for a variety of alternative web protocols ☆17 · Nov 4, 2023 · Updated 2 years ago
- ☆12 · Dec 14, 2024 · Updated last year
- a WIP architecture designed to allow transformers to think in a manner without tokens ☆20 · Apr 12, 2024 · Updated 2 years ago
- ☆15 · May 25, 2021 · Updated 4 years ago
- Codebase for machine learning research in PyTorch. ☆14 · Jun 16, 2025 · Updated 10 months ago
- ☆18 · Oct 26, 2024 · Updated last year
- Template for writing reproducible machine learning papers ☆12 · May 18, 2023 · Updated 2 years ago
- PyTorch Implementation of Transformers Explained with Comments ☆16 · Apr 23, 2020 · Updated 6 years ago
- Remove generated stories with stray unicode characters ☆12 · Jan 3, 2024 · Updated 2 years ago
- Hypernetwork training considerations and implementation types in PyTorch. Includes classification and time-series examples alongside 1D G… ☆25 · Jan 4, 2023 · Updated 3 years ago
- The powerful Git GUI/Client application. ☆21 · Updated this week
- Vite + Mantine + Vanilla extract template ☆12 · Apr 21, 2026 · Updated last week
- Evaluating major LLMs on the Abstraction and Reasoning Corpus ☆17 · Nov 9, 2023 · Updated 2 years ago
- This repo implements Video generation model using Latent Diffusion Transformers(Latte) in PyTorch and provides training and inference cod… ☆18 · Jan 6, 2025 · Updated last year
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 · May 23, 2024 · Updated last year
- Code for our tutorial on Discrete Variational Autoencoders ☆33 · May 19, 2025 · Updated 11 months ago
- Identity-based encryption based on the Weil pairing ☆15 · Jul 22, 2017 · Updated 8 years ago
- ✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM ☆11 · Jun 16, 2025 · Updated 10 months ago
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)! ☆13 · Oct 22, 2023 · Updated 2 years ago
- Template & code for the Continual Learning EValuation Assessment (CLEVA) Compass ☆24 · Dec 13, 2023 · Updated 2 years ago
- Simulate and Render MuJoCo in the Browser with 3DGS. ☆43 · Apr 16, 2026 · Updated 2 weeks ago
- Tiny C Compiler - The Smallest ANSI C compiler ☆20 · Jan 2, 2026 · Updated 3 months ago
- A GPT with self-similar nested properties ☆20 · Mar 19, 2024 · Updated 2 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun. ☆18 · Dec 19, 2024 · Updated last year
- Code repository for the public reproduction of the language modelling experiments on "MatFormer: Nested Transformer for Elastic Inference… ☆31 · Nov 14, 2023 · Updated 2 years ago