Trained a 114 million Parameter LLM from Scratch.
☆19Jul 21, 2024Updated last year
Alternatives and similar repositories for Training-a-Mini-114M-Parameter-Llama-3-like-Model-from-Scratch
Users that are interested in Training-a-Mini-114M-Parameter-Llama-3-like-Model-from-Scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A curated collection of prompts for Grok Imagine by xAI☆28Oct 19, 2025Updated 6 months ago
- ☆19Nov 4, 2025Updated 5 months ago
- Evaluating RNA structure prediction using diverse thermodynamic prediction tasks and high-throughput datasets.☆18Jun 10, 2022Updated 3 years ago
- Simple flight dynamics model published in AIAA☆13Sep 6, 2019Updated 6 years ago
- CV and Deep Learning methods to analyze the data from Traffic Camera☆13Sep 29, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A standalone CXL-enabled system simulator.☆21Jan 10, 2026Updated 3 months ago
- ☆12Dec 14, 2024Updated last year
- Visualization, comparison, and analysis of RNA secondary structures via a cross-platform GUI☆17May 19, 2025Updated 11 months ago
- CacheDirector - Sending Packets to the Right Slice by Exploiting Intel Last-Level Cache Addressing☆11Apr 29, 2019Updated 6 years ago
- This is the respository that holds the artifacts of ASPLOS'25 -- M5: Mastering Page Migration and Memory Management for CXL-based Tiered …☆17Apr 1, 2025Updated last year
- PyTorch Implementation of Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model☆28Oct 10, 2024Updated last year
- A General Toolkit for Advanced Online Learning, Online Active Learning, Online Semi-supervised Learning Approaches☆23Sep 28, 2025Updated 6 months ago
- Sources and examples for ASPLOS20 paper☆14Jul 21, 2020Updated 5 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Mixture of Experts from scratch☆13Apr 12, 2024Updated 2 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Recursive Bayesian Networks☆11May 11, 2025Updated 11 months ago
- ☆15Apr 18, 2023Updated 3 years ago
- ☆18Nov 1, 2021Updated 4 years ago
- GREMLIN is a method to learn a statistical model of a protein family that captures both conservation and co-evolution patterns in the fam…☆23Jul 3, 2016Updated 9 years ago
- Generate Ethereum CREATE2 addresses☆12Aug 1, 2020Updated 5 years ago
- ☆20Mar 18, 2026Updated last month
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆18Sep 13, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An Apple Airplay client in Go (golang)☆14Jan 1, 2015Updated 11 years ago
- VirID: An integrated platform for the discovery and characterization of RNA Viruses☆23Jan 8, 2026Updated 3 months ago
- My personal template for hardhat projects☆11Dec 24, 2021Updated 4 years ago
- A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initializ…☆21Mar 26, 2026Updated 3 weeks ago
- ☆15Jun 22, 2022Updated 3 years ago
- ☆16Updated this week
- Not regularly updated clone of http://git.dpdk.org/dpdk-stable/ with the purpose to develop a new driver for corundum/mqnic (https://gith…☆15Aug 24, 2023Updated 2 years ago
- plget is a tool used to measure latency packets spent in network stack, NIC driver and on the wire, trace interpacket gap, based as on h/…☆16Nov 18, 2019Updated 6 years ago
- ☆20Jul 5, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆14May 6, 2024Updated last year
- Enhanced PQOS (Intel RDT Software) with DDIO-related Functionalities☆16May 25, 2022Updated 3 years ago
- Estimate geoadditive spatial or spatio-temporal econometric models☆12Jul 4, 2022Updated 3 years ago
- Community Detection algorithms for LightGraphs☆15Mar 12, 2026Updated last month
- Mirror of Apache Spark☆10Jul 30, 2015Updated 10 years ago
- This repository contains the complete source code that we used to conduct experiments in the paper: Text Window Denoising Autoencoder: Bu…☆15Jun 12, 2013Updated 12 years ago
- A Docker image project for compiling STM32 C/C++ projects☆11Jun 30, 2021Updated 4 years ago