Trained a 114 million Parameter LLM from Scratch.
☆19Jul 21, 2024Updated last year
Alternatives and similar repositories for Training-a-Mini-114M-Parameter-Llama-3-like-Model-from-Scratch
Users that are interested in Training-a-Mini-114M-Parameter-Llama-3-like-Model-from-Scratch are comparing it to the libraries listed below
Sorting:
- A curated collection of prompts for Grok Imagine by xAI☆23Oct 19, 2025Updated 4 months ago
- Estimate geoadditive spatial or spatio-temporal econometric models☆12Jul 4, 2022Updated 3 years ago
- CV and Deep Learning methods to analyze the data from Traffic Camera☆13Sep 29, 2018Updated 7 years ago
- ☆13Sep 5, 2025Updated 6 months ago
- ☆12Nov 24, 2020Updated 5 years ago
- Environment equipped with reinforcement learning algorithms to train agents to play tic-tac-toe.☆13Mar 4, 2023Updated 3 years ago
- This repository contains the complete source code that we used to conduct experiments in the paper: Text Window Denoising Autoencoder: Bu…☆15Jun 12, 2013Updated 12 years ago
- Pure Julia implementation for reading/writing data in the Avro format☆17May 10, 2024Updated last year
- Python library for interacting with Dask clusters in Saturn☆12Sep 4, 2025Updated 6 months ago
- A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initializ…☆21Dec 31, 2024Updated last year
- Sources and examples for ASPLOS20 paper☆14Jul 21, 2020Updated 5 years ago
- A replication of the paper "Adaptive Mixtures of Local Experts" applied to the CIFAR-10 image classification dataset.☆12Mar 19, 2021Updated 4 years ago
- Datasets for training and evaluating Ancient Greek sentence embedding models☆17Jul 12, 2024Updated last year
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆13May 6, 2024Updated last year
- ☆18Nov 1, 2021Updated 4 years ago
- Manipulation testing using local polynomial density methods.☆13Jan 23, 2025Updated last year
- Code for my Medium article: "How you can quickly deploy your ML models with FastAPI"☆12Mar 18, 2021Updated 4 years ago
- ☆15Jun 22, 2022Updated 3 years ago
- Not regularly updated clone of http://git.dpdk.org/dpdk-stable/ with the purpose to develop a new driver for corundum/mqnic (https://gith…☆15Aug 24, 2023Updated 2 years ago
- Simplistic Implementation of Zipformer:A faster and better encoder for automatic speech recognition in PyTorch☆18Jun 3, 2024Updated last year
- ☆18May 8, 2021Updated 4 years ago
- Open source version of DOCA GPUNetIO and DOCA Verbs libraries (limited features) to enable GDAKI technology on RDMA (IB and RoCE)☆32Feb 27, 2026Updated last week
- plget is a tool used to measure latency packets spent in network stack, NIC driver and on the wire, trace interpacket gap, based as on h/…☆16Nov 18, 2019Updated 6 years ago
- Hyperdimensional computing in Julia☆16Jan 21, 2026Updated last month
- ☆13Jan 20, 2017Updated 9 years ago
- A Julia wrapper for Fast Library for Approximate Nearest Neighbors (FLANN)☆18Apr 8, 2024Updated last year
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Repo for the paper: PerAda: Parameter-Efficient Federated Learning Personalization with Generalization Guarantees (CVPR 2024)☆23Aug 14, 2024Updated last year
- an open source dataset and generation pipeline for Large-Scale Reinforcement Learning☆17Apr 14, 2025Updated 10 months ago
- ☆20Feb 2, 2025Updated last year
- ☆49Sep 26, 2025Updated 5 months ago
- Efficient Neural Interaction Functions Search for Collaborative Filtering☆18Feb 15, 2020Updated 6 years ago
- Convolutional Neural Network for Click-Through Rate prediction.☆15Sep 28, 2016Updated 9 years ago
- ☆21Aug 27, 2023Updated 2 years ago
- PyTorch implementation of 2D Sharpened Cosine Similarity layer☆17Feb 1, 2022Updated 4 years ago
- An example using Elixir's GenStage☆19Aug 9, 2016Updated 9 years ago
- Conformer RNN-Transducer☆14May 25, 2022Updated 3 years ago
- PyTorch implementation of Conformer: Convolution-augmented Transformer for Speech Recognition☆18Apr 25, 2021Updated 4 years ago
- A simple CNN classifier example for PyTorch beginners.☆17Mar 18, 2021Updated 4 years ago