Trained a 114 million Parameter LLM from Scratch.
☆19Jul 21, 2024Updated last year
Alternatives and similar repositories for Training-a-Mini-114M-Parameter-Llama-3-like-Model-from-Scratch
Users that are interested in Training-a-Mini-114M-Parameter-Llama-3-like-Model-from-Scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CATBench, the Intel Cache Allocation Technology benchmarking suite described in our tech report, "Simple Cache Partitioning for Networked…☆12Oct 6, 2017Updated 8 years ago
- Small Rust script that cracked Yearn's v2 site password from a hashed copy☆14Dec 7, 2020Updated 5 years ago
- ☆12Dec 14, 2024Updated last year
- CacheDirector - Sending Packets to the Right Slice by Exploiting Intel Last-Level Cache Addressing☆11Apr 29, 2019Updated 7 years ago
- Sources and examples for ASPLOS20 paper☆14Jul 21, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆21May 20, 2025Updated last year
- Subgraphs that power the TreasureDAO ecosystem.☆10May 21, 2025Updated last year
- Mixture of Experts from scratch☆14Apr 12, 2024Updated 2 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- Recursive Bayesian Networks☆11May 11, 2025Updated last year
- ☆15Apr 18, 2023Updated 3 years ago
- Perl parser written in TypeScript☆10Jun 26, 2018Updated 8 years ago
- Generate Ethereum CREATE2 addresses☆12Aug 1, 2020Updated 5 years ago
- Evolutionary Mission Trajectory Generator (EMTG)☆19Dec 13, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆20Mar 18, 2026Updated 3 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆19Sep 13, 2024Updated last year
- My personal template for hardhat projects☆11Dec 24, 2021Updated 4 years ago
- A PyTorch implementation of Vector Quantized Variational Autoencoder (VQ-VAE) with EMA updates, pretrained encoder, and K-means initializ…☆22Mar 26, 2026Updated 3 months ago
- This repo implements Video generation model using Latent Diffusion Transformers(Latte) in PyTorch and provides training and inference cod…☆19Jan 6, 2025Updated last year
- Continual Learning with Gated Incremental Memories for Sequential Data Processing. IJCNN 2020. Continual Learning with Recurrent Neural N…☆15Oct 13, 2021Updated 4 years ago
- Basic autoencoder implementation with Keras.☆10Jul 25, 2018Updated 7 years ago
- AngularJS HTML5 routing with Play Framework 2.1☆30Aug 1, 2021Updated 4 years ago
- plget is a tool used to measure latency packets spent in network stack, NIC driver and on the wire, trace interpacket gap, based as on h/…☆18Nov 18, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A simple implementation of Llama 1, 2. Llama Architecture built from scratch using PyTorch all the models are built from scratch that inc…☆14May 6, 2024Updated 2 years ago
- A simple CMake template for a C++ executable using a static library for logic and GTest for unit testing☆11Mar 8, 2020Updated 6 years ago
- Enhanced PQOS (Intel RDT Software) with DDIO-related Functionalities☆16May 25, 2022Updated 4 years ago
- ☆16Jul 7, 2025Updated 11 months ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- Community Detection algorithms for LightGraphs☆14Mar 12, 2026Updated 3 months ago
- Environment equipped with reinforcement learning algorithms to train agents to play tic-tac-toe.☆13Mar 4, 2023Updated 3 years ago
- An AMM protocol supporting directional and bidirectional liquidity.☆16Jul 31, 2024Updated last year
- A Docker image project for compiling STM32 C/C++ projects☆11Jun 30, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆20Feb 2, 2025Updated last year
- Resources and Implementations of Generative Adversarial Nets which are focusing on how to control the generated images.☆11Apr 29, 2017Updated 9 years ago
- An ESBuild server for Dark Forest plugin development.☆16Jun 15, 2022Updated 4 years ago
- Intel pmem benchmarks☆18Mar 24, 2022Updated 4 years ago
- Nerf gun go brapp but like way more than you'd think☆19Jul 2, 2024Updated last year
- Generate your financial statement automatically from your account transactions☆12Feb 10, 2015Updated 11 years ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆22Jun 29, 2024Updated 2 years ago