Trained a 114 million Parameter LLM from Scratch.
☆19Jul 21, 2024Updated last year
Alternatives and similar repositories for Training-a-Mini-114M-Parameter-Llama-3-like-Model-from-Scratch
Users that are interested in Training-a-Mini-114M-Parameter-Llama-3-like-Model-from-Scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Build machine learning models with scikit-learn power tools☆11Oct 28, 2022Updated 3 years ago
- PAGAN2 multiple sequence aligner☆12Jul 2, 2024Updated last year
- Calculate the RMSD between two protein structures☆12Jun 29, 2022Updated 3 years ago
- CATBench, the Intel Cache Allocation Technology benchmarking suite described in our tech report, "Simple Cache Partitioning for Networked…☆12Oct 6, 2017Updated 8 years ago
- A standalone CXL-enabled system simulator.☆21Apr 19, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆12Dec 14, 2024Updated last year
- A minimum demo for PyTorch distributed extension functionality for collectives.☆15Jul 29, 2024Updated last year
- This is the respository that holds the artifacts of ASPLOS'25 -- M5: Mastering Page Migration and Memory Management for CXL-based Tiered …☆17Apr 1, 2025Updated last year
- the imperative is to whip it etc☆10Sep 24, 2020Updated 5 years ago
- A General Toolkit for Advanced Online Learning, Online Active Learning, Online Semi-supervised Learning Approaches☆23Sep 28, 2025Updated 8 months ago
- Sources and examples for ASPLOS20 paper☆14Jul 21, 2020Updated 5 years ago
- ☆13May 13, 2022Updated 4 years ago
- Mixture of Experts from scratch☆14Apr 12, 2024Updated 2 years ago
- Python client for Jikan.moe, MyAnimeList unofficial API with good intentions.☆14Dec 20, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Recursive Bayesian Networks☆11May 11, 2025Updated last year
- ☆18Nov 1, 2021Updated 4 years ago
- Telegram bot for 'systemctl is-active/start/stop'ing services.☆13Mar 4, 2024Updated 2 years ago
- Evolutionary Mission Trajectory Generator (EMTG)☆19Dec 13, 2020Updated 5 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆19Sep 13, 2024Updated last year
- Distributed node-red project☆11Oct 22, 2017Updated 8 years ago
- A platform to do RNA science☆28Mar 7, 2021Updated 5 years ago
- Demo of building a flower image search using GNES Flow API☆14Mar 24, 2023Updated 3 years ago
- My personal template for hardhat projects☆11Dec 24, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Continual Learning with Gated Incremental Memories for Sequential Data Processing. IJCNN 2020. Continual Learning with Recurrent Neural N…☆15Oct 13, 2021Updated 4 years ago
- Basic autoencoder implementation with Keras.☆10Jul 25, 2018Updated 7 years ago
- CMIP6 climate data extraction and treatment by multi-polygon shapefile using ESGF NetCDF and WORLDCLIM datasets☆11Sep 25, 2021Updated 4 years ago
- Not regularly updated clone of http://git.dpdk.org/dpdk-stable/ with the purpose to develop a new driver for corundum/mqnic (https://gith…☆16Aug 24, 2023Updated 2 years ago
- ☆20Jul 5, 2024Updated last year
- Physics library for ngraph☆25Dec 9, 2022Updated 3 years ago
- Enhanced PQOS (Intel RDT Software) with DDIO-related Functionalities☆16May 25, 2022Updated 4 years ago
- Estimate geoadditive spatial or spatio-temporal econometric models☆12Jul 4, 2022Updated 3 years ago
- Course Project for COMP4471 on RWKV☆17Feb 11, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Environment equipped with reinforcement learning algorithms to train agents to play tic-tac-toe.☆13Mar 4, 2023Updated 3 years ago
- Mirror of Apache Spark☆10Jul 30, 2015Updated 10 years ago
- This repository contains the complete source code that we used to conduct experiments in the paper: Text Window Denoising Autoencoder: Bu…☆15Jun 12, 2013Updated 12 years ago
- Autcomplete XML package for Atom editor.☆13Dec 20, 2018Updated 7 years ago
- ☆20Feb 2, 2025Updated last year
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆21Jun 29, 2024Updated last year
- ☆12Jun 4, 2024Updated 2 years ago