Minimal pretraining script for language modeling in PyTorch, with support for torch compilation and DDP. It includes a model implementation and a data preprocessing script.
☆42 · Dec 10, 2025 · Updated 2 months ago
Alternatives and similar repositories for plainLM
Users who are interested in plainLM are comparing it to the libraries listed below.
- ☆17 · Oct 25, 2022 · Updated 3 years ago
- For the reproduction of research by Agostinelli et al. Learning Activation Functions to Improve Deep Neural Networks. http://arxiv.org/ab… ☆19 · May 14, 2015 · Updated 10 years ago
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration ☆20 · Jun 27, 2025 · Updated 8 months ago
- PyTorch implementation of the paper The Lottery Ticket Hypothesis for Object Recognition ☆23 · Apr 22, 2021 · Updated 4 years ago
- KV Cache Steering for Inducing Reasoning in Small Language Models ☆46 · Jul 24, 2025 · Updated 7 months ago
- ☆29 · Nov 29, 2023 · Updated 2 years ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination ☆13 · Apr 29, 2025 · Updated 10 months ago
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch) ☆34 · May 21, 2023 · Updated 2 years ago
- Primus-SaFE (Stability and Fault Endurance) ☆52 · Updated this week
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation' ☆15 · Oct 11, 2025 · Updated 4 months ago
- Lattice Recurrent Unit: Improving Convergence and Statistical Efficiency for Sequence Modeling ☆36 · Jan 30, 2018 · Updated 8 years ago
- ☆35 · Sep 22, 2025 · Updated 5 months ago
- A nonparametric variational information bottleneck (NVIB) layer in Pytorch ☆11 · Apr 15, 2025 · Updated 10 months ago
- Fast, free, easy, and object-agnostic video anonymization ☆11 · Dec 12, 2020 · Updated 5 years ago
- Bayesian adaptive stimulus placement of psychometric function for MATLAB. ☆10 · Nov 7, 2018 · Updated 7 years ago
- ☆10 · Apr 24, 2024 · Updated last year
- An implementation of Face2Face and the like ☆13 · Jun 11, 2016 · Updated 9 years ago
- Generate a menu with selectable menu items as a string ☆12 · Dec 26, 2018 · Updated 7 years ago
- Open AI Gym Environment for the Dobot Magician Robotic Arm ☆12 · Jul 9, 2018 · Updated 7 years ago
- A node module for allowing programmatic control of the useful Packer.IO tool ☆16 · Nov 18, 2016 · Updated 9 years ago
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training. ☆19 · Feb 7, 2025 · Updated last year
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)" ☆10 · May 15, 2024 · Updated last year
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning" ☆14 · Nov 25, 2025 · Updated 3 months ago
- Official Implementation of Robustifying and Boosting Training-Free Neural Architecture Search ☆10 · Mar 12, 2024 · Updated last year
- Developing a legal research tool leveraging ChatGPT / GPT-4 ☆14 · Mar 10, 2024 · Updated 2 years ago
- Spell and pronounce words with a neural network ☆10 · Feb 13, 2017 · Updated 9 years ago
- Config files for my GitHub profile. ☆12 · Jul 18, 2024 · Updated last year
- ☆10 · Jun 3, 2019 · Updated 6 years ago
- Tensegrity Lab is for efficiently exploring spatial structures based on pure pairwise push and pull forces using Rust Language ☆11 · Feb 27, 2026 · Updated last week
- PCM audio sample rate conversion for Node.js ☆15 · May 13, 2013 · Updated 12 years ago
- Code to accompany the paper Sparse Linear Networks with a Fixed Butterfly Structure: Theory and Practice ☆10 · Aug 10, 2021 · Updated 4 years ago
- Accelerating Transfer Learning with Robust Neural Nets ☆11 · Oct 2, 2020 · Updated 5 years ago
- Node.js Logical reasoning machine (WIP) ☆10 · Dec 18, 2014 · Updated 11 years ago
- KMean Coreset evaluation and computation. ☆12 · Jun 6, 2017 · Updated 8 years ago
- Pytorch Implementation of ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation (https://arxiv.org/abs/1606.02147) ☆11 · Jan 24, 2020 · Updated 6 years ago
- Implementation of accurate coresets for known problems from the field of machine learning. ☆11 · Nov 21, 2019 · Updated 6 years ago
- ☆14 · Jul 18, 2025 · Updated 7 months ago
- Code for reproducing the results in "How Well do Sparse Imagenet Models Transfer?", presented at CVPR 2022 ☆10 · Jun 3, 2022 · Updated 3 years ago
- A Python implementation of a graph-based parser for Abstract Meaning Representation (AMR) ☆11 · Feb 2, 2018 · Updated 8 years ago