Niccolo-Ajroldi / plainLMView external linksLinks
Minimal pretraining script for language modeling in PyTorch. Supporting torch compilation and DDP. It includes a model implementation and a data preprocessing script.
☆41Dec 10, 2025Updated 2 months ago
Alternatives and similar repositories for plainLM
Users that are interested in plainLM are comparing it to the libraries listed below
Sorting:
- Real-Time RTUs☆11Jan 2, 2025Updated last year
- Code for implementing central flows☆42Sep 5, 2025Updated 5 months ago
- ☆17Oct 25, 2022Updated 3 years ago
- For the reproduction of research by Agostinelli et al. Learning Activation Functions to Improve Deep Neural Networks. http://arxiv.org/ab…☆19May 14, 2015Updated 10 years ago
- Flax (Jax) implementation of DeepSeek-R1-Distill-Qwen-1.5B with weights ported from Hugging Face.☆26Feb 20, 2025Updated 11 months ago
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration☆20Jun 27, 2025Updated 7 months ago
- PyTorch implementation of the paper The Lottery Ticket Hypothesis for Object Recognition☆23Apr 22, 2021Updated 4 years ago
- ☆28Nov 29, 2023Updated 2 years ago
- KV Cache Steering for Inducing Reasoning in Small Language Models☆46Jul 24, 2025Updated 6 months ago
- This repository is for setting-up cuda-9/8, nvidia-396/387/384 driver, OpenCV-3.3, ROS Kinetic, Tensorflow-1.11/1.7/1.4/1.2.1, Pytorch-0.…☆30Jul 7, 2022Updated 3 years ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆13Apr 29, 2025Updated 9 months ago
- Primus-SaFE(Stability and Fault Endurance)☆50Feb 8, 2026Updated last week
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation'☆15Oct 11, 2025Updated 4 months ago
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆34May 21, 2023Updated 2 years ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆25Jun 16, 2025Updated 7 months ago
- Code for reproducing the results from "CrAM: A Compression-Aware Minimizer" accepted at ICLR 2023☆10Mar 1, 2023Updated 2 years ago
- A nonparametric variational information bottleneck (NVIB) layer in Pytorch☆11Apr 15, 2025Updated 10 months ago
- Bayesian adaptive stimulus placement of psychometric function for MATLAB.☆10Nov 7, 2018Updated 7 years ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- Notes and code for Programming Massively Parallel Processors☆13Mar 29, 2025Updated 10 months ago
- Code for the paper 'Monte Carlo Tree Search for Asymmetric Trees'☆12May 24, 2018Updated 7 years ago
- Tensegrity Lab is for efficiently exploring spatial structures based on pure pairwise push and pull forces using Rust Language☆11Feb 4, 2026Updated last week
- ☆23Jul 11, 2025Updated 7 months ago
- Face2Faceの実装とか☆13Jun 11, 2016Updated 9 years ago
- Explanation of Mathematics used in Machine Learning Algorithms and some Projects☆14Jul 19, 2018Updated 7 years ago
- Code to accompany the paper Sparse Linear Networks with a Fixed Butterfly Structure: Theory and Practice☆10Aug 10, 2021Updated 4 years ago
- Accelerating Transfer Learning with Robust Neural Nets☆11Oct 2, 2020Updated 5 years ago
- Incremental Consistent Topological Sort for Append-only Logs☆14Jun 28, 2022Updated 3 years ago
- ☆14Aug 29, 2024Updated last year
- Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion☆11Apr 1, 2024Updated last year
- Automatically create an importmap script.☆14Oct 20, 2024Updated last year
- [ICML 2025] MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design☆22Jul 4, 2025Updated 7 months ago
- ☆12Aug 26, 2025Updated 5 months ago
- ☆17Dec 16, 2025Updated last month
- Calculate Mahalanobis distances for multivariate data.☆12Mar 23, 2020Updated 5 years ago
- KMean Coreset evaluation and computation.☆12Jun 6, 2017Updated 8 years ago
- Pytorch Implementation of ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation (https://arxiv.org/abs/1606.02147)☆11Jan 24, 2020Updated 6 years ago
- ☆14Dec 12, 2024Updated last year
- Node.js Logical reasoning machine (WIP)☆10Dec 18, 2014Updated 11 years ago