usamec / lowmem_finetuningLinks
Low memory full parameter finetuning of LLMs
☆53Updated 3 months ago
Alternatives and similar repositories for lowmem_finetuning
Users that are interested in lowmem_finetuning are comparing it to the libraries listed below
Sorting:
- An introduction to LLM Sampling☆79Updated 11 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆72Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated 3 weeks ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆66Updated this week
- ☆46Updated 7 months ago
- Simple GRPO scripts and configurations.☆59Updated 9 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆148Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆107Updated 8 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆248Updated last month
- code for training & evaluating Contextual Document Embedding models☆200Updated 6 months ago
- ☆68Updated 5 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆97Updated 3 months ago
- smolLM with Entropix sampler on pytorch☆150Updated last year
- ☆51Updated 9 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆299Updated 2 weeks ago
- ☆86Updated 4 months ago
- ☆48Updated last year
- Train your own SOTA deductive reasoning model☆108Updated 8 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆84Updated 2 months ago
- Exploring Applications of GRPO