This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog post.
☆92Jul 14, 2023Updated 2 years ago
Alternatives and similar repositories for pytorch-memory-optim
Users that are interested in pytorch-memory-optim are comparing it to the libraries listed below
Sorting:
- Plan✕ is a platform for creating and publishing digital planning services☆17Updated this week
- Materials for "Transformers from the Ground Up" at PyData Jeddah on August 5, 2021☆20Aug 5, 2021Updated 4 years ago
- ☆10Nov 6, 2024Updated last year
- ☆10Apr 28, 2024Updated last year
- Streamline data pipelines for AI. Process datasets across 1000s of machines, and optimize data for blazing fast model training.☆16Sep 18, 2024Updated last year
- Gzip and nearest neighbors for text classification☆57Aug 1, 2023Updated 2 years ago
- https://footprints.baulab.info☆17Oct 4, 2024Updated last year
- ☆42Dec 6, 2025Updated 2 months ago
- Fluid Language Model Benchmarking☆26Sep 16, 2025Updated 5 months ago
- Scaling Sparse Fine-Tuning to Large Language Models☆18Jan 31, 2024Updated 2 years ago
- Testing paligemma2 finetuning on reasoning dataset☆18Dec 28, 2024Updated last year
- Loop Nest - Linear algebra compiler and code generator.☆20Oct 22, 2022Updated 3 years ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- Python intefrace for evaluation on chatgpt models☆19Feb 13, 2024Updated 2 years ago
- notebooks of cool EBM visualizations☆15Feb 12, 2021Updated 5 years ago
- ☆26Mar 15, 2023Updated 2 years ago
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- Linear Models with Python☆22Dec 12, 2024Updated last year
- ☆130Oct 1, 2024Updated last year
- ☆132Oct 25, 2023Updated 2 years ago
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- Simple unified interface to ROS1 / ROS2 Python API☆18Nov 20, 2024Updated last year
- Chrome Extension for YouTube. Acts as an assistant for the YouTube video you are watching☆23Apr 26, 2023Updated 2 years ago
- ☆239Nov 24, 2025Updated 3 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Dec 9, 2024Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆32Jul 28, 2023Updated 2 years ago
- Minimal open-source implementation of AlphaProof and HyperTree Proof Search.☆66Jan 31, 2026Updated last month
- Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this projec…☆33Jan 6, 2026Updated last month
- ☆30Dec 6, 2021Updated 4 years ago
- Google's Gemini implemented with GPT-4 Vision, Whisper and Resemble AI☆26Dec 9, 2023Updated 2 years ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆33May 1, 2025Updated 10 months ago
- ☆137Aug 19, 2024Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training☆132Apr 17, 2024Updated last year
- GPT2 fine-tuning pipeline with KerasNLP, TensorFlow, and TensorFlow Extended☆33Sep 6, 2023Updated 2 years ago
- Repository for SD-πXL: Generating Low-Resolution Quantized Imagery via Score Distillation (SIGGRAPH Asia 2024)☆44May 2, 2025Updated 10 months ago
- A puzzle to learn about prompting☆135May 12, 2023Updated 2 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆78Aug 17, 2024Updated last year
- MoCap data collected from 17 players of the Caldas-Colombia tennis league.☆11May 22, 2024Updated last year
- ☆16Oct 5, 2025Updated 4 months ago