LLMem: GPU Memory Estimation for Fine-Tuning Pre-Trained LLMs
☆30May 31, 2025Updated last year
Alternatives and similar repositories for LLMem
Users that are interested in LLMem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Quantization Framework For OCP MX Datatypes.☆16May 30, 2025Updated last year
- ☆33Jul 8, 2024Updated last year
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- ☆13May 21, 2024Updated 2 years ago
- DELTA-pytorch:DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation☆12Apr 16, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆15Feb 2, 2026Updated 5 months ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆13Feb 16, 2025Updated last year
- Driving Snax with MLIR☆23Apr 22, 2026Updated 2 months ago
- ☆16Sep 25, 2025Updated 9 months ago
- 浙江大学 2023 学年秋冬学期《数字逻辑设计》实验文档。☆12Jan 11, 2024Updated 2 years ago
- MSVC's implementation of the C++ Standard Library.☆12Jun 26, 2026Updated last week
- Zero Bubble Pipeline Parallelism☆459May 7, 2025Updated last year
- ☆10Jan 3, 2024Updated 2 years ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14May 20, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Paper Crawler for downloading academic papers ( ICML, ICLR, NIPS, UAI, ICRA, CoRL, AAAI, IJCAI )☆11May 14, 2024Updated 2 years ago
- [译] 面向数据科学的概率论☆17Jun 19, 2018Updated 8 years ago
- The repository of Identifying and Mitigating Position Bias.☆72Jun 13, 2025Updated last year
- This repository contains the official implementation of the paper entitled with "FedAPEN: Personalized Cross-silo Federated Learning with…☆14Dec 4, 2023Updated 2 years ago
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- ☆10Dec 21, 2024Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”.☆40Dec 31, 2024Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆42Apr 10, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 使用自然语言绘制流程图,基于OpenAI☆12Nov 13, 2023Updated 2 years ago
- hacking on the stanford natural language inference (SNLI) corpus (in theano)☆15Aug 28, 2016Updated 9 years ago
- Jupyter notebook with examples on how to visualize the dataset of personal texts 📱, after extracting from an iPhone with PhoneView.☆12Aug 16, 2020Updated 5 years ago
- ☆12Sep 11, 2022Updated 3 years ago
- Run GNS3 Server inside Docker☆11Oct 3, 2021Updated 4 years ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- Quantize transformers to any learned arbitrary 4-bit numeric format☆58Updated this week
- ☆71Jul 11, 2024Updated last year
- This program looks up the etymologies of words in a text file and color-codes the words according to their origin. It allows a writer to …☆14Jul 26, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆21Jul 24, 2025Updated 11 months ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆15Sep 28, 2024Updated last year
- Mobile Federated Learning development kit for FedCampus☆19Feb 3, 2024Updated 2 years ago
- Fast Symbolic Repair of Hardware Design Code☆39Jan 20, 2025Updated last year
- EMNLP 2022: Finding Dataset Shortcuts with Grammar Induction https://arxiv.org/abs/2210.11560☆59Feb 28, 2025Updated last year
- ☆23Mar 18, 2024Updated 2 years ago
- SFS: A Smart OS Scheduler for Serverless Function Workloads (SC'22)☆13Dec 15, 2022Updated 3 years ago