LLMem: GPU Memory Estimation for Fine-Tuning Pre-Trained LLMs
☆29May 31, 2025Updated 9 months ago
Alternatives and similar repositories for LLMem
Users that are interested in LLMem are comparing it to the libraries listed below.
- Deployment code for image generative AI and other related image based tasks.☆22May 15, 2023Updated 2 years ago
- PyTorch Quantization Framework For OCP MX Datatypes.☆16May 30, 2025Updated 9 months ago
- ☆33Jul 8, 2024Updated last year
- ☆13May 21, 2024Updated last year
- DELTA-pytorch: DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation☆12Apr 16, 2024Updated last year
- Find the gender of a German word so you know which article to use (Der, Die, Das, Ein, Eine)☆10Nov 23, 2022Updated 3 years ago
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆12Feb 16, 2025Updated last year
- Driving Snax with MLIR☆20Mar 19, 2026Updated last week
- ☆16Sep 25, 2025Updated 6 months ago
- A PTA exported exercise paper format helper which cleans the results.☆12Jan 7, 2025Updated last year
- Zero Bubble Pipeline Parallelism☆452May 7, 2025Updated 10 months ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 4 months ago
- Contains Colab notebooks that show cool use cases of different GCP ML APIs.☆10Nov 5, 2020Updated 5 years ago
- This repository contains the official implementation of the paper entitled "FedAPEN: Personalized Cross-silo Federated Learning with…☆14Dec 4, 2023Updated 2 years ago
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- ☆10Dec 21, 2024Updated last year
- ☆17Aug 29, 2024Updated last year
- Streamlit Multi AI Platform Chat App☆10Nov 5, 2024Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Jan 17, 2024Updated 2 years ago
- ☆20Nov 12, 2025Updated 4 months ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆42Jul 7, 2025Updated 8 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- ☆14May 13, 2024Updated last year
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- Paper: "Aggregating Capacity in FL through Successive Layer Training for Computationally-Constrained Devices"☆18Jan 10, 2024Updated 2 years ago
- JAX implementation of LLaMA, aiming to train LLaMA on Google Cloud TPU☆14Jul 22, 2023Updated 2 years ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆19Apr 1, 2025Updated 11 months ago
- Run GNS3 Server inside Docker☆11Oct 3, 2021Updated 4 years ago
- ☆12Sep 11, 2022Updated 3 years ago
- ☆16Jun 8, 2025Updated 9 months ago
- Another scheduled task for Zhejiang University (ZJU) health check-ins☆13May 8, 2022Updated 3 years ago
- Quantize transformers to any learned arbitrary 4-bit numeric format☆53Jan 25, 2026Updated 2 months ago
- ☆21Mar 26, 2025Updated last year
- Lab and homework code for a data structures and algorithms course, plus lecture slides (PPT)☆15Jan 10, 2019Updated 7 years ago
- ☆71Jul 11, 2024Updated last year
- Efficient Exploration through Bayesian Deep-Q Networks.☆18Mar 22, 2022Updated 4 years ago
- This program looks up the etymologies of words in a text file and color-codes the words according to their origin. It allows a writer to …☆14Jul 26, 2016Updated 9 years ago
- PyTorch distributed training from scratch (for educational purposes only)☆22Apr 12, 2025Updated 11 months ago
- Find the origin of words in every language using a Deep Neural Network trained to create an etymological map.☆22May 18, 2018Updated 7 years ago