taehokim20/LLMem

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/taehokim20/LLMem)

taehokim20 / LLMem

LLMem: GPU Memory Estimation for Fine-Tuning Pre-Trained LLMs

☆30

Alternatives and similar repositories for LLMem

Users that are interested in LLMem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wanatpj / h_blind
View on GitHub
Extraction of watermark embedded with E_BLIND method on multiple digital works.
☆13Sep 14, 2016Updated 9 years ago
r-three / mats
View on GitHub
☆33Jul 8, 2024Updated 2 years ago
matthewrenze / jhu-concise-cot
View on GitHub
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆25Nov 25, 2024Updated last year
vtu81 / NaiveVQA
View on GitHub
A Visual Question Answering model implemented in MindSpore and PyTorch. The model is a reimplementation of the paper *Show, Ask, Attend, …
☆10Jul 27, 2021Updated 5 years ago
tom-urkin / Round-Robin
View on GitHub
This repository contains a SystemVerilog implementation of a parametrized Round Robin arbiter with three instantiation options
☆13Jan 28, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tatsu432 / BDCM
View on GitHub
☆17Mar 24, 2026Updated 4 months ago
jeremybennett / verilator
View on GitHub
A fork of the main Verilator project for development work. The changes here are in preparation for committing back to the main project.
☆18Nov 26, 2014Updated 11 years ago
nlesc-recruit / PowerSensor3
View on GitHub
PowerSensor is a low-cost, custom-built device that measures the instantaneous power consumption of GPUs and other devices at a high time…
☆11Jul 20, 2026Updated last week
pittisl / ElasticTrainer
View on GitHub
Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)
☆14Nov 1, 2023Updated 2 years ago
StephanTLavavej / STL
View on GitHub
MSVC's implementation of the C++ Standard Library.
☆12Updated this week
d-matrix-ai / dmx-compressor
View on GitHub
d-Matrix DMX Compressor: A Pytorch toolkit for nn.Module transformations supporting advanced quantization, sparsity, and elementwise func…
☆22Mar 5, 2026Updated 4 months ago
LLM4Ops / Cloud-OpsBench
View on GitHub
A Reproducible Benchmark for Agentic Root Cause Analysis in Cloud Systems
☆20Jul 17, 2026Updated last week
royeisen / reasoning_loading_bar
View on GitHub
☆56Jul 7, 2025Updated last year
Geralt-Targaryen / MC-Evaluation
View on GitHub
☆14May 21, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
letta-ai / letta-leaderboard
View on GitHub
An LLM leaderboard for stateful agents
☆21Oct 16, 2025Updated 9 months ago
FusionBolt / Rc-lang
View on GitHub
a simple programming language under development
☆11Dec 3, 2023Updated 2 years ago
sayakpaul / GCP-ML-API-Demos
View on GitHub
Contains Colab Notebooks show cool use-cases of different GCP ML APIs.
☆10Nov 5, 2020Updated 5 years ago
catid / cuda_float_compress
View on GitHub
Python package for compressing floating-point PyTorch tensors
☆13Jul 22, 2024Updated 2 years ago
allenai / easy-to-hard-generalization
View on GitHub
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Jan 17, 2024Updated 2 years ago
Toby-Shi-cloud / SysY-Compiler-2023
View on GitHub
BUAA Compiler Course Project 2023 by Toby Shi.
☆13Aug 20, 2024Updated last year
dylnbk / chatty-v2
View on GitHub
Streamlit Multi AI Platform Chat App
☆10Nov 5, 2024Updated last year
cnrv / riscv-innovations
View on GitHub
RISC-V is where innovation happens!
☆19Dec 25, 2023Updated 2 years ago
VITA-Group / o1-planning
View on GitHub
[NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability
☆42Apr 10, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
AlienHub / ChatMermaid
View on GitHub
使用自然语言绘制流程图，基于OpenAI
☆12Nov 13, 2023Updated 2 years ago
qagentur / texttunnel
View on GitHub
Python package for extractive NLP using the OpenAI API
☆17Aug 28, 2024Updated last year
ysc3839 / SingleExeXamlIsland
View on GitHub
☆15May 13, 2024Updated 2 years ago
hanxuhu / SeqIns
View on GitHub
The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…
☆30Nov 24, 2024Updated last year
facerless / yolov5-tensorflow
View on GitHub
☆12Mar 12, 2021Updated 5 years ago
k1l1 / SLT
View on GitHub
Paper: "Aggregating Capacity in FL through Successive Layer Training for Computationally-Constrained Devices"
☆18Jan 10, 2024Updated 2 years ago
shenxiangzhuang / presentia
View on GitHub
Elegant presentation template in LaTex and Typst
☆11Apr 20, 2025Updated last year
pampa-labs / llmate
View on GitHub
☆10Oct 31, 2023Updated 2 years ago
FedCampus / FedKit
View on GitHub
Mobile Federated Learning development kit for FedCampus
☆19Feb 3, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
minyoungg / LTE
View on GitHub
☆71Jul 11, 2024Updated 2 years ago
EdisonChen0816 / chatbot
View on GitHub
闲聊机器人
☆11Aug 12, 2020Updated 5 years ago
Merterm / Etymon
View on GitHub
Find the origin of words in every language using a Deep Neural Network trained to create an etymological map.
☆22May 18, 2018Updated 8 years ago
ruebenk / Computer-Science-Imp-Books
View on GitHub
Important Books for Computer Science Students for Algorithms
☆19Jan 25, 2016Updated 10 years ago
flyingzhao / mxnet_VanillaCNN
View on GitHub
VanillaCNN for face alignment
☆24Dec 10, 2016Updated 9 years ago
jamekuma / HIT_data_structure
View on GitHub
数据结构与算法课的实验、作业代码，以及课堂ppt
☆16Jan 10, 2019Updated 7 years ago
mukhal / intrinsic-source-citation
View on GitHub
[COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models
☆19Apr 1, 2025Updated last year