pp1230/LLMGPUMemEstimator

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pp1230/LLMGPUMemEstimator)

pp1230 / LLMGPUMemEstimator

The GPU RAM Estimator provides a simple tool for estimating GPU memory usage during training and inference.

☆35

Alternatives and similar repositories for LLMGPUMemEstimator

Users that are interested in LLMGPUMemEstimator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Lyken17 / hf-torrent
View on GitHub
☆44Mar 22, 2024Updated 2 years ago
DjagbleyEmmanuel / llamafile-convert_gguf_UI
View on GitHub
This GUI aims to simplify the process of converting GGUF files to llamafile format by providing an intuitive and convenient way for users…
☆14Jan 2, 2026Updated 6 months ago
language-agent-tutorial / language-agent-tutorial.github.io
View on GitHub
[EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks
☆10Nov 27, 2024Updated last year
TugLLC / GenNext
View on GitHub
A script for transferring morphs between Daz3D Genesis figures
☆12May 23, 2019Updated 7 years ago
Tegridy-Code / Lars-Ulrich-Challenge
View on GitHub
Algorithmic and AI MIDI Drums Generator Implementation
☆13Apr 1, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ra2map / ra2map.github.io
View on GitHub
RA2 map making tutorial website.
☆16Jun 22, 2025Updated last year
allenai / chime
View on GitHub
Repository containing dataset, models and code associated with the CHIME project
☆18Aug 22, 2024Updated last year
jin1ming / rico-json-processing
View on GitHub
JSON Processing of RICO Dataset
☆15Sep 16, 2022Updated 3 years ago
Qengineering / Paddle-Raspberry-Pi
View on GitHub
Paddle installation wheels for Raspberry Pi 4 (64-bit OS)
☆15Sep 25, 2023Updated 2 years ago
siat-nlp / NLP-docs
View on GitHub
Docs of NLP/deep Learning/machine learning, etc. https://siat-nlp.github.io/docs
☆11Jul 17, 2019Updated 7 years ago
CaoHaiNam / Vietnamese-Address-Standardization
View on GitHub
RIVF 2021: Deep neural network based learning to rank for address standardization
☆10Jul 13, 2024Updated 2 years ago
lucianma05-create / Social-AI-Group
View on GitHub
A curated collection of paper summaries for Social AI research.
☆18Jul 9, 2026Updated last week
tetsu9923 / SciReviewGen
View on GitHub
Official dataset repository for "SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation."
☆21Jun 4, 2023Updated 3 years ago
jiangjiechen / HedModTmplGen
View on GitHub
Code for our ACL 2019 long paper: "Ensuring Readability and Data-fidelity using Head-modifier Templates in Deep Type Description Generati…
☆11Nov 5, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
The0mikkel / ollama-discord-bot
View on GitHub
Ollama Discord bot, for a chatty experience.
☆12Mar 13, 2025Updated last year
xubuvd / LLMs
View on GitHub
专注于中文领域大语言模型，落地到某个行业某个领域，成为一个行业大模型、公司级别或行业级别领域大模型。
☆125May 19, 2026Updated 2 months ago
HaozheZhao / MIC_tool
View on GitHub
☆14Nov 14, 2023Updated 2 years ago
ellenmellon / DIALKI
View on GitHub
DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization
☆10Aug 3, 2022Updated 3 years ago
momo-journey / mbart-chinese
View on GitHub
多语言降噪预训练模型MBart的中文生成任务
☆11May 27, 2021Updated 5 years ago
kaishxu / DFMed
View on GitHub
Code and data for "Medical Dialogue Generation via Dual Flow Modeling" (ACL 2023 Findings)
☆14Nov 22, 2023Updated 2 years ago
iwangjian / pyloader
View on GitHub
🐳 PyLoader: An asynchronous Python dataloader for loading big datasets, supporting PyTorch and TensorFlow 2.x.
☆11Aug 29, 2021Updated 4 years ago
simonucl / PolySkill
View on GitHub
Official implementation of PolySkill, a framework that enables web agents to learn generalizable and compositional skills through polymor…
☆15Jul 6, 2026Updated 2 weeks ago
rGitcy / TCM-Data-Mining_papers
View on GitHub
☆11Jan 9, 2019Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Yottaxx / NamedEntityRecognitionAsDependencyParsing
View on GitHub
2020-natural-language-processing-project
☆10Dec 18, 2020Updated 5 years ago
wwj718 / PyBaiduYuyin
View on GitHub
A python wrap for Baidu Yuyin API
☆10Aug 3, 2016Updated 9 years ago
wangjs9 / CARE-master
View on GitHub
PyTorch implementation of CARE
☆16Oct 6, 2023Updated 2 years ago
CliBench / CliBench
View on GitHub
CliBench: Multifaceted Evaluation of Large Language Models in Clinical Decisions on Diagnoses, Procedures, Lab Tests Orders and Prescript…
☆19Aug 9, 2024Updated last year
j-mahowald / clip-loc-maps
View on GitHub
Repository for the paper "Integrating Visual and Textual Inputs for Searching Large-Scale Map Collections with CLIP"
☆12Oct 1, 2024Updated last year
mapio / py-web-graph
View on GitHub
A simple package allowing to use WebGraph data in Python (via the Jython interpreter).
☆20Oct 21, 2020Updated 5 years ago
nicholasopuni31 / casio-music-data
View on GitHub
This repository contains MIDI files of renditions, including song bank/piano bank tunes and rhythms, from certain CASIO keyboards, with s…
☆16Jul 4, 2026Updated 2 weeks ago
iwangjian / TRIP
View on GitHub
[TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
☆14Oct 18, 2025Updated 9 months ago
LyzenX / BiliLiveProcessor
View on GitHub
处理b站录播姬的录播文件，可在不修改原始文件的条件下合并片段、渲染弹幕，自动处理分辨率、帧率、横屏竖屏等变化。
☆21Feb 9, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
caiyinqiong / study_notes
View on GitHub
This is my study notes for my PhD in AI, NLP, IR, and more.
☆17Nov 13, 2021Updated 4 years ago
wjn922 / Optimizer-Experiments-Pytorch
View on GitHub
SGD/ADAM/Amsgrad/AdamW/RAdam/Lookahead
☆10Nov 18, 2019Updated 6 years ago
Analytics-for-Forecasting / OpenForecasting
View on GitHub
A general framework for univariate time series forecasting.
☆10Apr 18, 2024Updated 2 years ago
frankkramer-lab / GPTNERMED
View on GitHub
GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.
☆15Oct 5, 2023Updated 2 years ago
RenzeLou / AAAR-1.0
View on GitHub
The source code for running LLMs on the AAAR-1.0 benchmark.
☆20Apr 5, 2025Updated last year
EIT-NLP / AccuracyParadox-RLHF
View on GitHub
[EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…
☆13Nov 11, 2024Updated last year
yulijia / LaTeX_UCASthesis
View on GitHub
LaTeX template of graduate Thesis [University of Chinese Academy of Sciences]
☆12Nov 7, 2017Updated 8 years ago