LLMem: GPU Memory Estimation for Fine-Tuning Pre-Trained LLMs
☆29May 31, 2025Updated 10 months ago
Alternatives and similar repositories for LLMem
Users that are interested in LLMem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Jul 8, 2024Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆25Nov 25, 2024Updated last year
- This repository contains a SystemVerilog implementation of a parametrized Round Robin arbiter with three instantiation options☆13Jan 28, 2024Updated 2 years ago
- ☆18Oct 17, 2024Updated last year
- Code for paper "ElasticTrainer: Speeding Up On-Device Training with Runtime Elastic Tensor Selection" (MobiSys'23)☆14Nov 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- DELTA-pytorch:DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation☆12Apr 16, 2024Updated last year
- Vision-Language Models Toolbox: Your all-in-one solution for multimodal research and experimentation☆12Feb 16, 2025Updated last year
- Memory consistency model checking and test generation library.☆15Oct 14, 2016Updated 9 years ago
- 浙江大学 2023 学年秋冬学期《数字逻辑设计》实验文档。☆12Jan 11, 2024Updated 2 years ago
- A PTA exported exercise paper format helper which cleans the results.☆12Jan 7, 2025Updated last year
- a simple programming language under development☆11Dec 3, 2023Updated 2 years ago
- Zero Bubble Pipeline Parallelism☆452May 7, 2025Updated 11 months ago
- ☆10Jan 3, 2024Updated 2 years ago
- BUAA Compiler Course Project 2023 by Toby Shi.☆13Aug 20, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 4 months ago
- A Script oriented compiler☆27Feb 13, 2017Updated 9 years ago
- [译] 面向数据科学的概率论☆16Jun 19, 2018Updated 7 years ago
- The repository of Identifying and Mitigating Position Bias.☆71Jun 13, 2025Updated 10 months ago
- Python package for compressing floating-point PyTorch tensors☆13Jul 22, 2024Updated last year
- ☆10Dec 21, 2024Updated last year
- Official completion of “Training on the Benchmark Is Not All You Need”.☆40Dec 31, 2024Updated last year
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆42Updated this week
- 使用自然语言绘制流程图,基于OpenAI☆12Nov 13, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Nov 24, 2024Updated last year
- hacking on the stanford natural language inference (SNLI) corpus (in theano)☆15Aug 28, 2016Updated 9 years ago
- Jupyter notebook with examples on how to visualize the dataset of personal texts 📱, after extracting from an iPhone with PhoneView.☆12Aug 16, 2020Updated 5 years ago
- ☆14May 13, 2024Updated last year
- JAX implementation of LLaMA, aiming to train LLaMA on Google Cloud TPU☆14Jul 22, 2023Updated 2 years ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆18Apr 1, 2025Updated last year
- ☆12Sep 11, 2022Updated 3 years ago
- 另一个浙大健康打卡定时任务☆13May 8, 2022Updated 3 years ago
- Quantize transformers to any learned arbitrary 4-bit numeric format☆52Apr 8, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆71Jul 11, 2024Updated last year
- Efficient Exploration through Bayesian Deep-Q Networks.☆18Mar 22, 2022Updated 4 years ago
- This program looks up the etymologies of words in a text file and color-codes the words according to their origin. It allows a writer to …☆14Jul 26, 2016Updated 9 years ago
- Generative AI for Semiconductor Design: Engineering Assistant built with Bedrock, Knowledge Bases and Langchain☆28Jun 14, 2024Updated last year
- Find the origin of words in every language using a Deep Neural Network trained to create an etymological map.☆22May 18, 2018Updated 7 years ago
- Elegant presentation template in LaTex and Typst☆11Apr 20, 2025Updated 11 months ago
- ☆20Nov 14, 2022Updated 3 years ago