A low-cost, high-performance deep learning training framework that enables efficient 100B-scale model fine-tuning on a commodity server with a consumer- grade GPU and limited main memory capacity [ICDE 25]
☆23Mar 21, 2025Updated last year
Alternatives and similar repositories for LoHan
Users that are interested in LoHan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆14Dec 9, 2024Updated last year
- CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]☆18Mar 3, 2025Updated last year
- Cost-efficient Out-of-core GNN Training System on TB-scale Graph [ICDE 25]☆22Jan 6, 2025Updated last year
- Demystifying Datapath Accelerator Enhanced Off-path SmartNIC [ICNP24]☆58Dec 5, 2024Updated last year
- [FPGA 2020] A systematic framework for optimizing OpenCL applications on FPGAs☆19Apr 9, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆49May 20, 2025Updated 11 months ago
- Reduction Server in Rust☆14Apr 9, 2024Updated 2 years ago
- Artifacts of EVT ASPLOS'24☆30Mar 6, 2024Updated 2 years ago
- A web-based platform that provides live streaming of classroom sessions at Zhejiang University.☆17Jan 3, 2026Updated 3 months ago
- An open-source simulator framework for neural processing units☆36Mar 23, 2026Updated last month
- ☆14Nov 7, 2025Updated 5 months ago
- FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs [ATC 22]☆143Aug 17, 2023Updated 2 years ago
- ☆11Apr 10, 2024Updated 2 years ago
- C-like language compiler, the final project of ZJU Compiler Principle course☆43Oct 9, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- MindSpore Observability☆19May 29, 2020Updated 5 years ago
- Automated Design of Agentic Systems☆10Sep 7, 2024Updated last year
- Does all kind of cool stuff to make analyzing meta classes easier. Now featuring WRedLogger.py, the previous backend of NetDbg☆10Jun 7, 2023Updated 2 years ago
- Hill Space is All You Need☆17Jul 11, 2025Updated 9 months ago
- A gitbook named studying-containerd-notes☆10Dec 17, 2018Updated 7 years ago
- ☆13Apr 13, 2026Updated 2 weeks ago
- Legacy Code of ZJU Campus App for iOS☆11Jan 31, 2024Updated 2 years ago
- linux kernel modules examples☆15Nov 18, 2019Updated 6 years ago
- GoldFinch and other hybrid transformer components☆13Dec 9, 2025Updated 4 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This project is a implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-…☆14Dec 12, 2023Updated 2 years ago
- FPGA-based stochastic gradient descent (powered by ZipML - Low-precision machine learning on reconfigurable hardware)☆33Feb 10, 2020Updated 6 years ago
- ☆15Jan 4, 2026Updated 3 months ago
- Zig regex experiment☆13Nov 6, 2025Updated 5 months ago
- KANs and MLPs☆12Jun 7, 2024Updated last year
- ☆16Mar 22, 2025Updated last year
- ☆18Dec 2, 2024Updated last year
- ☆12Jun 5, 2024Updated last year
- ☆36Jun 10, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Apr 13, 2026Updated 2 weeks ago
- ☆13Apr 17, 2024Updated 2 years ago
- A python implementation of delta debugging tool.☆26Feb 9, 2024Updated 2 years ago
- ☆10Apr 19, 2014Updated 12 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 2 months ago
- Run GNS3 Server inside Docker☆11Oct 3, 2021Updated 4 years ago
- Shared library for intercepting CUDA Runtime API calls. This was part of my Bachelor thesis: A Study on the Computational Exploitation of…☆14Jun 6, 2024Updated last year