A low-cost, high-performance deep learning training framework that enables efficient 100B-scale model fine-tuning on a commodity server with a consumer- grade GPU and limited main memory capacity [ICDE 25]
☆23Mar 21, 2025Updated last year
Alternatives and similar repositories for LoHan
Users that are interested in LoHan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]☆19Mar 3, 2025Updated last year
- Cost-efficient Out-of-core GNN Training System on TB-scale Graph [ICDE 25]☆22Jan 6, 2025Updated last year
- An End-to-End Benchmarking Framework for Retrieval-Augmented Generation Systems☆29Mar 13, 2026Updated 2 months ago
- [FPGA 2020] A systematic framework for optimizing OpenCL applications on FPGAs☆20Apr 9, 2023Updated 3 years ago
- An awesome language and its compiler.☆35Jun 12, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Reduction Server in Rust☆14Apr 9, 2024Updated 2 years ago
- Artifacts of EVT ASPLOS'24☆30Mar 6, 2024Updated 2 years ago
- A web-based platform that provides live streaming of classroom sessions at Zhejiang University.☆17Jan 3, 2026Updated 5 months ago
- Shuhai is a benchmarking-memory tool that allows FPGA programmers to demystify all the underlying details of memories, e.g., HBM and DDR4…☆118Jun 15, 2025Updated 11 months ago
- ☆14Nov 7, 2025Updated 7 months ago
- An open-source simulator framework for neural processing units☆40Mar 23, 2026Updated 2 months ago
- FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs [ATC 22]☆144Aug 17, 2023Updated 2 years ago
- ☆10Mar 10, 2024Updated 2 years ago
- ☆10Apr 10, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- C-like language compiler, the final project of ZJU Compiler Principle course☆43Oct 9, 2022Updated 3 years ago
- ☆15Aug 18, 2022Updated 3 years ago
- Automated Design of Agentic Systems☆10Sep 7, 2024Updated last year
- ☆11Aug 9, 2021Updated 4 years ago
- Does all kind of cool stuff to make analyzing meta classes easier. Now featuring WRedLogger.py, the previous backend of NetDbg☆10Jun 7, 2023Updated 3 years ago
- Hill Space is All You Need☆17Jul 11, 2025Updated 10 months ago
- A gitbook named studying-containerd-notes☆10Dec 17, 2018Updated 7 years ago
- Implemented Darius IP (originally target PYNQ) of convolution and maxpool on Xilinx FPGA with SDK☆16Dec 2, 2018Updated 7 years ago
- ☆13May 11, 2026Updated 3 weeks ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- linux kernel modules examples☆15Nov 18, 2019Updated 6 years ago
- Legacy Code of ZJU Campus App for iOS☆11Jan 31, 2024Updated 2 years ago
- GoldFinch and other hybrid transformer components☆13Dec 9, 2025Updated 6 months ago
- ☆13Aug 13, 2024Updated last year
- FPGA-based stochastic gradient descent (powered by ZipML - Low-precision machine learning on reconfigurable hardware)☆33Feb 10, 2020Updated 6 years ago
- This project is a implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-…☆14Dec 12, 2023Updated 2 years ago
- ☆15Jan 4, 2026Updated 5 months ago
- Zig regex experiment☆13Nov 6, 2025Updated 7 months ago
- KANs and MLPs☆12Jun 7, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Reproduction study of Grassmann Flows for sequence modeling (arXiv 2512.19428). Shows 22.6% gap vs claimed 10-15%, includes CUDA kernels …☆30Dec 26, 2025Updated 5 months ago
- In-kernel RDMA library☆13Nov 7, 2023Updated 2 years ago
- ☆18Dec 2, 2024Updated last year
- ☆12Jun 5, 2024Updated 2 years ago
- ☆36Jun 10, 2024Updated last year
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆23Apr 13, 2026Updated last month
- ☆13Apr 17, 2024Updated 2 years ago