A low-cost, high-performance deep learning training framework that enables efficient 100B-scale model fine-tuning on a commodity server with a consumer- grade GPU and limited main memory capacity [ICDE 25]
☆23Mar 21, 2025Updated last year
Alternatives and similar repositories for LoHan
Users that are interested in LoHan are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆15Dec 9, 2024Updated last year
- CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]☆19Mar 3, 2025Updated last year
- Cost-efficient Out-of-core GNN Training System on TB-scale Graph [ICDE 25]☆24Jan 6, 2025Updated last year
- GPU-initiated Large-scale GNN System [ATC 23]☆19Oct 30, 2024Updated last year
- An End-to-End Benchmarking Framework for Retrieval-Augmented Generation Systems☆29Mar 13, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An awesome language and its compiler.☆35Jun 12, 2022Updated 4 years ago
- ☆50May 20, 2025Updated last year
- Artifacts of EVT ASPLOS'24☆30Mar 6, 2024Updated 2 years ago
- A web-based platform that provides live streaming of classroom sessions at Zhejiang University.☆17Jan 3, 2026Updated 5 months ago
- ZJU mirror front-end☆36Apr 24, 2026Updated 2 months ago
- Shuhai is a benchmarking-memory tool that allows FPGA programmers to demystify all the underlying details of memories, e.g., HBM and DDR4…☆118Jun 15, 2025Updated last year
- FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs [ATC 22]☆144Aug 17, 2023Updated 2 years ago
- C-like language compiler, the final project of ZJU Compiler Principle course☆42Oct 9, 2022Updated 3 years ago
- Automated Design of Agentic Systems☆10Sep 7, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Aug 9, 2021Updated 4 years ago
- Centaur, a framework for hybrid CPU-FPGA databases☆28May 2, 2017Updated 9 years ago
- Does all kind of cool stuff to make analyzing meta classes easier. Now featuring WRedLogger.py, the previous backend of NetDbg☆10Jun 7, 2023Updated 3 years ago
- Hill Space is All You Need☆17Jul 11, 2025Updated 11 months ago
- Implemented Darius IP (originally target PYNQ) of convolution and maxpool on Xilinx FPGA with SDK☆16Dec 2, 2018Updated 7 years ago
- ☆13Jun 10, 2026Updated 2 weeks ago
- linux kernel modules examples☆15Nov 18, 2019Updated 6 years ago
- Legacy Code of ZJU Campus App for iOS☆11Jan 31, 2024Updated 2 years ago
- GoldFinch and other hybrid transformer components☆15Dec 9, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆13Aug 13, 2024Updated last year
- Running LLMs against a sandbox airport to see if they can make the correct decisions in real time☆29Jul 22, 2025Updated 11 months ago
- This project is a implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-…☆14Dec 12, 2023Updated 2 years ago
- ☆15Jan 4, 2026Updated 5 months ago
- Zig regex experiment☆13Nov 6, 2025Updated 7 months ago
- KANs and MLPs☆12Jun 7, 2024Updated 2 years ago
- Reproduction study of Grassmann Flows for sequence modeling (arXiv 2512.19428). Shows 22.6% gap vs claimed 10-15%, includes CUDA kernels …☆30Dec 26, 2025Updated 6 months ago
- a simple DBMS for DB course in ZJU with go☆12Aug 21, 2022Updated 3 years ago
- ☆16Mar 22, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- In-kernel RDMA library☆13Nov 7, 2023Updated 2 years ago
- ☆18Dec 2, 2024Updated last year
- ☆12Jun 5, 2024Updated 2 years ago
- [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization☆24Apr 13, 2026Updated 2 months ago
- ☆10Apr 19, 2014Updated 12 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆19Feb 9, 2026Updated 4 months ago
- ☆15Apr 20, 2022Updated 4 years ago