casys-kaist / DaCapoLinks
☆19Updated 11 months ago
Alternatives and similar repositories for DaCapo
Users that are interested in DaCapo are comparing it to the libraries listed below
Sorting:
- ☆73Updated 5 months ago
 - ☆54Updated 11 months ago
 - ☆103Updated 2 years ago
 - [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆11Updated 2 years ago
 - [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆120Updated 3 months ago
 - Study parallel programming - CUDA, OpenMP, MPI, Pthread☆60Updated 3 years ago
 - NEST Compiler☆118Updated 8 months ago
 - ☆91Updated last year
 - LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆146Updated 3 months ago
 - Study Group of Deep Learning Compiler☆165Updated 2 years ago
 - A performance library for machine learning applications.☆184Updated 2 years ago
 - Experimental deep learning framework written in Rust☆15Updated 3 years ago
 - PiKV: KV Cache Management System for Mixture of Experts [Efficient ML System]☆41Updated 2 weeks ago
 - A version of XRBench-MAESTRO used for MLSys 2023 publication☆25Updated 2 years ago
 - QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference☆118Updated last year
 - ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆110Updated last year
 - Neural Network Acceleration such as ASIC, FPGA, GPU, and PIM☆54Updated 5 years ago
 - Official Github repository for the SIGCOMM '24 paper "Accelerating Model Training in Multi-cluster Environments with Consumer-grade GPUs"☆72Updated last year
 - OwLite is a low-code AI model compression toolkit for AI models.☆50Updated 5 months ago
 - Open source version of ArchGym project.☆121Updated 6 months ago
 - Official implementation for Training LLMs with MXFP4☆101Updated 6 months ago
 - ☆27Updated last year
 - [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System☆49Updated 3 months ago
 - FriendliAI Model Hub☆91Updated 3 years ago
 - Neural Network Acceleration using CPU/GPU, ASIC, FPGA☆63Updated 5 years ago
 - 삼각형의 실전! Triton☆16Updated last year
 - ☆79Updated last year
 - Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.☆49Updated 3 months ago
 - Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆13Updated 10 months ago
 - ☆48Updated last year