ysyisyourbrother / Galaxy-LMLinks

Work in progress LLM framework.

☆14

Alternatives and similar repositories for Galaxy-LM

Users that are interested in Galaxy-LM are comparing it to the libraries listed below

Sorting:

xumengwei / Edge-AI-Paper-List
☆209Updated last year
wenh18 / AdaptiveNet
☆16Updated 2 years ago
UbiquitousLearning / Paper-list-resource-efficient-large-language-model
☆100Updated last year
ysyisyourbrother / awesome-on-device-AI
A curated list of awesome projects and papers for AI on Mobile/IoT/Edge devices. Everything is continuously updating. Welcome contributio…
☆45Updated 2 years ago
artpad6 / gemel_nsdi23
☆22Updated last year
yuanmu97 / InFi
InFi is a library for building input filters for resource-efficient inference.
☆39Updated last year
UbiquitousLearning / Efficient_Foundation_Model_Survey
Survey Paper List - Efficient LLM and Foundation Models
☆258Updated last year
Kyrie-Zhao / awesome-real-time-AI
This is a list of awesome edgeAI inference related papers.
☆98Updated last year
wenh18 / AdaptiveNet_artifact
☆15Updated 2 years ago
James-QiuHaoran / LLM-serving-with-proxy-models
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an …
☆46Updated last year
MoE-Inf / awesome-moe-inference
Curated collection of papers in MoE model inference
☆285Updated last month
usc-isi / PipeEdge
PipeEdge: Pipeline Parallelism for Large-Scale Model Inference on Heterogeneous Edge Devices
☆35Updated last year
duowuyms / NetLLM
NetLLM: Adapting Large Language Models for Networking (SIGCOMM 2024) - Official Repository
☆166Updated 10 months ago
NetX-lab / Ayo
[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo
☆45Updated 2 months ago
UbiquitousLearning / MobileFM
One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…
☆29Updated last year
csu-eis / CoDL
☆78Updated 2 years ago
vuhpdc / jellyfish
Source code for Jellyfish, a soft real-time inference serving system
☆14Updated 2 years ago
inpluslab-wuhui / Systems-for-Foundation-Models
☆19Updated 5 months ago
S-Lab-System-Group / Awesome-DL-Scheduling-Papers
☆313Updated last year
DicardoX / Research-Space
This repository is established to store personal notes and annotated papers during daily research.
☆155Updated 2 weeks ago
SophiaLi06 / BytePS_THC
THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression
☆19Updated last year
Thesys-lab / Helix-ASPLOS25
Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"
☆67Updated this week
S-Lab-System-Group / ChronusArtifact
☆23Updated 3 years ago
jeho-lee / Awesome-On-Device-AI-Systems
☆87Updated last week
SymbioticLab / ModelKeeper
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
☆35Updated 2 years ago
S-Lab-System-Group / HeliosArtifact
HeliosArtifact
☆21Updated 3 years ago
edge-video-services / ekya
Source code and datasets for Ekya, a system for continuous learning on the edge.
☆109Updated 3 years ago
tonyzhao-jt / LLM-PQ
Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …
☆34Updated last month
mental2008 / awesome-papers
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and o…
☆126Updated this week
PKUFlyingPig / MIT6.5940_TinyML
Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing
☆60Updated 9 months ago