brave-experiments / MELT-public
codebase for "MELTing Point: Mobile Evaluation of Language Transformers"
☆18 Updated 9 months ago
Alternatives and similar repositories for MELT-public
Users who are interested in MELT-public are comparing it to the repositories listed below
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c… ☆27 Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge. ☆105 Updated 3 years ago
- (HotMobile'24) Salted Inference: Enhancing Privacy while Maintaining Efficiency of Split Inference in Mobile Computing ☆15 Updated last year
- Survey Paper List - Efficient LLM and Foundation Models ☆248 Updated 7 months ago
- ☆201 Updated last year
- ☆13 Updated last year
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation ☆28 Updated 6 months ago
- This is a list of awesome edgeAI inference related papers. ☆96 Updated last year
- An LLM inference engine, written in C++ ☆14 Updated 4 months ago
- ☆16 Updated last year
- Split Learning Simulation Framework for LLMs ☆20 Updated 8 months ago
- Federated Learning Systems Paper List ☆73 Updated last year
- Awesome Mobile LLMs ☆184 Updated last month
- How much energy do GenAI models consume? ☆42 Updated this week
- ☆99 Updated last year
- [NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models ☆164 Updated 4 months ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup ☆34 Updated 2 years ago
- [DMLR 2024] FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things ☆55 Updated 8 months ago
- Official Repo for "LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization" ☆31 Updated last year
- EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs). ☆62 Updated 11 months ago
- Videoconferencing research platform ☆65 Updated 6 months ago
- ☆49 Updated 5 months ago
- ☆45 Updated 10 months ago
- The official implementation of TinyTrain [ICML '24] ☆22 Updated 9 months ago
- Compression for Foundation Models ☆31 Updated last month
- ☆27 Updated 10 months ago
- A resilient distributed training framework ☆95 Updated last year
- Enhancing Efficiency in Multidevice Federated Learning through Data Selection ☆12 Updated last year
- Collections of paper reviews in SEELab, related to IoT/HD/ML etc. ☆30 Updated this week
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories ☆26 Updated 5 months ago