brave-experiments / MELT-publicLinks
codebase for "MELTing Point: Mobile Evaluation of Language Transformers"
☆18Updated 11 months ago
Alternatives and similar repositories for MELT-public
Users that are interested in MELT-public are comparing it to the libraries listed below
Sorting:
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆28Updated last year
- Libraries for efficient and scalable group-structured dataset pipelines.☆26Updated last week
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆106Updated 3 years ago
- (HotMobile'24) Salted Inference: Enhancing Privacy while Maintaining Efficiency of Split Inference in Mobile Computing☆17Updated last year
- EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).☆64Updated last year
- Survey Paper List - Efficient LLM and Foundation Models☆248Updated 9 months ago
- Enhancing Efficiency in Multidevice Federated Learning through Data Selection☆14Updated last year
- ☆52Updated 2 weeks ago
- Compression for Foundation Models☆32Updated 3 months ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆15Updated last year
- The official implementation of TinyTrain [ICML '24]☆22Updated 11 months ago
- ☆13Updated last year
- ☆16Updated last year
- Code associated with the paper **Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees**.☆28Updated 2 years ago
- ☆202Updated last year
- ☆99Updated last year
- This is a list of awesome edgeAI inference related papers.☆95Updated last year
- Implementation for FedConv: A Learning-on-Model Paradigm for Heterogeneous Federated Clients☆20Updated 10 months ago
- Federated Learning Systems Paper List☆73Updated last year
- An LLM inference engine, written in C++☆15Updated last week
- ☆52Updated 6 months ago
- Miro[ACM MobiCom '23] Cost-effective On-device Continual Learning over Memory Hierarchy with Miro☆15Updated last year
- Federated Few-shot Learning for Mobile NLP. Conditionally accepted by MobiCom'23.☆16Updated last year
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆25Updated 4 years ago
- How much energy do GenAI models consume?☆44Updated last month
- Official Repo for "LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization"☆34Updated last year
- PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation☆30Updated 7 months ago
- Microsoft's open source max-min fair solver for cluster scheduling and traffic engineering☆12Updated 3 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆47Updated 3 months ago
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆108Updated 2 months ago