brave-experiments / MELT-public
Codebase for "MELTing Point: Mobile Evaluation of Language Transformers"
☆18 · Updated last year
Alternatives and similar repositories for MELT-public
Users who are interested in MELT-public are comparing it to the repositories listed below.
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c… ☆28 · Updated last year
- Survey Paper List - Efficient LLM and Foundation Models ☆255 · Updated 11 months ago
- (HotMobile'24) Salted Inference: Enhancing Privacy while Maintaining Efficiency of Split Inference in Mobile Computing ☆16 · Updated last year
- This is a list of awesome edgeAI inference related papers. ☆97 · Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge. ☆107 · Updated 3 years ago
- Measure and optimize the energy consumption of your AI applications! ☆290 · Updated 3 weeks ago
- ☆78 · Updated last week
- ☆100 · Updated last year
- Enhancing Efficiency in Multidevice Federated Learning through Data Selection ☆13 · Updated last year
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs ☆113 · Updated 2 months ago
- The official implementation of TinyTrain [ICML '24] ☆22 · Updated last year
- ☆207 · Updated last year
- How much energy do GenAI models consume? ☆47 · Updated 3 months ago
- Efficient LLM Inference Acceleration using Prompting ☆50 · Updated 10 months ago
- Awesome Mobile LLMs ☆241 · Updated last month
- [ACM MobiCom '23] Cost-effective On-device Continual Learning over Memory Hierarchy with Miro ☆16 · Updated last year
- ☆25 · Updated last year
- EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs). ☆67 · Updated last year
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation ☆26 · Updated 4 years ago
- A curated list of early exiting (LLM, CV, NLP, etc) ☆58 · Updated last year
- Compression for Foundation Models ☆35 · Updated last month
- Libraries for efficient and scalable group-structured dataset pipelines. ☆25 · Updated 2 months ago
- [DMLR 2024] FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things ☆58 · Updated last year
- ☆56 · Updated 9 months ago
- Code repo for the paper "SpinQuant: LLM quantization with learned rotations" ☆318 · Updated 6 months ago
- Federated Learning Systems Paper List ☆75 · Updated last year
- Compressing Large Language Models using Low Precision and Low Rank Decomposition ☆97 · Updated 9 months ago
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and … ☆34 · Updated last week
- ☆123 · Updated 10 months ago
- [NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models ☆177 · Updated 8 months ago