brave-experiments / MELT-publicLinks
codebase for "MELTing Point: Mobile Evaluation of Language Transformers"
☆18Updated last year
Alternatives and similar repositories for MELT-public
Users that are interested in MELT-public are comparing it to the libraries listed below
Sorting:
- One-size-fits-all model for mobile AI, a novel paradigm for mobile AI in which the OS and hardware co-manage a foundation model that is c…☆29Updated last year
- Survey Paper List - Efficient LLM and Foundation Models☆258Updated last year
- EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).☆71Updated last year
- (HotMobile'24) Salted Inference: Enhancing Privacy while Maintaining Efficiency of Split Inference in Mobile Computing☆17Updated last year
- A canonical source of GenAI energy benchmark and meausrements☆50Updated 3 weeks ago
- Awesome Mobile LLMs☆282Updated 3 weeks ago
- ☆211Updated last year
- ☆101Updated 3 weeks ago
- ☆25Updated last year
- This is a list of awesome edgeAI inference related papers.☆97Updated 2 years ago
- Measure and optimize the energy consumption of your AI applications!☆320Updated 3 weeks ago
- Simulation framework for accelerating research in Private Federated Learning☆345Updated last month
- A curated list of early exiting (LLM, CV, NLP, etc)☆69Updated last year
- Compression for Foundation Models☆34Updated 5 months ago
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆27Updated 4 years ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆148Updated 2 years ago
- Compressing Large Language Models using Low Precision and Low Rank Decomposition☆106Updated 3 weeks ago
- Enhancing Efficiency in Multidevice Federated Learning through Data Selection☆13Updated last year
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆111Updated 3 years ago
- Efficient LLM Inference Acceleration using Prompting☆51Updated last year
- Federated Learning Systems Paper List☆75Updated last year
- ☆102Updated last year
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆349Updated 7 months ago
- Code for studying the super weight in LLM☆121Updated last year
- Libraries for efficient and scalable group-structured dataset pipelines.☆25Updated 6 months ago
- Official implementation for Training LLMs with MXFP4☆115Updated 7 months ago
- Prototyp MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆24Updated 8 months ago
- FL_PyTorch: Optimization Research Simulator for Federated Learning☆35Updated 2 years ago
- ☆63Updated last year
- LLM checkpointing for DeepSpeed/Megatron☆22Updated 3 weeks ago