A family of efficient edge language models in 100M~1B sizes.
☆19Feb 14, 2025Updated last year
Alternatives and similar repositories for EfficientLLM
Users that are interested in EfficientLLM are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆30Dec 6, 2023Updated 2 years ago
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs☆42Feb 27, 2026Updated last week
- ☆14Jan 23, 2026Updated last month
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆30Jan 27, 2026Updated last month
- Directed masked autoencoders☆14Feb 20, 2026Updated 2 weeks ago
- Automatic stabilizing and auto-piloting system for RC flying wing☆14Mar 3, 2016Updated 10 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- [NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models☆187Jan 1, 2025Updated last year
- Quantization of LLMs and benchmarking.☆10Apr 3, 2024Updated last year
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆25May 31, 2025Updated 9 months ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- 2SSP: A Two-Stage Framework for Structured Pruning of LLMs☆20Aug 18, 2025Updated 6 months ago
- Python client to integrate Cleanlab Codex with your AI Agent☆19Nov 19, 2025Updated 3 months ago
- Self-Distribution BNN☆10Mar 8, 2022Updated 3 years ago
- Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".☆19Oct 30, 2024Updated last year
- Fast and memory-efficient exact attention☆18Updated this week
- Gallery for Industry AI demos☆18May 1, 2023Updated 2 years ago
- ☆17Mar 14, 2024Updated last year
- Code to implement the experiments in "Post-training Quantization for Neural Networks with Provable Guarantees" by Jinjie Zhang, Yixuan Zh…☆11Jun 2, 2023Updated 2 years ago
- Python scripts to help ACs with OpenReview☆11Feb 7, 2026Updated last month
- aloam☆11Nov 15, 2019Updated 6 years ago
- Multiple Generalized Additive Models implemented in Python (EBM, XGB, Spline, FLAM). Code for our KDD 2021 paper "How Interpretable and T…☆13Aug 15, 2021Updated 4 years ago
- ☆13Oct 13, 2025Updated 4 months ago
- Efficient 2:4 sparse training algorithms and implementations☆59Dec 8, 2024Updated last year
- ☆12Apr 27, 2024Updated last year
- A Minimum Working Example of the Dissertation Template for UW-Madison.☆13May 4, 2024Updated last year
- Source code of ICLR2020 submisstion: Zeno++: Robust Fully Asynchronous SGD☆14Feb 2, 2020Updated 6 years ago
- GoLU, a novel, self-gated and element-wise activation function that performs well over a diverse set of tasks☆24Oct 4, 2025Updated 5 months ago
- ☆18Oct 6, 2024Updated last year
- [ICLR'25] The first benchmark aiming to evaluate whether LMMs can assist oracle bone inscription processing tasks☆22Mar 21, 2025Updated 11 months ago
- ☆13Jul 20, 2021Updated 4 years ago
- ☆19Mar 28, 2022Updated 3 years ago
- ☆12Apr 1, 2017Updated 8 years ago
- Code which uses the Intel Realsense D435 camera for object detection along with estimation of distance of object.☆12Nov 2, 2018Updated 7 years ago
- pytorch implementation of mvp: a multi-stage vision-language pre-training framework☆11Apr 23, 2022Updated 3 years ago
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- ☆12Nov 22, 2022Updated 3 years ago
- d-Matrix DMX Compressor: A Pytorch toolkit for nn.Module transformations supporting advanced quantization, sparsity, and elementwise func…☆21Oct 22, 2025Updated 4 months ago
- Invariant Feature Regularization for Fair Face Recognition (ICCV'23)☆15Oct 23, 2023Updated 2 years ago