chandar-lab / EfficientLLMs
☆12Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for EfficientLLMs
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- The Efficiency Spectrum of LLM☆52Updated 11 months ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Scaling Sparse Fine-Tuning to Large Language Models☆17Updated 9 months ago
- Here we will test various linear attention designs.☆56Updated 6 months ago
- ☆25Updated 11 months ago
- ☆47Updated 9 months ago
- ☆18Updated 3 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆44Updated last year
- ☆35Updated 9 months ago
- ☆30Updated this week
- ☆18Updated 5 months ago
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆43Updated 4 months ago
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆22Updated last month
- A Closer Look into Mixture-of-Experts in Large Language Models☆40Updated 3 months ago
- ☆21Updated 8 months ago
- [EMNLP 2023]Context Compression for Auto-regressive Transformers with Sentinel Tokens☆21Updated last year
- Using FlexAttention to compute attention with different masking patterns☆40Updated 2 months ago
- ☆17Updated last year
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Updated last year
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆69Updated last month
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆30Updated 6 months ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆53Updated last month
- PyTorch building blocks for OLMo☆18Updated this week
- This repo is based on https://github.com/jiaweizzhao/GaLore☆19Updated 2 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆24Updated 7 months ago
- Long Context Extension and Generalization in LLMs☆39Updated 2 months ago
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆38Updated this week
- Repository for Skill Set Optimization☆12Updated 3 months ago