Adam-Mazur / Lazy-LlamaView external linksLinks
An implementation of LazyLLM token pruning for LLaMa 2 model family.
☆13Jan 6, 2025Updated last year
Alternatives and similar repositories for Lazy-Llama
Users that are interested in Lazy-Llama are comparing it to the libraries listed below
Sorting:
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024☆22Jun 26, 2024Updated last year
- ☆32Jun 5, 2025Updated 8 months ago
- Resources for our ACL 2023 paper: Distilling Script Knowledge from Large Language Models for Constrained Language Planning☆36Aug 19, 2023Updated 2 years ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆41Oct 11, 2024Updated last year
- Create string diagrams with LaTeX!☆14Jan 3, 2025Updated last year
- Implement some method of LLM KV Cache Sparsity☆41Jun 6, 2024Updated last year
- ☆11Aug 20, 2025Updated 5 months ago
- Repository for Screen2AX paper☆17Aug 6, 2025Updated 6 months ago
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆42Mar 13, 2023Updated 2 years ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 9 months ago
- The officalimplement of dLLM-Factory☆25Jul 12, 2025Updated 7 months ago
- The official implementation of Bi-Mamba☆14Oct 22, 2025Updated 3 months ago
- ☆13May 21, 2023Updated 2 years ago
- Repository for paper Decrypting Cryptic Crosswords☆10Jan 15, 2022Updated 4 years ago
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- ☆14Jan 24, 2025Updated last year
- The Compositionality article class.☆13Jun 12, 2025Updated 8 months ago
- ☆12Jul 25, 2023Updated 2 years ago
- BAD: BiAs Detection for Large Language Models in the context of candidate screening (EECS 692)☆12Feb 14, 2024Updated 2 years ago
- EMNLP 2022: Analyzing and Evaluating Faithfulness in Dialogue Summarization☆13Mar 20, 2025Updated 10 months ago
- ☆10Jun 19, 2024Updated last year
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Jan 2, 2024Updated 2 years ago
- Enhancing Sentence Embedding with Generalized Pooling☆11Jul 26, 2018Updated 7 years ago
- My implementation of Symbolic Transfer Entropy (STE): a measure of asymmetric information flow between stochastic processes.☆10Jul 9, 2019Updated 6 years ago
- Official implementation of "How Important is Importance Sampling for Deep Budgeted Training?"☆11Oct 18, 2022Updated 3 years ago
- Diffusing States and Matching Scores: A New Framework for Imitation Learning☆21Nov 16, 2024Updated last year
- GoldFinch and other hybrid transformer components☆45Jul 20, 2024Updated last year
- This repo contains the source code for: Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs☆44Aug 14, 2024Updated last year
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation☆61Feb 10, 2026Updated last week
- [KDD'22] Learned Token Pruning for Transformers☆102Feb 27, 2023Updated 2 years ago
- ☆54Oct 29, 2024Updated last year
- ☆11Feb 12, 2024Updated 2 years ago
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness☆14Nov 10, 2025Updated 3 months ago
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆12Dec 13, 2023Updated 2 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)☆13Jun 2, 2021Updated 4 years ago
- LCA-on-the-line (ICML 2024 Oral)☆13Feb 13, 2025Updated last year
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆12Feb 27, 2024Updated last year
- ☆11May 18, 2025Updated 8 months ago
- ☆12Aug 22, 2023Updated 2 years ago