hao-ai-lab / Consistency_LLMLinks
[ICML 2024] CLLMs: Consistency Large Language Models
☆404Updated last year
Alternatives and similar repositories for Consistency_LLM
Users that are interested in Consistency_LLM are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads☆499Updated 9 months ago
- scalable and robust tree-based speculative decoding algorithm☆361Updated 9 months ago
- Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆175Updated last year
- Explorations into some recent techniques surrounding speculative decoding☆290Updated 10 months ago
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆249Updated 9 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…