sanowl / Drag-and-Drop-LLMs-Zero-Shot-Prompt-to-Weights
☆29 Updated 2 months ago
Alternatives and similar repositories for Drag-and-Drop-LLMs-Zero-Shot-Prompt-to-Weights
Users who are interested in Drag-and-Drop-LLMs-Zero-Shot-Prompt-to-Weights are comparing it to the libraries listed below.
- Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3) ☆99 Updated this week
- Easy to use, high-performance knowledge distillation for LLMs ☆92 Updated 3 months ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep… ☆59 Updated 10 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM