microsoft / LLMLinguaLinks
[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
☆5,201Updated 3 months ago
Alternatives and similar repositories for LLMLingua
Users that are interested in LLMLingua are comparing it to the libraries listed below
Sorting:
- structured outputs for llms☆10,876Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,550Updated last month
- Supercharge Your LLM Application Evaluations 🚀☆9,799Updated this week
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.☆4,848Updated this week
- Structured Outputs☆11,990Updated this week
- Knowledge Agents and Management in the Cloud☆4,035Updated this week
- Tools for merging pretrained large language models.☆5,937Updated 2 weeks ago
- A language for constraint-guided and efficient LLM programming.☆3,980Updated last month
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆11,759Updated last week
- A framework for prompt tuning using Intent-based Prompt Calibration☆2,644Updated 2 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆4,071Updated 10 months ago
- Superfast AI decision making and intelligent processing of multi-modal data.☆2,663Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,205Updated this week
- Harness LLMs with Multi-Agent Programming☆3,441Updated this week
- Large Language Model Text Generation Inference☆10,265Updated last week
- A blazing fast inference solution for text embeddings models☆3,758Updated this week
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,116Updated last year
- Adding guardrails to large language models.☆5,199Updated last month
- All things prompt engineering☆5,639Updated last year
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,788Updated last week
- A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain☆3,476Updated last year
- A unified evaluation framework for large language models☆2,656Updated last month
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings☆1,973Updated 5 months ago
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.☆17,080Updated this week
- PyTorch native post-training library☆5,296Updated this week
- The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.☆7,756Updated 11 months ago
- DSPy: The framework for programming—not prompting—language models☆26,016Updated this week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,609Updated 9 months ago
- An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents☆5,634Updated 9 months ago
- A code-first agent framework for seamlessly planning and executing data analytics tasks.☆5,788Updated last month