Library to facilitate pruning of LLMs based on context
☆32Jan 31, 2024Updated 2 years ago
Alternatives and similar repositories for contextual-pruning
Users that are interested in contextual-pruning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13May 21, 2023Updated 2 years ago
- ☆54Feb 12, 2025Updated last year
- VECMAN (Vector Manager) - A VQ-VAE based vector database for efficient text embeddings and retrieval. This package provides a memory-effi…☆23Jul 21, 2025Updated 8 months ago
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆21Jul 31, 2023Updated 2 years ago
- ☆18Sep 5, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Simulation, multi-path estimation, and CBR parsing code of SIGCOMM2023 BeamSense CBR-Sensing☆10Jan 14, 2024Updated 2 years ago
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆27Oct 31, 2022Updated 3 years ago
- ☆41Sep 13, 2025Updated 7 months ago
- [NeurIPS 2025] Bag of Tricks for Inference-time Computation of LLM Reasoning☆17Sep 20, 2025Updated 6 months ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆24Nov 29, 2025Updated 4 months ago
- Talk to your shell in natural language. Locally.☆54Feb 15, 2026Updated last month
- Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)☆12Apr 29, 2024Updated last year
- ☆17Apr 13, 2025Updated last year
- [SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale☆15Jul 1, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Stack-safe recursion schemes on dissectible data structures.☆13May 6, 2022Updated 3 years ago
- This is a repository where I show how to use Mistral 7B☆10Oct 26, 2023Updated 2 years ago
- A collection of papers on LLM applications in the IoT field.☆18Jan 21, 2026Updated 2 months ago
- ☆12Nov 2, 2025Updated 5 months ago
- ☆17Oct 19, 2023Updated 2 years ago
- Provides a Purescript wrapper around react-testing-library to be used with purescript-react-basic-hooks☆15Jul 18, 2023Updated 2 years ago
- ☆13Feb 18, 2024Updated 2 years ago
- ☆15Sep 28, 2022Updated 3 years ago
- scratch to play with autogen agents☆14Oct 10, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- PureScript version management in PureScript.☆14Jan 27, 2023Updated 3 years ago
- 📀 You finally scored a record deal.☆11Apr 11, 2023Updated 3 years ago
- A set of useful cryptographic utilities for blockchain development.☆12Sep 15, 2022Updated 3 years ago
- A notebook interface that makes working with AI agents easier.☆15Jul 12, 2025Updated 9 months ago
- Code for the EMNLP 2022 Findings short paper "SAT: Improving Semi-Supervised Text Classification with Simple Instance-Adaptive Self-Train…☆13Feb 25, 2023Updated 3 years ago
- Types for the least and greatest fixed points of functors.☆15Apr 27, 2022Updated 3 years ago
- codebase for "MELTing Point: Mobile Evaluation of Language Transformers"☆18Jul 19, 2024Updated last year
- Homology reduced UniProt, train-/valid-/testsets for language modeling☆16Apr 20, 2022Updated 3 years ago
- AI town https://github.com/a16z-infra/ai-town Patches to run on Hugging Face Spaces☆21Jun 6, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 机器学习实战:一、用朴素贝叶斯分类器推测11篇存在争议的《联邦党人文集》的作者 二、使用KNN、SVM、逻辑回归、K-Means方法对高斯分布分类 三、实现协同滤波算法进行电影推荐系统☆14Feb 1, 2023Updated 3 years ago
- A wrapper for Node's Stream API☆19Jan 26, 2025Updated last year
- A unidirectional value-based JSON codec library.☆15Oct 9, 2023Updated 2 years ago
- Routing management for Halogen☆15May 6, 2022Updated 3 years ago
- Rust crate for generating Markdown files☆12May 5, 2022Updated 3 years ago
- ☆13Dec 10, 2022Updated 3 years ago
- Likelihood Ratio Attack (LiRA) in PyTorch☆17Mar 3, 2025Updated last year