tval2 / contextual-pruning
Library to facilitate pruning of LLMs based on context
☆31Updated 11 months ago
Alternatives and similar repositories for contextual-pruning:
Users that are interested in contextual-pruning are comparing it to the libraries listed below
- Code for TrackTheMind☆67Updated last month
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆30Updated 3 months ago
- ☆35Updated last year
- Data preparation code for CrystalCoder 7B LLM☆43Updated 8 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆40Updated 7 months ago
- ☆46Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 4 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- The first dense retrieval model that can be prompted like an LM☆65Updated 4 months ago
- This is the official repository for Inheritune.☆109Updated 3 months ago
- ☆20Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 10 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆80Updated 10 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 2 months ago
- ☆47Updated 4 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 10 months ago
- ☆24Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆74Updated 2 months ago
- ☆79Updated last week
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 11 months ago
- Evaluating LLMs with CommonGen-Lite☆87Updated 9 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Set of scripts to finetune LLMs☆36Updated 9 months ago
- ☆108Updated 3 months ago
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆27Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated 10 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- ☆89Updated this week
- A repository for research on medium sized language models.☆76Updated 7 months ago