tval2 / contextual-pruning
Library to facilitate pruning of LLMs based on context
☆31Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for contextual-pruning
- ☆38Updated this week
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆19Updated 9 months ago
- Evaluating LLMs with CommonGen-Lite☆84Updated 7 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated this week
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆78Updated 8 months ago
- Collection of autoregressive model implementation☆66Updated this week
- ☆24Updated last year
- ☆91Updated last month
- ☆74Updated last week
- This is the official repository for Inheritune.☆105Updated last month
- ☆73Updated 10 months ago
- The first dense retrieval model that can be prompted like an LM☆62Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 3 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆30Updated last month
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- ☆68Updated 2 months ago
- ☆62Updated last month
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- ☆20Updated last year
- ☆61Updated 2 months ago
- ☆46Updated 9 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated 10 months ago
- ☆111Updated last month
- Using multiple LLMs for ensemble Forecasting☆16Updated 9 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆102Updated 6 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆61Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆111Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 9 months ago