cognitivecomputations / spectrum
☆75 Updated 3 weeks ago
Related projects:
- Set of scripts to finetune LLMs ☆36 Updated 5 months ago
- Comprehensive analysis of the difference in performance of QLoRA, LoRA, and full finetunes. ☆81 Updated last year
- ☆64 Updated 3 months ago
- ☆109 Updated last month
- Low-rank adapter extraction for fine-tuned transformer models ☆154 Updated 4 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆73 Updated 6 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆217 Updated 2 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆158 Updated 2 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆175 Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs ☆112 Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 2 months ago
- ☆48 Updated 11 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆230 Updated 3 months ago
- A pipeline for LLM knowledge distillation ☆68 Updated last month
- Let's build better datasets, together! ☆195 Updated last month
- Small and Efficient Mathematical Reasoning LLMs ☆69 Updated 7 months ago
- ☆58 Updated 3 weeks ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆64 Updated 2 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆101 Updated last week
- ☆85 Updated 7 months ago
- ☆82 Updated 3 weeks ago
- ☆89 Updated 11 months ago
- Spherical Merge PyTorch/HF format Language Models with minimal feature loss. ☆107 Updated last year
- Experiments with inference on Llama ☆106 Updated 3 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆52 Updated 5 months ago
- QLoRA with Enhanced Multi GPU Support ☆35 Updated last year
- ☆48 Updated 6 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆112 Updated last week
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆192 Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆65 Updated 2 months ago