cbh123 / llmboxing
LLM boxing matches
☆56Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for llmboxing
- QLoRA: Efficient Finetuning of Quantized LLMs☆77Updated 7 months ago
- Data preparation code for Amber 7B LLM☆82Updated 6 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆111Updated last year
- Full finetuning of large language models without large memory requirements☆93Updated 10 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆101Updated 3 months ago
- A pipeline for LLM knowledge distillation☆77Updated 3 months ago
- Set of scripts to finetune LLMs☆36Updated 7 months ago
- ☆73Updated 10 months ago
- ☆64Updated 5 months ago
- ☆40Updated last week
- ☆49Updated 7 months ago
- ☆20Updated last year
- Just a bunch of benchmark logs for different LLMs☆114Updated 3 months ago
- Track the progress of LLM context utilisation☆53Updated 3 months ago
- ☆72Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆76Updated 7 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆22Updated 8 months ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆38Updated 3 weeks ago
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- Unofficial Implementation of Evolutionary Model Merging☆33Updated 7 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- Score LLM pretraining data with classifiers☆55Updated last year
- ☆92Updated last month
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- ☆104Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- Code repository for the c-BTM paper☆105Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆201Updated 6 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆42Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆129Updated last month