v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19Updated 10 months ago
Alternatives and similar repositories for mergekit:
Users that are interested in mergekit are comparing it to the libraries listed below
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- Build Agentic workflows with function calling using open LLMs☆26Updated this week
- ☆16Updated 6 months ago
- ☆24Updated last year
- Code for NeurIPS LLM Efficiency Challenge☆57Updated last year
- ☆28Updated 5 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 3 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 5 months ago
- ☆20Updated last year
- ☆45Updated 6 months ago
- ☆48Updated 5 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆59Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated this week
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- ☆19Updated 8 months ago
- ☆15Updated last year
- Analysis on the cost of encoder based models☆11Updated 2 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 8 months ago
- ☆14Updated 10 months ago
- ☆40Updated 2 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- ☆19Updated 6 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆30Updated 7 months ago
- ☆19Updated 2 months ago
- Universal text classifier for generative models☆23Updated 8 months ago
- ☆14Updated last year