v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19Updated 10 months ago
Alternatives and similar repositories for mergekit:
Users that are interested in mergekit are comparing it to the libraries listed below
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- ☆20Updated last year
- Build Agentic workflows with function calling using open LLMs☆26Updated 3 weeks ago
- ☆19Updated 8 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆16Updated last year
- ☆24Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆39Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- ☆21Updated 2 months ago
- ☆28Updated 5 months ago
- ☆45Updated 7 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 4 months ago
- Analysis on the cost of encoder based models☆11Updated 2 months ago
- ☆14Updated last year
- ☆43Updated 2 months ago
- ☆18Updated 7 months ago
- ☆47Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 6 months ago
- BH hackathon☆14Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 9 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated 2 weeks ago
- Set of scripts to finetune LLMs☆37Updated last year
- Table detection with Florence.☆13Updated 9 months ago
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆21Updated 6 months ago
- QLoRA for Masked Language Modeling☆22Updated last year
- ☆48Updated 5 months ago
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated 3 weeks ago