v-prgmr / mergekitLinks
Tools for merging pretrained large language models.
☆19Updated last year
Alternatives and similar repositories for mergekit
Users that are interested in mergekit are comparing it to the libraries listed below
Sorting:
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 11 months ago
- Build Agentic workflows with function calling using open LLMs☆28Updated 2 weeks ago
- ☆23Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated last year
- ☆20Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆47Updated 4 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆40Updated last year
- BH hackathon☆14Updated last year
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated 2 months ago
- 🤝 Trade any tensors over the network☆30Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated last month
- ☆20Updated 3 weeks ago
- ☆21Updated 4 months ago
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆21Updated 8 months ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 3 weeks ago
- Writing Blog Posts with Generative Feedback Loops!☆48Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- ☆14Updated last year
- ☆14Updated last year
- ☆46Updated 8 months ago
- PyTorch implementation for MRL☆18Updated last year
- ☆20Updated last year
- ☆19Updated 10 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆31Updated 9 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆30Updated 7 months ago
- QLoRA for Masked Language Modeling☆22Updated last year
- Simple GRPO scripts and configurations.☆58Updated 4 months ago