v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19 · Updated 5 months ago
Related projects
Alternatives and complementary repositories for mergekit
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 · Updated 4 months ago
- ☆20 · Updated 9 months ago
- Build Agentic workflows with function calling ☆20 · Updated this week
- ☆41 · Updated last month
- ☆24 · Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆46 · Updated this week
- ☆40 · Updated 2 weeks ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆68 · Updated last month
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆40 · Updated 8 months ago
- LLM reads a paper and produces a working prototype ☆36 · Updated last week
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 · Updated 8 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆37 · Updated 7 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆54 · Updated 7 months ago
- ☆59 · Updated last month
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆59 · Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 · Updated 8 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0 ☆16 · Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ☆50 · Updated this week
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆36 · Updated last month
- Training hybrid models for dummies. ☆15 · Updated 3 weeks ago
- ☆75 · Updated 5 months ago
- Using multiple LLMs for ensemble Forecasting ☆16 · Updated 10 months ago
- Set of scripts to finetune LLMs ☆36 · Updated 7 months ago
- ☆13 · Updated 10 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆23 · Updated last week
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 · Updated 2 weeks ago
- Uses a Gradio interface to stream coding-related responses from local and cloud-based large language models. Pulls context from GitHub Re… ☆15 · Updated 2 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆33 · Updated 8 months ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models ☆22 · Updated 9 months ago
- BH hackathon ☆14 · Updated 7 months ago