v-prgmr / mergekitLinks
Tools for merging pretrained large language models.
☆19Updated 11 months ago
Alternatives and similar repositories for mergekit
Users that are interested in mergekit are comparing it to the libraries listed below
Sorting:
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- Build Agentic workflows with function calling using open LLMs☆26Updated 3 weeks ago
- ☆20Updated last year
- ☆18Updated 8 months ago
- The collection of bulding blocks building fine-tunable metric learning models☆32Updated last month
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆21Updated 3 months ago
- ☆23Updated last year
- ☆43Updated 3 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 3 weeks ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆40Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆14Updated last year
- Simple GRPO scripts and configurations.☆58Updated 3 months ago
- ☆45Updated 8 months ago
- 🤝 Trade any tensors over the network☆30Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆55Updated 2 weeks ago
- ☆49Updated 6 months ago
- ☆47Updated last year
- ☆28Updated 2 years ago
- ☆19Updated 9 months ago
- ☆29Updated 6 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆61Updated last year
- QLoRA for Masked Language Modeling☆22Updated last year
- Analysis on the cost of encoder based models☆11Updated 3 months ago
- ☆77Updated 11 months ago
- Writing Blog Posts with Generative Feedback Loops!☆48Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 6 months ago
- BH hackathon☆13Updated last year