center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving, and loading new heads for transformer models
☆236 Updated last week
Related projects:
- awesome synthetic (text) datasets ☆213 Updated this week
- Let's build better datasets, together! ☆195 Updated last month
- An Open Source Toolkit For LLM Distillation ☆284 Updated last month
- Manage scalable open LLM inference endpoints in Slurm clusters ☆217 Updated 2 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆217 Updated 6 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆295 Updated 3 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆230 Updated 3 months ago
- Automatically evaluate your LLMs in Google Colab ☆511 Updated 4 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines ☆195 Updated 4 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆252 Updated 2 months ago
- ☆75 Updated 3 weeks ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆158 Updated 2 months ago
- ☆126 Updated 2 months ago
- A compact LLM pretrained in 9 days by using high quality data ☆225 Updated 2 weeks ago
- ☆418 Updated 2 months ago
- A library for easily merging multiple LLM experts and efficiently training the merged LLM. ☆388 Updated 3 weeks ago
- ☆203 Updated 2 months ago
- Easily embed, cluster and semantically label text datasets ☆433 Updated 5 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆124 Updated this week
- Low-Rank adapter extraction for fine-tuned transformer models ☆154 Updated 4 months ago
- ☆109 Updated last month
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆101 Updated last week
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆101 Updated last week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆223 Updated 4 months ago
- Building a chatbot powered by a RAG pipeline to read, summarize, and quote the most relevant papers related to the user query. ☆161 Updated 4 months ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation ☆89 Updated this week
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆362 Updated 7 months ago
- ☆276 Updated 3 weeks ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆260 Updated last month
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆192 Updated 4 months ago