center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆273Updated last month
Alternatives and similar repositories for transformer-heads:
Users that are interested in transformer-heads are comparing it to the libraries listed below
- ☆113Updated last week
- Manage scalable open LLM inference endpoints in Slurm clusters☆254Updated 9 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 5 months ago
- Late Interaction Models Training & Retrieval☆276Updated this week
- awesome synthetic (text) datasets☆267Updated 5 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆417Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆234Updated 10 months ago
- Let's build better datasets, together!☆257Updated 3 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- code for training & evaluating Contextual Document Embedding models☆180Updated 3 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆198Updated 11 months ago
- Automatically evaluate your LLMs in Google Colab☆614Updated 11 months ago
- Easily embed, cluster and semantically label text datasets☆522Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆106Updated 6 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆273Updated 9 months ago
- ☆209Updated 9 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆171Updated 11 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆319Updated 5 months ago
- Generalist and Lightweight Model for Text Classification☆115Updated this week
- Set of scripts to finetune LLMs☆37Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated last month
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆232Updated last month
- ☆129Updated 7 months ago
- ☆199Updated last year
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆193Updated 6 months ago
- ☆117Updated 7 months ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆129Updated 3 months ago
- Official repository for ORPO☆447Updated 10 months ago
- data cleaning and curation for unstructured text☆329Updated 8 months ago