center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆276Updated 2 months ago
Alternatives and similar repositories for transformer-heads:
Users that are interested in transformer-heads are comparing it to the libraries listed below
- Let's build better datasets, together!☆259Updated 4 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆255Updated 9 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 6 months ago
- awesome synthetic (text) datasets☆278Updated 6 months ago
- ☆117Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆198Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆237Updated 11 months ago
- Late Interaction Models Training & Retrieval☆306Updated this week
- Banishing LLM Hallucinations Requires Rethinking Generalization☆273Updated 9 months ago
- code for training & evaluating Contextual Document Embedding models☆183Updated 3 weeks ago
- A bagel, with everything.☆320Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆215Updated 6 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG☆320Updated 6 months ago
- experiments with inference on llama☆104Updated 11 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆202Updated this week
- A set of scripts and notebooks on LLM finetunning and dataset creation☆108Updated 7 months ago
- ☆210Updated 10 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆180Updated 4 months ago
- ☆129Updated 8 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆124Updated last year
- Set of scripts to finetune LLMs☆37Updated last year
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet.☆267Updated 3 weeks ago
- A comprehensive deep dive into the world of tokens☆222Updated 10 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆76Updated 6 months ago
- Generalist and Lightweight Model for Text Classification☆124Updated last week
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- ☆151Updated 5 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆199Updated 9 months ago
- Easily embed, cluster and semantically label text datasets☆530Updated last year