mchl-labs / stambecco
The home of Stambecco 🦌: Italian Instruction-following LLaMA Model
☆20 Updated last year
Related projects:
- Camoscio: An Italian instruction-tuned language model based on LLaMA ☆124 Updated 9 months ago
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome. ☆78 Updated last year
- ☆36 Updated 9 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models ☆177 Updated 4 months ago
- An Open Source Toolkit For LLM Distillation ☆284 Updated last month
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆217 Updated 6 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆237 Updated last week
- ☆75 Updated 3 weeks ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models. ☆170 Updated 5 months ago
- Let's build better datasets, together! ☆195 Updated last month
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients. ☆158 Updated 2 months ago
- A library for easily merging multiple LLM experts and efficiently training the merged LLM. ☆388 Updated 3 weeks ago
- Prune transformer layers ☆60 Updated 3 months ago
- ☆181 Updated 3 months ago
- Domain Adapted Language Modeling Toolkit - E2E RAG ☆295 Updated 3 months ago
- awesome synthetic (text) datasets ☆213 Updated last week
- ☆276 Updated 3 weeks ago
- A framework for few-shot evaluation of autoregressive language models. ☆13 Updated 7 months ago
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind ☆161 Updated last week
- ☆82 Updated 3 weeks ago
- The 🌟ANITA project🌟 *(Advanced Natural-based interaction for the ITAlian language)* wants to provide Italian NLP researchers with an im… ☆14 Updated last week
- Automatically evaluate your LLMs in Google Colab ☆511 Updated 4 months ago
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ☆154 Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆217 Updated 2 months ago
- Set of scripts to finetune LLMs ☆36 Updated 5 months ago
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates ☆423 Updated 4 months ago
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs" ☆155 Updated 2 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆229 Updated 3 months ago
- Spherical Merge PyTorch/HF format Language Models with minimal feature loss. ☆107 Updated last year
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA ☆99 Updated last month