mchl-labs / stambecco
The home of Stambecco ๐ฆ: Italian Instruction-following LLaMA Model
โ20Updated last year
Alternatives and similar repositories for stambecco:
Users that are interested in stambecco are comparing it to the libraries listed below
- Camoscio: An Italian instruction-tuned language model based on LLaMAโ127Updated last year
- Get ready to meet Fauno - the Italian language model crafted by the RSTLess Research Group from the Sapienza University of Rome.โ80Updated last year
- โ37Updated last year
- Toolkit for attaching, training, saving and loading of new heads for transformer modelsโ270Updated 3 weeks ago
- Let's build better datasets, together!โ257Updated 3 months ago
- โ168Updated last year
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.โ460Updated 7 months ago
- minLoRA: a minimal PyTorch library that allows you to apply LoRA to any PyTorch model.โ454Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAGโ317Updated 4 months ago
- awesome synthetic (text) datasetsโ265Updated 5 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'โ233Updated 10 months ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" ๐ฎ๐นโ30Updated 9 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for freeโ230Updated 5 months ago
- [PhD Course] Robust and Reproducible Experimental Deep Learning Settingโ13Updated last month
- Extend existing LLMs way beyond the original training length with constant memory usage, without retrainingโ691Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clustersโ253Updated 8 months ago
- โ42Updated 2 months ago
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answeโฆโ150Updated last year
- Repository for the EM German Modelโ108Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100sโ709Updated last year
- Maybe the new state of the art vision model? we'll see ๐คทโโ๏ธโ161Updated last year
- Late Interaction Models Training & Retrievalโ264Updated last week
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmindโ174Updated 6 months ago
- โ201Updated 10 months ago
- A framework for few-shot evaluation of autoregressive language models.โ13Updated last year
- Resources relating to the DLAI event: https://www.youtube.com/watch?v=eTieetk2dSwโ184Updated last year
- ๐ฆ XโLLM: Cutting Edge & Easy LLM Finetuningโ400Updated last year
- Set of scripts to finetune LLMsโ37Updated last year
- โ120Updated 5 months ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Modelsโ227Updated 11 months ago